đ€ AI Summary
Conventional refrigerantsâparticularly hydrofluorocarbons (HFCs)âare potent greenhouse gases, necessitating environmentally benign alternatives. However, the known candidate molecular space remains extremely limited (~300 compounds), with few newly experimentally validated molecules and severely scarce thermodynamic data, hindering purely data-driven discovery approaches.
Method: We propose Refgen, a knowledge-integrated generative framework that jointly embeds thermodynamic priorsâincluding equation-of-state constraints and full-cycle thermodynamic simulationâinto a reinforcement learning fine-tuning pipeline. It synergistically combines sequence-based molecular modeling, property prediction, and physics-guided generation.
Contribution/Results: Refgen enables physically consistent *de novo* molecular design under data scarcity, transcending conventional refrigerant chemical space boundaries. It significantly improves thermodynamic feasibility and environmental sustainability (e.g., low global warming potential, zero ozone depletion potential) of generated candidates. The framework establishes a scalable, interpretable, and physics-aware generative paradigm for green refrigerant discovery.
đ Abstract
Most refrigerants currently used in air-conditioning systems, such as hydrofluorocarbons, are potent greenhouse gases and are being phased down. Large-scale molecular screening has been applied to the search for alternatives, but in practice only about 300 refrigerants are known, and only a few additional candidates have been suggested without experimental validation. This scarcity of reliable data limits the effectiveness of purely data-driven methods. We present Refgen, a generative pipeline that integrates machine learning with physics-grounded inductive biases. Alongside fine-tuning for valid molecular generation, Refgen incorporates predictive models for critical properties, equations of state, thermochemical polynomials, and full vapor compression cycle simulations. These models enable reinforcement learning fine-tuning under thermodynamic constraints, enforcing consistency and guiding discovery toward molecules that balance efficiency, safety, and environmental impact. By embedding physics into the learning process, Refgen leverages scarce data effectively and enables de novo refrigerant discovery beyond the known set of compounds.