The design of high-performance fuels is crucial for achieving clean and efficient combustion of engines. In the current study, a framework for machine learning (ML) model-based fuel design is presented to identify compounds with desired properties for internal combustion engine (ICE) applications. Descriptors computed from newly proposed structural and positional-based group extraction method in this study are used as input in ML models due to the simplicity and computational-efficiency. The ML models were trained and validated using publically available experimental data for up to 1135 compounds. The results demonstrated a high level of predictive accuracy, with R2 values generally exceeding 0.99 for 11 physicochemical properties including melting point, boiling point, enthalpy of vaporization, surface tension, dynamic viscosity, low heating value, liquid density, yield sooting index, cetane number, research octane number and motor octane number. Furthermore, by employing the developed ML models to predict new data, a Fuel Property Database containing 1135 fuels with 12 fuel properties is established, enhanced by an interface for a user-friendly Fuel Physicochemical Properties Prediction Tool that facilitates swift property predictions and database expansion. The Pearson correlation coefficient (PCC) approach is subsequently employed to assess the correlation between descriptors and predicted or experimental properties. The strong agreement between the correlations affirms the effective predictive performance of the ML models within the calibrated data range. Additionally, a PCC analysis is conducted to reveal the individual effects of influential factors and their interactions, highlighting significant features like molecular weight size, degree of branching, and functional group content that influence physical and chemical properties, thereby providing valuable insights for fuel design and optimization. Finally, a initial comprehensive data-driven screening was carried out to identify potential fuel candidates that meet the key property limits for combustion applications in both spark-ignition (SI) and compression-ignition (CI) engines.
Read full abstract