This research concentrates on the application of machine learning techniques to predict combustion, performance, and emission parameters in a dual-fuel spark ignition (SI) engine powered by neat gasoline and E20 ethanol dual fuel. The goal is to overcome the limitations posed by repeated engine experiments and nonlinear test results. In order to optimise engine parameters, the research seeks to develop efficient machine learning models with high generalizability and employ an optimisation strategy to determine the optimal engine settings. Input for training and evaluating machine learning algorithms, such as Artificial Neural Networks (ANN), and Ensemble LS Boosting was derived from experimental data from a combustion test engine, which includes Neat gasoline, and Ethanol dual fuel blend E20 at various load conditions. The dataset includes engine combustion, performance, and emission indices such as brake thermal efficiency (BTE), Exhaust Gas Temperature (EGT), Hydrocarbons (HC), Carbon monoxide (CO), Carbon dioxide (CO2), and Nitrogen oxides (NOx), under various operating conditions. Load and brake-specific fuel consumption (BSFC) were training input attributes. Using a comprehensive experimental database of input-output engine parameters, the Artificial Neural Network (ANN) and Ensemble LS Boosting were constructed. The training data points were resampled to generate multiple training datasets for training different models. 50 test samples were used to evaluate the generalisation capability of the machine learning models, while BTE, EGT, CO, CO2, HC and NOx, were the primary parameters subject to prediction. The optimal machine learning method was determined by comparing R-squared (R2) values, root mean square error (RMSE), mean square error (MSE), and mean absolute error (MAE). Using multiple hyperparameter tuning iterations, the agreement between actual and predicted values for diverse Ensemble LS Boost algorithms was evaluated. The Ensemble LS Boost model exhibits the maximum level of agreement between predicted and experimental engine parameters across all datasets when compared to the other ANN models. This finding was corroborated by additional research based on test datasets, specifically the test sample interpolation data, which measures generalisation ability. The study also focuses on developing and applying two unique, interactive Simulink models for the Spark Ignition (SI) engine that are tailored for Neat Gasoline and Ethanol E20 test fuels under all loads. The key component of the model-based development technique in MATLAB and Simulink was the incorporation of sophisticated machine learning algorithms, i.e., Ensemble Least-Squares (LS) Boosting, to the model-based development workflow which produced reliable results. Implementing an Ensemble LS Boost machine learning framework is therefore highly recommended as an efficient method for predicting and optimising the combustion, performance, and emission characteristics of dual-fuel gasoline engines utilising Ethanol-based dual-fuel blends.