Permutation Importance Research Articles

The transportation sector accounts for 61.5% of global oil consumption and is responsible for 29% of the world’s total energy demand. Passenger transportation uses around 50%–60% of the energy consumed by transportation-related activities. Accurate prediction of future transportation energy consumption helps governments make well-informed decisions about transportation infrastructure development and utilization, which supports the United Nations’ Sustainable Development Goals (SDGs) and advances the shift to a net-zero carbon economy. With population, vehicle numbers, and economic activity all expected to grow, predicting energy demand is essential for sustainable urban transportation, both for economic prosperity and for promoting human health and mitigating carbon emissions. This study proposes a novel methodology that applies a machine learning stacking ensemble method with hyperparameter tuning and multicollinearity removal to predict transportation energy demand in Turkey based on historical data from 1975–2019. The dataset includes GDP, year, vehicle miles traveled, population, oil price, passenger miles traveled, and ton-miles traveled as features. A performance evaluation and comparison of 19 machine learning algorithms, including the eXtreme Gradient Boosting algorithm, is first carried out to find the best candidates for the stacking ensemble models. The comparison is run once with all features and once with only two of them during the training phase, using 4-fold cross-validation. A combination of permutation importance and a hierarchical clustering algorithm on the Spearman rank-order correlations is used for dimensionality reduction of the dataset.
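The dimensionality-reduction step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the data are synthetic, and the random forest model, the Ward linkage, the 0.5 distance threshold, and the rule of keeping the first feature of each cluster are all assumptions made for the example.

```python
import numpy as np
from scipy.cluster import hierarchy
from scipy.spatial.distance import squareform
from scipy.stats import spearmanr
from sklearn.ensemble import RandomForestRegressor
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 200
# Synthetic stand-in data (NOT the study's dataset): three independent
# signals plus a near-copy of the first one to create multicollinearity.
base = rng.normal(size=(n, 3))
X = np.column_stack([base, base[:, 0] + 0.01 * rng.normal(size=n)])
y = 3.0 * base[:, 0] + base[:, 1] + 0.1 * rng.normal(size=n)

# 1. Hierarchical clustering on the Spearman rank-order correlations.
corr = spearmanr(X).correlation
corr = (corr + corr.T) / 2           # enforce exact symmetry
np.fill_diagonal(corr, 1.0)
distance = 1.0 - np.abs(corr)        # strongly correlated -> small distance
linkage = hierarchy.ward(squareform(distance, checks=False))
cluster_ids = hierarchy.fcluster(linkage, t=0.5, criterion="distance")

# 2. Keep one representative feature per cluster (assumed rule: the first).
selected = sorted(
    int(np.flatnonzero(cluster_ids == c)[0]) for c in np.unique(cluster_ids)
)

# 3. Permutation importance of the retained features on held-out data.
X_train, X_test, y_train, y_test = train_test_split(
    X[:, selected], y, test_size=0.25, random_state=0
)
model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
result = permutation_importance(
    model, X_test, y_test, n_repeats=10, random_state=0
)
for idx, mean in zip(selected, result.importances_mean):
    print(f"feature {idx}: mean importance {mean:.3f}")
```

Clustering before computing importances matters because permuting one of two nearly identical features leaves the model free to rely on the other, which can make both look unimportant.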
Extra Trees Regressor and AdaBoost Regressor are proposed as the two meta-regressors, placed at the second level of the stacking ensembles, because they perform better than a single machine learning algorithm. In total, eight stacking ensemble models (four for each meta-regressor) were developed and investigated, considering all features and only two of them separately. Six metrics (R-squared, MSE, MAE, RMSE, RMSLE, and MAPE) are used to assess all models. The best proposed stacking ensemble model for predicting transportation energy demand uses the Extra Trees Regressor as its meta-regressor. This model achieves an R-squared value of approximately 0.99 when all the features are taken into consideration. When only two features from the dataset are considered, the same stacking ensemble model can achieve an accuracy of 0.98. These findings have the potential to contribute to more accurate models, which can in turn lead to improved strategies for managing future transportation energy demand. Additionally, this research can support the advancement of alternative technologies that promote sustainable urban development, ultimately helping to move towards a net-zero carbon economy.
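A two-level stacking ensemble with an Extra Trees meta-regressor, scored with the six metrics listed above, might look like the sketch below. It uses scikit-learn's `StackingRegressor`; the base learners (Ridge, random forest, gradient boosting), the synthetic 45-row dataset, and the random train/test split are assumptions for illustration and do not reproduce the study's configuration.

```python
import numpy as np
from sklearn.ensemble import (ExtraTreesRegressor, GradientBoostingRegressor,
                              RandomForestRegressor, StackingRegressor)
from sklearn.linear_model import Ridge
from sklearn.metrics import (mean_absolute_error,
                             mean_absolute_percentage_error,
                             mean_squared_error, mean_squared_log_error,
                             r2_score)
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
# Synthetic stand-in: 45 rows (one per year in 1975-2019) and 7 features.
X = rng.uniform(size=(45, 7))
y = 50.0 + 20.0 * X[:, 0] + 10.0 * X[:, 1] + rng.normal(scale=0.5, size=45)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Level 1: base learners (hypothetical choice for this sketch).
# Level 2: Extra Trees meta-regressor trained on 4-fold out-of-fold predictions.
stack = StackingRegressor(
    estimators=[
        ("ridge", Ridge()),
        ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
        ("gbr", GradientBoostingRegressor(random_state=0)),
    ],
    final_estimator=ExtraTreesRegressor(n_estimators=50, random_state=0),
    cv=4,
)
stack.fit(X_train, y_train)
pred = stack.predict(X_test)

mse = mean_squared_error(y_test, pred)
metrics = {
    "R2": r2_score(y_test, pred),
    "MSE": mse,
    "MAE": mean_absolute_error(y_test, pred),
    "RMSE": np.sqrt(mse),
    # RMSLE requires non-negative targets and predictions.
    "RMSLE": np.sqrt(mean_squared_log_error(y_test, pred)),
    "MAPE": mean_absolute_percentage_error(y_test, pred),
}
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

With annual time-series data, a chronological split is often preferable to the random split used here, since shuffling years can leak future information into training.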


Runoff prediction provides a scientific basis for flood control, disaster reduction, and water resources planning. Because runoff prediction involves many uncertainties, precise predictions are difficult to make. To improve accuracy, this study combines Long Short-Term Memory (LSTM) and Light Gradient Boosting Machine (LightGBM) through the reciprocal error method to develop an integrated data-driven model (LSTM-LightGBM) for runoff prediction. To demonstrate its applicability, the model is applied to annual runoff prediction at the Caiqi hydrological monitoring station on the Shiyang River, in an arid area. Four indicators, Error of Peak (EP), Nash-Sutcliffe Efficiency (NSE), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE), are adopted to evaluate the prediction performance of the LSTM, LightGBM, and LSTM-LightGBM methods under the same hyperparameter combinations. The interpretability of the LSTM and LightGBM models is then explored using the permutation importance method and Shapley Additive exPlanations (SHAP) values, respectively. Finally, future annual runoff at the Caiqi station for the next 50 years (2025–2075) is predicted with the LSTM-LightGBM model under 12 climate scenarios. The results show that: (1) the integrated model (LSTM-LightGBM) outperforms the two single models in NSE (0.92), RMSE (0.075 million m³), MAE (0.046 million m³), and EP value (i.e., in bridging the peak-valley runoff); (2) in this case, the interpretability analysis finds that four feature variables have the greatest influence on the target variables; (3) the 12 combined climate scenarios used in this investigation produce generally steady predictions. The scenarios with the highest and lowest mean values are GFDL RCP 6.0 (3.12 × 10⁸ m³) and IPSL RCP 2.6 (3.04 × 10⁸ m³), respectively, a decrease of 24.09% and 26.03% compared with the mean annual runoff of 4.11 × 10⁸ m³ in the baseline period (1955–2017). These findings can provide a scientific basis for future water resources planning in the downstream part of the Shiyang River Basin.
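As a rough illustration of how two models' outputs can be merged with a reciprocal-error weighting and then scored with NSE, consider the sketch below. The abstract does not say which error measure drives the weights, so MAE is an assumption here, and the observed series and the two prediction series are made-up numbers standing in for LSTM and LightGBM outputs.

```python
import numpy as np


def nse(observed, simulated):
    """Nash-Sutcliffe Efficiency: 1 minus the ratio of squared model
    error to the variance of the observations around their mean."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    residual = np.sum((observed - simulated) ** 2)
    spread = np.sum((observed - observed.mean()) ** 2)
    return 1.0 - residual / spread


def reciprocal_error_weights(errors):
    """Weight each model by the inverse of its error, normalized to sum to 1,
    so the model with the smaller error receives the larger weight."""
    inverse = 1.0 / np.asarray(errors, dtype=float)
    return inverse / inverse.sum()


# Made-up values for illustration only (not data from the study).
observed = np.array([4.0, 3.5, 4.2, 3.8, 4.5])
pred_a = observed + np.array([0.30, -0.20, 0.25, -0.30, 0.20])   # stand-in for LSTM
pred_b = observed + np.array([-0.10, 0.15, -0.10, 0.10, -0.15])  # stand-in for LightGBM

# Assumed error measure: mean absolute error of each single model.
mae = [np.mean(np.abs(observed - p)) for p in (pred_a, pred_b)]
weights = reciprocal_error_weights(mae)
combined = weights[0] * pred_a + weights[1] * pred_b

print("weights:", weights)
print("NSE a, b, combined:",
      nse(observed, pred_a), nse(observed, pred_b), nse(observed, combined))
```

In practice the weights would be estimated on a validation period rather than on the same data used for scoring; the in-sample version here only keeps the example short.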


Articles published on Permutation Importance

Robot Ground Media Classification Based on Hilbert–Huang Transform and Attention-Based Spatiotemporal Coupled Network

Constructing a Model of Poplus spp. Growth Rate Based on the Model Fusion and Analysis of Its Growth Rate Differences and Distribution Characteristics under Different Classes of Environmental Indicators

Investigating the mental health of university students during the COVID-19 pandemic in a UK university: a machine learning approach using feature permutation importance

Fast grading method based on data driven capacity prediction for high-efficient lithium-ion battery manufacturing

Mid-Infrared Variable Selection for Soil Organic Matter Fractions Based on Soil Model Systems and Permutation Importance Algorithm.

An extended car-following model considering backward-looking effect: A machine learning approach

Unravelling the effects of dynamic urban thermal environment on utility-scale floating photovoltaic electricity generation

Predicting Suitable Habitats for China’s Endangered Plant Handeliodendron bodinieri (H. Lév.) Rehder

Exploring the variable importance in random forests under correlations: a general concept applied to donor organ quality in post-transplant survival

Can local explanation techniques explain linear additive models?

Machine learning model for predicting late recurrence of atrial fibrillation after catheter ablation

Interpretable high-stakes decision support system for credit default forecasting

Hybrid machine learning model for hourly ozone concentrations prediction and exposure risk assessment

Skin cancer classification using explainable artificial intelligence on pre-extracted image features

Coarse Aggregate Shape Classification Method Based on Per-Optuna-LightGBM Model

The relationship and predictive value of dementia and frailty for mortality in patients with surgically managed hip fractures

Fracture toughness evaluation of ground granulated blast furnace slag concrete using experimental study and machine learning techniques

Prediction of transportation energy demand in Türkiye using stacking ensemble models: Methodology and comparative analysis

Application, interpretability and prediction of machine learning method combined with LSTM and LightGBM-a case study for runoff simulation in an arid area

Research on predicting the diffusion of toxic heavy gas sulfur dioxide by applying a hybrid deep learning model to real case data
