Random Forest Regression Research Articles

Customer maintenance is of vital importance to enterprise management. Sound assessment and efficient prediction of customer ordering behavior can support better decision-making and significantly reduce business costs. Existing studies on customer behavior regularity segmentation and demand prediction mostly focus on e-commerce and other fields with large amounts of data, which makes them unsuitable for small enterprises, and data features such as sparsity and outliers are not mined when quantifying regularity. Additionally, increasingly complex network structures are being proposed for demand prediction; these build on the assumption that all samples have predictive value and, at high cost, ignore fine-grained analysis of the regularity of different time series. To address these issues, a multi-step regularity assessment and joint prediction system for ordering time series is proposed. For feature extraction, a comprehensive assessment of customer regularity based on the entropy weight method combines predictability quantification (using the K-Means clustering algorithm, real entropy, and the LZW algorithm) with anomaly detection using the Isolation Forest algorithm. This not only gives an objective answer to ‘how high the regularity of customers is’, filling a gap in the field of regularity quantification, but also provides a theoretical basis for selecting demand prediction models. Four prediction models (Random Forest regression, XGBoost, CNN, and LSTM networks) are evaluated with sMAPE and MSLE to verify the effectiveness of the proposed regularity quantification method. Moreover, a merged CNN-BiLSTM neural network model is established for customers with low regularity who are difficult to predict with traditional machine learning algorithms; it performs better on the data set than the other models. Random Forest is still used to predict customers with high regularity because of its high training efficiency. Finally, the intelligent system outputs the results of prediction, regularity quantification, and classification; it can provide a scientific basis for corporate strategy decisions and is highly extensible to other enterprises and fields for follow-up research.
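The abstract names sMAPE and MSLE as its evaluation metrics but does not say which variant of each was used. As a rough illustration only, the sketch below assumes the common definitions: sMAPE as the mean of 2|y - ŷ| / (|y| + |ŷ|), with a term counted as 0 when both values are zero, and MSLE as the mean squared difference of log(1 + y) and log(1 + ŷ). The function names `smape` and `msle` are hypothetical and not taken from the article.

```python
import math


def smape(actual, predicted):
    """Symmetric MAPE as a fraction in [0, 2] (assumed variant).

    Terms where both the actual and predicted values are zero count as 0,
    which matters for sparse ordering series with many zero-demand periods.
    """
    total = 0.0
    for a, p in zip(actual, predicted):
        denom = abs(a) + abs(p)
        if denom > 0:
            total += 2.0 * abs(a - p) / denom
    return total / len(actual)


def msle(actual, predicted):
    """Mean squared logarithmic error; inputs must be non-negative."""
    squared = [(math.log1p(a) - math.log1p(p)) ** 2
               for a, p in zip(actual, predicted)]
    return sum(squared) / len(squared)


# Hypothetical order quantities for one customer, purely illustrative.
orders = [0, 4, 10, 0]
forecast = [0, 5, 8, 1]
print(smape(orders, forecast), msle(orders, forecast))
```

Both metrics are relative rather than absolute, so a miss of one unit on a small order is penalized more heavily than the same miss on a large one.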


Purpose: This study evaluates the effectiveness of Random Forest (RF) compared with Classification and Regression Trees (CART) in predicting hotel star ratings. The objective is to identify the algorithm that provides the most reliable and accurate classification outcomes based on diverse hotel attributes, in accordance with the standard categorization of star hotels. This matters because accurate star ratings play an important role in guiding consumer choices and enhancing competitive positioning in the hospitality industry. Method: This study compiled a comprehensive dataset of hotels in Banyumas Regency, including location, facilities, room size, room type, room price, and customer reviews, and used it to train both the RF and CART algorithms. Both algorithms are evaluated using accuracy, precision, recall, and F1 score. Additionally, both algorithms underwent the same preprocessing, and hyperparameter tuning was performed to improve the efficacy of each model. Result: RF achieved better overall accuracy and robustness than CART across all tests conducted. RF also outperformed CART in per-class classification effectiveness, with higher precision and recall scores across multiple star rating categories, signifying better generalization and consistency in classification tasks. The RF classifier consistently surpassed the CART classifier in both accuracy and F1 score across all random states and test sizes, with a highest score of 0.9932 at a random state of 100 and a test size of 0.4. The most reliable results were obtained using RF with a random state of 42 and a test size of 0.2, resulting in an accuracy of 0.9909, precision of 1.0, recall of 1.0, and F1 score of 1.0. Meanwhile, CART shows values of 0.9818, 1.0, 1.0, and 1.0, respectively, for the same variation. This consistent performance, regardless of fluctuations, illustrates the robustness and suitability of RF for classification tasks compared to CART. Novelty: This study offers new insights into the application of machine learning to hotel star rating prediction using the RF and CART algorithms. The hotel dataset collected for this study is also novel. A detailed comparative analysis is provided as well, contributing to the existing literature by showing the effectiveness of RF over CART for this specific application. Future studies could explore the integration of additional machine learning methods to further enhance prediction accuracy and operational efficiency in the hospitality industry.
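The four scores reported above (accuracy, precision, recall, F1) can be computed from true and predicted star ratings roughly as follows. This is a sketch under stated assumptions, not the authors' evaluation code: the abstract does not say how per-class scores were averaged, so macro averaging is assumed here, and the function name `macro_metrics` and the sample labels are hypothetical.

```python
def macro_metrics(y_true, y_pred):
    """Accuracy plus macro-averaged precision, recall and F1 (assumed averaging)."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    accuracy = sum(t == p for t, p in pairs) / len(pairs)

    precisions, recalls, f1s = [], [], []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        # A class that is never predicted (or never present) scores 0 here.
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Hypothetical star ratings for five hotels, purely illustrative.
true_stars = [3, 3, 4, 4, 5]
predicted_stars = [3, 4, 4, 4, 5]
print(macro_metrics(true_stars, predicted_stars))
```

Macro averaging weights every star category equally, so a rare category that is classified poorly pulls the score down as much as a common one would.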



Articles published on Random Forest Regression

Application of machine learning for predictions of consecutive dependent data of type {[(a, b)->c]->d}

Predicting and Monitoring Anxiety and Depression: Advanced Machine Learning Techniques for Mental Health Analysis

Analysis of Lung Disease Prediction using Machine Learning Algorithms

Prediction of municipal waste generation using multi-expression programming for circular economy: a data-driven approach.

UAV-Based Multispectral Winter Wheat Growth Monitoring with Adaptive Weight Allocation

Characterisation and prediction of mechanical properties in laser powder bed fusion-printed parts: a comparative analysis using machine learning

Modeling rate of penetration using hybridization of Artificial Neural Network and Artificial Fish Swarm algorithm

A multi-step regularity assessment and joint prediction system for ordering time series based on entropy and deep learning

Predictive Modeling of the Hydrate Formation Temperature in Highly Pressurized Natural Gas Pipelines

Machine Learning Models for Predicting Significant Liver Fibrosis in Patients with Severe Obesity and Nonalcoholic Fatty Liver Disease.

Improvement of Saline–Alkali Soil and Straw Degradation Efficiency in Cold and Arid Areas Using Klebsiella sp. and Pseudomonas sp.

Metasurface-enabled multifunctional single-frequency sensors without external power

FEM-driven machine learning approach for characterizing stress magnitude, peak temperature and weld zone deformation in ultrasonic welding of metallic multilayers: application to battery cells

Development of a quantitative prediction algorithm for human cord blood-derived CD34+ hematopoietic stem-progenitor cells using parametric and non-parametric machine learning models

Early estimation of glutelin to gliadin ratio in wheat grain using high-dimensional and hyperspectral reflectance

Enhanced Crop Leaf Area Index Estimation via Random Forest Regression: Bayesian Optimization and Feature Selection Approach

A prototype early warning system for diarrhoeal disease to combat health threats of climate change in the asia-pacific region

Optimising Neural Networks for Enhanced Fracture Density Prediction in Surrounding Rock of Coalbed Methane Reservoir

Performance Comparison of Random Forest (RF) and Classification and Regression Trees (CART) for Hotel Star Rating Prediction

Unmasking Neuroendocrine Prostate Cancer with a Machine Learning-Driven Seven-Gene Stemness Signature That Predicts Progression.

