Big Data as a Tool for Building a Predictive Model of Mill Roll Wear

Natalia Vasilyeva,Elmira Fedorova,Alexandr Kolesnikov

doi:10.3390/sym13050859

Natalia Vasilyeva, Elmira Fedorova + Show 1 more

Open Access

https://doi.org/10.3390/sym13050859

Copy DOI

Abstract

Big data analysis is becoming a daily task for companies all over the world as well as for Russian companies. With advances in technology and reduced storage costs, companies today can collect and store large amounts of heterogeneous data. The important step of extracting knowledge and value from such data is a challenge that will ultimately be faced by all companies seeking to maintain their competitiveness and place in the market. An approach to the study of metallurgical processes using the analysis of a large array of operational control data is considered. Using the example of steel rolling production, the development of a predictive model based on processing a large array of operational control data is considered. The aim of the work is to develop a predictive model of rolling mill roll wear based on a large array of operational control data containing information about the time of filling and unloading of rolls, rolled assortment, roll material, and time during which the roll is in operation. Preliminary preparation of data for modeling was carried out, which includes the removal of outliers, uncharacteristic and random measurement results (misses), as well as data gaps. Correlation analysis of the data showed that the dimensions and grades of rolled steel sheets, as well as the material from which the rolls are made, have the greatest influence on the wear of rolling mill rolls. Based on the processing of a large array of operational control data, various predictive models of the technological process were designed. The adequacy of the models was assessed by the value of the mean square error (MSE), the coefficient of determination (R2), and the value of the Pearson correlation coefficient (R) between the calculated and experimental values of the mill roll wear. In addition, the adequacy of the models was assessed by the symmetry of the values predicted by the model relative to the straight line Ypredicted = Yactual. Linear models constructed using the least squares method and cross-validation turned out to be inadequate (the coefficient of determination R2 does not exceed 0.3) to the research object. The following regressions were built on the basis of the same operational control database: Linear Regression multivariate, Lasso multivariate, Ridge multivariate, and ElasticNet multivariate. However, these models also turned out to be inadequate to the object of the research. Testing these models for symmetry showed that, in all cases, there is an underestimation of the predicted values. Models using algorithm composition have also been built. The methods of random forest and gradient boosting are considered. Both methods were found to be adequate for the object of the research (for the random forest model, the coefficient of determination is R2 = 0.798; for the gradient boosting model, the coefficient of determination is R2 = 0.847). However, the gradient boosting algorithm is recognized as preferable thanks to its high accuracy compared with the random forest algorithm. Control data for symmetry in reference to the straight line Ypredicted = Yactual showed that, in the case of developing the random forest model, there is a tendency to underestimate the predicted values (the calculated values are located below the straight line). In the case of developing a gradient boosting model, the predicted values are located symmetrically regarding the straight line Ypredicted = Yactual. Therefore, the gradient boosting model is preferred. The predictive model of mill roll wear will allow rational use of rolls in terms of minimizing overall roll wear. Thus, the proposed model will make it possible to redistribute the existing work rolls between the stands in order to reduce the total wear of the rolls.

Highlights

The metallurgical industry is one of the leading sectors of the Russian economy
Compositional models that are resistant to overtraining, noise, and outliers show themselves in the best way
The aim of the work is to develop a predictive model of rolling mill roll wear based on a large array of operational control data containing information about the time of filling and unloading of rolls, rolled assortment, roll material, and the time during which the roll is in operation

Summary

Introduction

The metallurgical industry is one of the leading sectors of the Russian economy. The products manufactured by this industry are used in construction, mechanical engineering, the chemical industry, and many other industries [1,2,3]. The relevance of the work is thanks to the fact that the construction of linear and multidimensional regression models based on a large data set does not provide a highquality result, as it does not allow taking into account complex and multi-connected dependencies between the input variables. In this case, compositional models that are resistant to overtraining, noise, and outliers show themselves in the best way. The proposed model will make it possible to redistribute the existing work rolls between the stands in order to reduce the total wear of the rolls

Theoretical Basis

Object and Problem Statement

Fragment

Algorithm

Using Big Data to Develop Linear Predictive Models

Assessment of the Model Quality

Comparison

Findings

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Symmetry	Publication Date: May 12, 2021
Citations: 41	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Big Data as a Tool for Building a Predictive Model of Mill Roll Wear

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Symmetry

Lead the way for us

Similar Papers

Predicting stand attributes of loblolly pine in West Gulf Coastal Plain using gradient boosting and random forests
X.W Lou ... Y.H Weng
Canadian journal of forest research. Journal canadien de la recherche forestiere | VOL. 51
X.W Lou, et. al.X.W Lou ... Y.H Weng
17 Nov 2020
Canadian journal of forest research. Journal canadien de la recherche forestiere | VOL. 51

Modeling and Mapping of Forest Fire Occurrence in the Lower Silesian Voivodeship of Poland Based on Machine Learning Methods
Slobodan Milanović ... Jan Kaczmarowski
Forests | VOL. 14
Slobodan Milanović, et. al.Slobodan Milanović ... Jan Kaczmarowski
26 Dec 2022
Forests | VOL. 14

Ultra-High-Cycle Fatigue Life Prediction of Metallic Materials Based on Machine Learning
Xuze Zhang ... Min Shen
Applied sciences | VOL. 13
Xuze Zhang, et. al.Xuze Zhang ... Min Shen
15 Feb 2023
Applied sciences | VOL. 13

ANALYSIS OF PARAMETERS AND MODELING OF WEAR ROLLS OF HIGH-SPEED WIRE BLOCKS
V Samokhval ... Ye Stasko
Collection of scholarly papers of Dniprovsk State Technical University (Technical Sciences) | VOL. 1
V Samokhval, et. al.V Samokhval ... Ye Stasko
16 Jun 2022
Collection of scholarly papers of Dniprovsk State Technical University (Technical Sciences) | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Big Data as a Tool for Building a Predictive Model of Mill Roll Wear

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Symmetry