Abstract

ABSTRACT In this paper, a data mining based methodology for process identification from historical data was proposed. Thereon, it considers the phases of process understanding, data collection, data preparation, data modeling, and model evaluation. As some parts of historical data are irrelevant, a data selection step, based on the Gaussian Mixture Model (GMM) clustering algorithm, was considered. Additionally, the methodology includes a data informativity step to study the richness of data. In this regard, the condition number (CN) and the extended CN for ridge regression (RR CN) were used. To evaluate the approach, 2 years of industrial thickener historical data were used. Thereafter, data were prepared and an ARX (Auto-Regressive with eXogenous inputs) model structure was adopted to identify the model. To estimate input delays, Granger causality was used. As for fit criteria, least square regression was tested and compared to ridge regression as a less sensitive method to multicollinearity. The results were then evaluated based on the 20-step ahead prediction and compared to existing methods. In this context, the proposed approach gave the best results with an R 2 of 98.11% and 62.70% for 1 and 20-step ahead predictions, respectively.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.