FIGAN: A Missing Industrial Data Imputation Method Customized for Soft Sensor Application

  • Abstract
  • References
  • Citations
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

Missing data is quite common in the industrial field, resulting in problems in downstream applications, as most data driven methods used in these applications rely on complete and high-quality dataset to build a high-quality model. Existing methods deal with missing data individually regardless of its downstream application, treating all variables equally without considering their different roles in the downstream application. This would affect imputation performance for key variables, thus deteriorating the accuracy of the downstream model. A considerable challenge is how to refine the missing data imputation task. In this paper, a new method termed fine-tuned imputation GAN (FIGAN) is designed to achieve customized data imputation for industrial soft sensor. The major contribution of the paper lies in two aspects: 1) different from the original imputation GAN (GAIN) which treats all variables equally, FIGAN is guided by a soft sensor module so as to achieve customized data imputation by performing improved data imputation on quality-related variables. Enhanced accuracy for the final industrial soft sensor would be possible; 2) in addition, since labels of the soft sensor might also have missing data, a soft sensor with pseudo labeling is designed to conquer the problem with data imputation and label prediction being optimized interactively. Case studies on a converter steelmaking process and a penicillin fermentation process show the feasibility of the proposed FIGAN. It is noted that such customized imputation could be readily transferred to other downstream applications with missing data. Note to Practitioners—Industrial data is often incomplete and needs proper treatment. Meanwhile, downstream applications with preprocessed data vary under different industrial scenes. The focus of this study is to develop a customized data imputation method for specific downstream applications such as soft sensing. A fine-tuned imputation GAN is designed with a soft sensor module so as to guide the data imputation for good imputation on key variables of the soft sensor. Considering that missing data exists not only in measurement variables but also in soft sensor labels, a semi-supervised soft sensor is designed to handle missing data in the labels, optimized together with the imputation model. The customized data imputation can thus improve the final performance of the downstream model which is a soft sensor in this work. The customization could be transferred to other applications such as an anomaly detection model as well.

ReferencesShowing 10 of 34 papers
  • Cite Count Icon 45
  • 10.1016/j.conengprac.2020.104392
Mode-cloud data analytics based transfer learning for soft sensor of manufacturing industry with incremental learning ability
  • Mar 30, 2020
  • Control Engineering Practice
  • Jie Wang + 1 more

  • Open Access Icon
  • Cite Count Icon 213
  • 10.1109/tbme.2018.2874712
Estimating Missing Data in Temporal Data Streams Using Multi-Directional Recurrent Neural Networks.
  • Oct 8, 2018
  • IEEE Transactions on Biomedical Engineering
  • Jinsung Yoon + 2 more

  • Open Access Icon
  • PDF Download Icon
  • Cite Count Icon 8
  • 10.3390/ijerph16122093
The Potential of Remedial Techniques for Hazard Reduction of Steel Process by Products: Impact on Steel Processing, Waste Management, the Environment and Risk to Human Health
  • Jun 1, 2019
  • International Journal of Environmental Research and Public Health
  • Kiri Rodgers + 4 more

  • Open Access Icon
  • Cite Count Icon 93
  • 10.1016/j.ifacol.2018.09.406
Variational Autoencoders for Missing Data Imputation with Application to a Simulated Milling Circuit
  • Jan 1, 2018
  • IFAC-PapersOnLine
  • John T Mccoy + 2 more

  • Open Access Icon
  • Cite Count Icon 6389
  • 10.1145/1390156.1390294
Extracting and composing robust features with denoising autoencoders
  • Jan 1, 2008
  • Pascal Vincent + 3 more

  • Cite Count Icon 48
  • 10.1109/jsen.2020.3024976
KNN-ST: Exploiting Spatio-Temporal Correlation for Missing Data Inference in Environmental Crowd Sensing
  • Sep 23, 2020
  • IEEE Sensors Journal
  • Ningrinla Marchang + 1 more

  • Cite Count Icon 177
  • 10.1109/tcst.2019.2897946
Robust Monitoring and Fault Isolation of Nonlinear Industrial Processes Using Denoising Autoencoder and Elastic Net
  • Mar 11, 2019
  • IEEE Transactions on Control Systems Technology
  • Wanke Yu + 1 more

  • Cite Count Icon 1292
  • 10.2200/s00196ed1v01y200906aim006
Introduction to Semi-Supervised Learning
  • Jan 1, 2009
  • Synthesis Lectures on Artificial Intelligence and Machine Learning
  • Xiaojin Zhu + 1 more

  • Cite Count Icon 34
  • 10.1109/tim.2020.3037953
Multichannel Diffusion Graph Convolutional Network for the Prediction of Endpoint Composition in the Converter Steelmaking Process
  • Nov 13, 2020
  • IEEE Transactions on Instrumentation and Measurement
  • Liangjun Feng + 5 more

  • Cite Count Icon 96
  • 10.2307/2282773
Missing Observations in Multivariate Statistics: I. Review of the Literature
  • Sep 1, 1966
  • Journal of the American Statistical Association
  • A A Afifi + 1 more

CitationsShowing 10 of 63 papers
  • Research Article
  • 10.1016/j.jprocont.2025.103373
Mutual information and attention-based variable selection for soft sensing of industrial processes
  • Feb 1, 2025
  • Journal of Process Control
  • Zhenhua Yu + 4 more

Mutual information and attention-based variable selection for soft sensing of industrial processes

  • Conference Article
  • 10.1109/ddcls55054.2022.9858447
Spatio-Temporal View GAIN for Data Imputation and Dynamic Soft Sensor
  • Aug 3, 2022
  • Jiayi Ren + 2 more

Soft sensor plays a key role in the safe operation of industrial processes and product quality control. Affected by closed-loop feedback, process data often demonstrate certain dynamic characteristics. In addition, traditional soft sensor methods are often based on the assumption of completeness. Once part of the data is missing, the above methods will collapse. In this paper, we propose a Spatio-Temporal view Generative Adversarial Imputation Network (GAIN) for data imputation and apply dynamic soft sensor for block missing accompanied with completely random missing data. Initially, a data pre-imputed strategy is designed to first impute the missing data through GAIN from the perspective of data distribution. Secondly, on the basis of pre-imputation, a data imputation strategy that integrates time and space information is proposed to impute the data precisely, which can comprehensively consider the time series of process data and the correlation between variables, and further improve the accuracy of data imputation. Finally, we propose a dynamic soft sensor method to make real-time predictions of the data. The effectiveness of the proposed method is verified by the Tennessee Eastman (TE) process.

  • Research Article
  • 10.1109/tase.2024.3472051
Cross-Modality Manifold Adaptive Network for Industrial Multimode Processes and Its Applications
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Xiao-Lu Song + 4 more

Cross-Modality Manifold Adaptive Network for Industrial Multimode Processes and Its Applications

  • Research Article
  • Cite Count Icon 18
  • 10.1007/s00521-023-08840-2
A systematic review of generative adversarial imputation network in missing data imputation
  • Jul 21, 2023
  • Neural Computing and Applications
  • Yuqing Zhang + 2 more

A systematic review of generative adversarial imputation network in missing data imputation

  • Conference Article
  • 10.23919/ccc63176.2024.10662234
Deep Residual Feature Generating Network: A Novel Method for Few-Shot Fault Diagnosis of Chillers
  • Jul 28, 2024
  • Kexin Jiang + 4 more

Deep Residual Feature Generating Network: A Novel Method for Few-Shot Fault Diagnosis of Chillers

  • Research Article
  • 10.1109/tim.2025.3574908
M2-STGN: A Missing Data-Tolerant Learning Model for Condition Monitoring of Wind Turbines
  • Jan 1, 2025
  • IEEE Transactions on Instrumentation and Measurement
  • Jian Xu + 3 more

M<sup>2</sup>-STGN: A Missing Data-Tolerant Learning Model for Condition Monitoring of Wind Turbines

  • Research Article
  • Cite Count Icon 1
  • 10.1109/jas.2024.124902
From Static and Dynamic Perspectives: A Survey on Historical Data Benchmarks of Control Performance Monitoring
  • Feb 1, 2025
  • IEEE/CAA Journal of Automatica Sinica
  • Pengyu Song + 3 more

From Static and Dynamic Perspectives: A Survey on Historical Data Benchmarks of Control Performance Monitoring

  • Open Access Icon
  • Research Article
  • Cite Count Icon 4
  • 10.1088/1361-6501/ad4384
Two-dimensional LSTM soft sensor using noisy process data
  • May 7, 2024
  • Measurement Science and Technology
  • Qiao Liu + 5 more

Establishing a reliable data-based soft sensor still faces a series of challenges, particularly the presence of outliers and different kinds of noise which are non-negligible in process data. To address these challenges, a correntropy-based two-dimensional long short-term memory (TLSTM) model is developed to handle noisy process data. First, the multidimensional time series samples are reconstructed into numerous two-dimensional input matrices in the feature and time directions. Then, the convolution and pooling operations are used to extract useful information in the process variables related to the quality variable. Meanwhile, a gating mechanism is employed to learn the internal representation of time series. Finally, a correntropy-based strategy is utilized to assign relatively small weights to outliers automatically, enabling reliable prediction. Two cases illustrate the reliability and advantages of TLSTM in effectively extracting quality-related features for prediction.

  • Research Article
  • Cite Count Icon 22
  • 10.1109/tii.2023.3272690
Quality Regularization-Based Semisupervised Adversarial Transfer Model With Unlabeled Data for Industrial Soft Sensing
  • Feb 1, 2024
  • IEEE Transactions on Industrial Informatics
  • Yan-Lin He + 2 more

Traditional soft sensors typically rely only on labeled data to predict key variables, despite the significant amount of unlabeled data that could provide valuable information. To solve this problem, a quality regularization based semisupervised adversarial transfer model (QR-SATM) is proposed. The idea of transfer learning is used in QR-SATM. QR-SATM comprises a pre-training model and a regression model. The pre-training model is an unsupervised model. And the regression model is a supervised model with a similar structure to the pre-training model, allowing for easy transfer between the two models. Firstly, the pre-training model is trained with unlabeled data to extract features. Then the trained parameters of pre-training model are transferred to the regression model, and the regression model is fine-tuned with labeled data. During fine-tuning the regression model, an improved quality regularization is introduced in order to select useful features and prevent overfitting. QR-SATM is validated by a real industrial dataset of purified terephthalic acid (PTA). The experimental results show the effectiveness of the proposed QR-SATM in accurately predicting key variables.

  • Research Article
  • Cite Count Icon 1
  • 10.1016/j.jprocont.2024.103297
A survey and experimental study for embedding-aware generative models: Features, models, and any-shot scenarios
  • Sep 21, 2024
  • Journal of Process Control
  • Jiaqi Yue + 3 more

A survey and experimental study for embedding-aware generative models: Features, models, and any-shot scenarios

Similar Papers
  • Research Article
  • Cite Count Icon 12
  • 10.1016/j.eswa.2023.121428
A missing manufacturing process data imputation framework for nonlinear dynamic soft sensor modeling and its application
  • Sep 6, 2023
  • Expert Systems with Applications
  • Liang Ma + 2 more

A missing manufacturing process data imputation framework for nonlinear dynamic soft sensor modeling and its application

  • Research Article
  • Cite Count Icon 18
  • 10.3141/2336-06
Airport Pavement Missing Data Management and Imputation with Stochastic Multiple Imputation Model
  • Jan 1, 2013
  • Transportation Research Record: Journal of the Transportation Research Board
  • J Farhan + 1 more

In practice, missing data in pavement condition databases have been one of the most prevalent problems in airport pavement management systems. Missing data present problems in pavement performance analysis and uncertainties in pavement management decision making. A number of data imputation approaches are available for handling missing data. This paper examines the limitations of the conventional data imputation methods and proposes a stochastic multiple imputation (MI) approach to overcome major limitations associated with conventional data imputation methods. A case study is presented to appraise the effectiveness of the proposed approach against three conventional data imputation methods, namely, substitution by mean, substitution by interpolation, and substitution by regression methods. The roughness and friction data of a 4-km-long runway pavement and the roughness data of a 4-km-long taxiway pavement were considered in the study. The effectiveness of auxiliary variables in data imputation models was also demonstrated. Results from the performance appraisal indicated that the proposed stochastic MI method yielded the smallest errors for the roughness as well as friction data. Furthermore, the substitution by mean method resulted in imputed values with the highest amount of deviations from the observed values, followed by the substitution by regression method, and the substitution by interpolation method. Therefore, it is concluded that the proposed stochastic MI method outperformed conventional methods in handling missing runway and taxiway pavement roughness and friction data and provides an effective approach to impute missing data required in an airport pavement management system.

  • Research Article
  • Cite Count Icon 22
  • 10.1109/tii.2021.3093386
Adversarial Attacks on Neural-Network-Based Soft Sensors: Directly Attack Output
  • Apr 1, 2022
  • IEEE Transactions on Industrial Informatics
  • Xiangyin Kong + 1 more

Neural-network-based soft sensors are widely employed in the industrial process. Such models have great significance to smart manufacturing. Considering the strict requirements of industrial production, it is vital to ensure the safety and robustness of these models in their actual deployment. However, recent research has shown that neural networks are quite vulnerable to adversarial attacks. By imposing tiny perturbation to the original sample, the fabricated adversarial sample can cheat these models to make wrong decisions. Such a phenomenon may bring serious trouble to the practical application of soft sensors. This article focuses on the adversarial attacks on industrial soft sensors. For the first time, we verify and analyze the effectiveness and deficiencies of the existing attack methods in the industrial soft sensor scenario. Based on solving these defects, this article proposes a novel perspective for attacking soft sensors. We analyze the optimization mechanism behind this new idea and then design two algorithms to perform attacks. The proposed methods more conform to the actual situation. Besides, compared with the existing approaches, the proposed methods have potentials to cause severer damages since their attacks are not only more concealed but also more likely to cheat the technicians to execute wrong operations. The research and analyses of the proposed methods lay a solid foundation for more thorough defenses against various attacks, which is quite necessary for making the deployed soft sensors more robust and secure.

  • Conference Article
  • 10.1109/ddcls55054.2022.9858447
Spatio-Temporal View GAIN for Data Imputation and Dynamic Soft Sensor
  • Aug 3, 2022
  • Jiayi Ren + 2 more

Soft sensor plays a key role in the safe operation of industrial processes and product quality control. Affected by closed-loop feedback, process data often demonstrate certain dynamic characteristics. In addition, traditional soft sensor methods are often based on the assumption of completeness. Once part of the data is missing, the above methods will collapse. In this paper, we propose a Spatio-Temporal view Generative Adversarial Imputation Network (GAIN) for data imputation and apply dynamic soft sensor for block missing accompanied with completely random missing data. Initially, a data pre-imputed strategy is designed to first impute the missing data through GAIN from the perspective of data distribution. Secondly, on the basis of pre-imputation, a data imputation strategy that integrates time and space information is proposed to impute the data precisely, which can comprehensively consider the time series of process data and the correlation between variables, and further improve the accuracy of data imputation. Finally, we propose a dynamic soft sensor method to make real-time predictions of the data. The effectiveness of the proposed method is verified by the Tennessee Eastman (TE) process.

  • Research Article
  • Cite Count Icon 41
  • 10.1176/appi.neuropsych.12040080
The Longitudinal Course of Post-Stroke Apathy Over Five Years
  • Oct 1, 2013
  • The Journal of Neuropsychiatry and Clinical Neurosciences
  • Henry Brodaty + 3 more

The prevalence of apathy is high after stroke, but its subsequent course remains unclear. We sought to determine the longitudinal course and predictors of apathy after stroke. Eligible patients admitted after a stroke and healthy control participants who were rated at least once on the Apathy Evaluation Scale were assessed over 5 years. Rates and levels of apathy in patients rose over 5 years. Significant risk factors for apathy were dementia, interval cerebrovascular events, poor physical functioning, and high depression scores. Apathy is common after stroke and becomes more prevalent with time, especially in those who show evidence of cognitive and functional decline.

  • Research Article
  • Cite Count Icon 13
  • 10.1161/strokeaha.115.007984
What is missing from my missing data plan?
  • May 7, 2015
  • Stroke
  • Sharon D Yeatts + 1 more

Under the intention-to-treat principle, all randomized subjects should be analyzed according to their randomly assigned treatment, regardless of treatment actually received or protocol compliance. Adherence to this principle requires that even subjects with missing outcome data be included in the analysis; in fact, the exclusion of such subjects can have important implications on power and bias. Statistical methods for dealing with missing data exist, but many questions remain unclear. Much statistical research has been devoted to the development and assessment of various methods for handling missing data.1 The choice of appropriate methodology requires assumptions on the mechanism underlying the missing data. All of these decisions should be made a priori, preferably before the trial starts but certainly before unblinding the trial. Related conversations between clinical investigators and the study statistician during the design phase often focus on more practical questions. Is there some threshold for the missing data rate below which the trial’s conclusions are unlikely to be affected? Under what circumstances can the missing data be excluded from the analysis without biasing estimation, or is imputation always the preferred approach? In this article, we discuss implications of missing outcome data from a practical standpoint. We describe potential reasons for missing data and suggest strategies to minimize its occurrence. We also present common imputation approaches and emphasize that because none of these approaches are universally preferred, the best analytic plan includes a series of sensitivity analyses. In any longitudinal trial where subjects are followed over some extensive period of time, lengthy follow-up makes missing data somewhat unavoidable. In stroke clinical trials, the primary outcome assessment often occurs at 90 days although there is evidence to suggest that additional follow-up may be beneficial. Subjects may expire, or withdraw informed consent, before primary outcome ascertainment. Subjects may become lost to the …

  • Research Article
  • 10.31767/su.4(91)2020.04.02
Software Implementation of Missing Data Recovery: Comparative Analysis
  • Dec 16, 2020
  • Statistics of Ukraine
  • N V Kovtun + 1 more

The paper contains a comparative analysis of the possibilities of using different software products to solve the problem of missing data on the example of the sample for which different variants of data skips are simulated. The study provided an opportunity to identify the strengths and weaknesses of these software products, as well as to determine the effectiveness of a particular method for different amounts of missed information. Thus, the easiest way to handle the situation with missing data is Statistica, but there are offered only simple methods of processing data with missing values in Statistica. So, this program will help to cope with the missed data when there is a small number of omissions (up to 10%). SPSS offers a wider range of data imputation methods than Statistica, and at the same time it offers a more user-friendly interface compared to the R or SAS programming language. In the R and SAS software environments, you can use different methods of missing data imputation from the simplest to the most complex, such as, for example, multiple imputation. Thus, R and SAS are the most powerful missing data recovery programs, but they are more complex for users because they require knowledge of the programming language. It is found out that none of the mentioned software-analytical environments has built-in procedures for processing categorical data with missing values. There are approaches that can be implemented by analogy for ordered categories in R and SAS software environments, but it does not cover all the needs of the analysis of research, which are implemented in the form of surveys with the results that are mostly presented as answers. The methods used to impute quantitative data cannot be applied to categorical data, even if numbers are used to encode responses. The study undoubtedly proved that handling the missing data, as well as the choosing of possible ways to use certain methods of data imputation in different software environments should be approached very carefully and the problem of imputation should be solved in each case based on careful analysis of the existing database, considering not only the characteristics of the data and the number of gaps, but also the specific of a particular study. Dealing with missing data involves a wide range of the issues, which includes both the exploration of the nature of gaps, the methodology for data processing and imputation, depending not only on their nature but also on the type and the use of various software environments on missing data imputation. It is planned in future research to assess the effectiveness of the recoverability of imputation methods in different software environments, as well as to develop methodological principles for restoring gaps for categorical data and implement them into practice.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 27
  • 10.1109/access.2020.2984588
Traffic Flow Prediction With Missing Data Imputed by Tensor Completion Methods
  • Jan 1, 2020
  • IEEE Access
  • Qin Li + 4 more

Missing data is inevitable and ubiquitous in intelligent transportation systems (ITSs). A handful of completion methods have been proposed, among which the tensor-based models have been shown to be the most advantageous for missing traffic data imputation. Despite their superior imputation accuracies, the adoption of these imputed data is not uniform in modern ITSs applications. The primary goal of this paper is to explore how to use tensor completion methods to support ITSs. In particular, we study how to improve traffic flow prediction accuracy under different missing scenarios. Specifically, three common missing scenarios including element-wise random missing, time-structured missing, and space-structured missing are considered. Four classical tensor completion models including Smooth PARAFAC Decomposition based Completion (SPC), CP Decomposition-based (CP-WOPT) Completion, Tucker Decomposition-based Completion (TDI), and High-accuracy Low-rank Tensor Completion (HaLRTC) are used to impute the missing data. Four well-known prediction methods including Support Vector Regression (SVR), K-nearest Neighbor (KNN), Gradient Boost Regression Tree (GBRT), and Long Short-term Memory (LSTM) are tested. The simple mean value interpolation completed traffic data is regarded as the baseline data. The extensive experiments show that improvements of traffic flow prediction can be achieved by increasing missing traffic data imputation accuracy at most cases. Interestingly we find that prediction accuracy cannot be improved by an imputation model when the sparsely observed training datasets already provide sufficient training samples.

  • Research Article
  • Cite Count Icon 59
  • 10.1016/j.conengprac.2011.10.007
Multiple model based LPV soft sensor development with irregular/missing process output measurement
  • Nov 9, 2011
  • Control Engineering Practice
  • Xing Jin + 3 more

Multiple model based LPV soft sensor development with irregular/missing process output measurement

  • Research Article
  • Cite Count Icon 71
  • 10.1093/swr/32.3.171
A Simplified Framework for Using Multiple Imputation in Social Work Research
  • Sep 1, 2008
  • Social Work Research
  • R A Rose + 1 more

Missing data are nearly always a problem in research, and missing values represent a serious threat to the validity of inferences drawn from findings. Increasingly, social science researchers are turning to multiple imputation to handle missing data. Multiple imputation, in which missing values are replaced by values repeatedly drawn from conditional probability distributions, is an appropriate method for handling missing data when values are not missing completely at random. However, use of this method requires developing an imputation model from the observed data. This is typically a rigorous and time-consuming process. To encourage wider adoption of multiple imputation in social work research, a simple framework for designing imputation models is presented. The framework and its ability to generate unbiased estimates are demonstrated in a simulation study. KEY WORDS: missing data; multiple imputation; nonresponse ********** Missing data are ubiquitous in social research, and missingness or nonresponse can represent a threat to the validity of inferences because of undue effects on efficiency, power, and parameter bias (Shadish, Cook, & Campbell, 2002). Social work researchers are now addressing missing data in a more rigorous manner. Recently, Saunders et al. (2006) and Choi, Golder, Gillmore, and Morrison (2005) described important data imputation methods and dispelled misunderstandings regarding popular imputation methods, such as mean substitution. Recent advances in analytic methods, such as multiple imputation (MI), are taking hold in social work research. With MI, missing values are replaced with values repeatedly drawn from simulated conditional probability distributions (Schafer, 1997), thus creating multiple versions of the data set. Each version of the data set is analyzed according to the data analysis model, and the multiple results are combined into point estimates (Rubin, 1996). A critical task in MI is to devise an imputation model (Allison, 2002) or missing data model (Graham, Olchowski, & Gilreath, 2007), which involves specifying the measures that are putatively associated with the missing values. Although this process adds additional steps, the specification of an imputation model and the creation of multiple data sets can produce less-biased estimates in the presence of missing data across a wide variety of data analysis techniques (Schafer, 1997). Besides MI, there are many other methods for addressing missing data (Schafer, 1999; Schafer & Graham, 2002). An equally rigorous method known as direct or full information maximum likelihood (FIML) estimation can produce unbiased estimates and correct standard errors in the presence of missing data. When the number of imputations is sufficiently large, identical missing data models will produce the same estimates under MI and FIML (Graham et al., 2007). Unlike MI, FIML is limited to maximum likelihood analytic techniques and the missing data model must be included in the analysis model. Although we focus on MI, the steps we describe for developing an imputation model are equally appropriate for use in the missing data model for FIML. On the basis of the MI literature, this article describes a framework for developing an imputation model for use with any free or commercial software package that performs MI. BRIEF REVIEW OF MISSING DATA CONCEPTS Generally, both MI and a broad range of missing data issues have received ample attention in the applied literature (Graham et al., 2007). We briefly discuss the three types of distributions that describe the randomness of nonresponse given that this property has consequences for the development of an imputation model. For a discussion of general missing data concepts that are not critical to understanding our discussion of imputation model development, we refer readers to Schafer and Graham (2002). Distribution of Nonresponse The probability distribution of nonresponse--more frequently referred to as the missing data or nonresponse mechanism (Rubin, 1976)--is both an important factor in the decision to impute with MI and a context for the development of an imputation model. …

  • Research Article
  • Cite Count Icon 13
  • 10.1002/apj.2422
Missing data treatment for locally weighted partial least square‐based modelling: A comparative study
  • Feb 11, 2020
  • Asia-Pacific Journal of Chemical Engineering
  • Wan Sieng Yeo + 2 more

Adaptive soft sensors including widely used locally weighted partial least square (LW‐PLS) have been established for online prediction, fault detection, and process monitoring. Nevertheless, majority of these existing adaptive soft sensors have zero tolerance to missing data, and the presence of missing data is inevitable due to sensor failures, routine maintenance, changes in sensor equipment over time, merging data from different system, and so forth. In the literature, limited studies could be found on the effects of missing data and the existing missing data imputation methods on the predictive performances of adaptive soft sensors. This work reports combined use of different missing data imputation techniques on LW‐PLS and the nonadaptive soft sensor, the partial least square (PLS). Well known trimmed score regression (TSR) and singular value decomposition (SVD) were employed in this study, and thus, the newly integrated TSR‐LW‐PLS and SVD‐LW‐PLS algorithms were proposed. Meanwhile, both existing TSR‐PLS and SVD‐PLS were used, and their results compared with the novel TSR‐LW‐PLS and SVD‐LW‐PLS algorithms. These algorithms were tested and evaluated using two examples having different percentages of missing data ranging from 5% to 40%. Results showed that TSR‐LW‐PLS model was superior compared with SVD‐LW‐PLS, TSR‐PLS, and SVD‐PLS as indicated by more than 100% improvements in prediction accuracy. It was also evident that TSR‐LW‐PLS is able to cope well with up to 20% of missing data.

  • Research Article
  • Cite Count Icon 113
  • 10.1016/j.kint.2020.07.035
A practical guide to multiple imputation of missing data in nephrology
  • Aug 18, 2020
  • Kidney International
  • Katrina Blazek + 3 more

A practical guide to multiple imputation of missing data in nephrology

  • Book Chapter
  • Cite Count Icon 4
  • 10.1016/b978-0-444-63456-6.50115-0
Developing a Soft Sensor for an Air Separation Process Based on Variable Selection in Dynamic Partial Least Squares
  • Jan 1, 2014
  • Computer Aided Chemical Engineering
  • Jialin Liu + 1 more

Developing a Soft Sensor for an Air Separation Process Based on Variable Selection in Dynamic Partial Least Squares

  • Dissertation
  • 10.11588/heidok.00024252
Strategies for cancer clinical trials with multiple biomarkers
  • Jan 1, 2018
  • Christina Habermehl

Strategies for cancer clinical trials with multiple biomarkers

  • Research Article
  • Cite Count Icon 75
  • 10.1016/j.knosys.2020.105592
Estimation of missing values in heterogeneous traffic data: Application of multimodal deep learning model
  • Feb 4, 2020
  • Knowledge-Based Systems
  • Linchao Li + 4 more

Estimation of missing values in heterogeneous traffic data: Application of multimodal deep learning model

More from: IEEE Transactions on Automation Science and Engineering
  • Research Article
  • Cite Count Icon 1
  • 10.1109/tase.2024.3417387
Prescribed Performance Tracking Control for Nonlinear Multiagent Systems With Distributed Observation Errors Compensation
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Xuefei Wang + 3 more

  • Research Article
  • Cite Count Icon 4
  • 10.1109/tase.2024.3498064
Knowledge Transfer Enabled Diverse Task Scheduling for Individualized Requirements in Industrial Cloud Platform
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Jiajun Zhou + 3 more

  • Research Article
  • Cite Count Icon 2
  • 10.1109/tase.2024.3412823
Whole-Body Safety-Critical Control Design of an Upper Limb Prosthesis for Vision-Based Manipulation and Grasping
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Zhijun Li + 3 more

  • Research Article
  • 10.1109/tase.2024.3510744
Adaptive Dynamic Surface Funnel Control of Hydraulic Systems With Finite-Time Prescribed Tracking Performance
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Zhangbao Xu + 3 more

  • Research Article
  • 10.1109/tase.2025.3551091
Lag Output Consensus for Second-Order Nonlinear Multiagent Systems Under PID Control
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Jin-Liang Wang + 3 more

  • Open Access Icon
  • Research Article
  • 10.1109/tase.2024.3429643
Design of an Assistive Controller for Physical Human–Robot Interaction Based on Cooperative Game Theory and Human Intention Estimation
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Paolo Franceschi + 4 more

  • Research Article
  • 10.1109/tase.2025.3597838
Controllable Mixture-of-Experts for Multivariate Soft Sensors
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Licheng Pan + 7 more

  • Research Article
  • 10.1109/tase.2025.3591656
VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Huilin Deng + 5 more

  • Research Article
  • Cite Count Icon 25
  • 10.1109/tase.2024.3445135
A Fixed-Time Consensus Control With Prescribed Performance for Multi-Agent Systems Under Full-State Constraints
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Shangbin Long + 5 more

  • Research Article
  • Cite Count Icon 7
  • 10.1109/tase.2023.3318007
Large-Scale Person Re-Identification for Crowd Monitoring in Emergency
  • Jan 1, 2025
  • IEEE Transactions on Automation Science and Engineering
  • Nayan Kumar Subhashis Behera + 3 more

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.

Search IconWhat is the difference between bacteria and viruses?
Open In New Tab Icon
Search IconWhat is the function of the immune system?
Open In New Tab Icon
Search IconCan diabetes be passed down from one generation to the next?
Open In New Tab Icon