Currently, with the increasing scale of industrial systems, multisensor monitoring data exhibit large-scale dynamic Gaussian and non-Gaussian concurrent complex characteristics. However, the traditional principal component analysis method is based on Gaussian distribution and uncorrelated assumptions, which are greatly limited in practice. Therefore, developing a new fault detection method for large-scale Gaussian and non-Gaussian concurrent dynamic systems is one of the urgent challenges to be addressed. To this end, a double-layer distributed and integrated data-driven strategy based on Laplacian score weighting and integrated Bayesian inference is proposed. Specifically, in the first layer of the distributed strategy, we design a Jarque-Bera test module to divide all multisensor monitoring variables into Gaussian and non-Gaussian blocks, successfully solving the problem of different data distributions. In the second layer of the distributed strategy, we design a dynamic augmentation module to solve dynamic problems, a K-means clustering module to mine local similarity information of variables, and a Laplace scoring module to quantitatively evaluate the structural retention ability of variables. Therefore, this double-layer distributed strategy can simultaneously combine the different distribution characteristics, dynamism, local similarity, and importance of variables, comprehensively mining the local information of the multisensor data. In addition, we develop an integrated Bayesian inference strategy based on detection performance weighting, which can emphasize the differential contribution of local models. Finally, the fault detection results for the Tennessee Eastman production system and a diesel engine working system validate the superiority of the proposed method.
Read full abstract