Fault detection and classification (FDC) is important for semiconductor manufacturing to monitor equipment’s condition and examine the potential cause of the fault. Each equipment in the semiconductor manufacturing process is often accompanied by a large amount of sensor readings, also called status variable identification (SVID). Identifying the key SVIDs accurately can make it easier for engineers to monitor the process and maintain the stability of the process and wafer productive yields. This article proposes using the random forests algorithm to analyze the importance of SVIDs of equipment sensors, automatically filters the key SVID by using ${k}$ -means, and integrates various machine learning methods to verify the key SVIDs and identify key processing time and steps. Upon the key parameters are identified, the key processing time and steps are investigated subsequently. The ensemble models constructed on ${k}$ -nearest neighbors ( ${k}$ NNs) and naive Bayes classifiers are presented for classifying wafers as normal or abnormal. Data visualization of multidimensional key SVIDs is performed by using ${t}$ -distributed stochastic neighbor embedding ( ${t}$ -SNE) to create a graphical aid in FDC for the process engineer. An empirical study is conducted to validate the proposed data-driven framework for fault detection and diagnostic. The experimental results demonstrate that the proposed framework can detect abnormality effectively with highly imbalanced classes and also gain insightful information about the key SVIDs and corresponding key processing time and steps. Note to Practitioners —The challenges of equipment sensor data analytics in semiconductor manufacturing include building the classifier to detect wafer abnormality correctly, identification of key status variable identifications (SVIDs) and processing time and steps of abnormality, and data visualization of the abnormality in a high-dimensional feature space. This article proposes a data-driven framework for fault detection and classification (FDC) during the wafer fabrication process by incorporating several useful machine learning approaches. Experimental results demonstrate that the proposed data-driven framework can supply quality fault detection performances and provide valuable information regarding the critical SVIDs and associated key processing time for fault diagnostic. The engineers can utilize the extracted fault patterns to perform a prognosis of the aging effect on process tools or modules for health management.
Read full abstract