Abstract

Regression problems are present in many industrial applications, and many supervised learning algorithms have been devised over decades. However, available labeled examples are limited in some application settings; meanwhile, enormous unlabeled examples are relatively easy to collect. Thus, this work proposes a simple but effective method to cope with semi-supervised regression problems. We propose to use deep neural networks to develop our proposed method as deep learning has shown promising results in recent years. Our proposed method is a metric-based approach, and the goal is to learn an embedding space by metric learning with few labeled examples and enormous unlabeled examples. The regression estimation of the target data point is performed on the new space. We generate an artificial dataset based on several criteria to investigate whether the proposed model could make accurate predictions on the data samples that have specific properties. The experimental results point that our proposed model could capture the trend of a non-linear function and normally predict well even though this dataset comprises extreme outliers. Moreover, we conduct experiments on four datasets and compare our proposed work with several alternatives. The experimental results indicate that our proposed method achieves promising results. Besides performance evaluation, detailed analysis about our proposed method is also provided in this work.

Highlights

  • Regression problems are present in many industrial applications, including the prediction of product quality [22], aesthetic quality assessments [4], condition monitoring [9], and analysis of wafer probe test data [31]

  • We propose to use kNN to make predictions for the data points projected to the embedding space that is learned by the siamese network

  • This work focuses on semi-supervised regression, so we assume that a few labeled examples and enormous unlabeled examples are available at hand, and they are the elements of the training set

Read more

Summary

Introduction

Regression problems are present in many industrial applications, including the prediction of product quality [22], aesthetic quality assessments [4], condition monitoring [9], and analysis of wafer probe test data [31]. Sampling inspection is a popular way in the industry, in which the operators tend to estimate the overall product qualities with limited inspected products based on statistical inference. In this case, the inspected products comprise outputs, but the outputs for most products are missing. Data labeling is always performed manually, and requires the involve of domain experts in some domains such as medical diagnosis. A few labeled examples are available, and

Methods
Findings
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.