Abstract

The advent of artificial intelligence and machine learning is profoundly influencing the manufacturing industry, creating unprecedented opportunities to improve manufacturing processes along the three dimensions of time, quality, and cost. With the introduction of digitization and Industry 4.0, increasing amounts of data are becoming available for processing and use in smart manufacturing systems. However, the various use cases for machine learning in manufacturing often require problem-specific datasets for training and evaluating algorithms, and such datasets are difficult to acquire, which hinders both practitioners and academic researchers in this area. Because the data frequently contain sensitive information, manufacturing companies rarely release datasets to the public. Further, the relevant attributes and features of the datasets that are available are usually not evident, so time-consuming analysis is required to determine whether a dataset fits a given problem. As a result, the lack of an overview of available datasets makes it challenging to develop and evaluate machine learning methods for manufacturing systems. This paper presents a comprehensive overview of 47 existing, publicly available datasets, mapped to various use cases in manufacturing, with the goal of simplifying and stimulating research. The characteristics of the datasets are compared using a set of descriptive attributes to provide an outline and guidance for further research and application of machine learning in manufacturing. In addition, suitable performance metrics for the evaluation of classification use cases in manufacturing are presented.
