The focus areas of the Industrial Engineering and Management journal include production, logistics, quality, operational research, information systems, technology, communication, industrial economics, regional development, management, organizational behavior, human resources, finance, accounting, marketing, education, training, and professional skills [1]. The aim of this journal is to become a reliable source of information for leaders in the field of industrial engineering management journals research, and to feature a rapid review process [2]. The subject discussed in this paper is data mining for industrial engineering and management. Knowledge Discovery in Databases Knowledge Discovery in Databases (KDD) is an iterative process of extracting implicit, previously unknown, and potentially useful knowledge as a production factor from large datasets [3]. It includes data selection, cleaning, integration, transformation, data mining (DM), and reporting. The KDD process consists of steps that are performed before conducting data mining (i.e., selection, pre-processing, and transformation of data), the actual DM, and subsequent steps (i.e., interpretation, and evaluation of results) [4]. DM refers to the specific step of applying one or more statistical, machine-learning, or imageprocessing algorithms to a particular dataset with the intent to extract useful patterns from the datasets [5]. DM is widely used in market segmentation, customer profiling, fraud detection, retail promotions, and credit risk analysis [6]. Data Mining With the rapid growth of databases in numerous modern enterprises, DM has become an increasingly valuable data analysis approach. The operations research community has made substantial contributions to this field, particularly by formulating and solving numerous DM problems as optimization problems. In addition, several operations research applications can be addressed using DM methods [7]. In recent years, the data mining field has experienced substantial interest from both academia and industry. DM problems are typically categorized as association, clustering, classification, and prediction [8]. DM involves various techniques, including statistics, neural networks, decision trees, genetic algorithms, and visualization techniques that have been developed over the years. Statistics Regression is one of the most crucial statistical methods applied to science, engineering, economics, and management [9]. Regression is useful for the prediction of the presence or absence of a characteristic or outcome based on values of a set of independent variables that are continuous, categorical, or both. Furthermore, it assumes that measures of dependent variables were independently and randomly sampled, all potentially relevant independent variables are in the model, and all independent variables in the model are relevant [10,11]. Artificial neural networks
Read full abstract