Abstract

Machine learning is a suitable pattern recognition technique for detecting correlations between data. In the case of unsupervised learning, the groups formed from these correlations must receive a label, which consists of describing them in terms of their most relevant attributes and their respective ranges of values so that they are understood automatically. In this research work, this process is called labeling. However, a challenge for researchers is to establish the optimal number of groups submitted to grouping, which influences the performance of this process. Therefore, this research aims to provide an inference approach to the number of clusters used in the clustering based on the range of attribute values, followed by automatic data labeling to maximize the understanding of the groups obtained. This methodology was applied to four databases, and the results show that it contributes to the interpretation of groups since it generates more accurate labels with a hit rate above 93%.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call