DyClee-N&C: a clustering algorithm for heterogeneous data based situation assessment

Audine Subias,Louise Travé-Massuyès,Tom Obry

doi:10.1016/j.ifacol.2023.10.1742

Abstract

In data-based situation assessment applications, the proliferation of data acquired and recorded on current technological systems is a key issue in that data remain unlabeled because labeling would require too much time and implies prohibitive costs. The data should therefore speak for itself. The different situations, e.g., normal or faulty, must hence be learned only from the data. Clustering methods, also named unsupervised classification methods, can be used for that purpose. These methods are designed to cluster the samples according to some similarity criterion. The different clusters can be associated to different situations whose discrimination may be relevant to obtain a proper diagnosis.Numerous algorithms have been developed in recent years for clustering numeric data but these methods are not applicable to categorical data. This is the case of the algorithm DyClee, named DyClee-N in the paper. However, in many application domains, qualitative features are key to properly describe the different situations. DyClee-N was recast to produce a version, named DyClee-C that accepts categorical features, but only categorical features. This paper presents DyClee-N&C that subsumes both the numeric and categorical feature based algorithms DyClee-N and DyClee-C respectively. DyClee-N&C is applied to a data set of the literature for the evaluation of risk in the automobile domain and compared to state of the art clustering methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DyClee-N&C: a clustering algorithm for heterogeneous data based situation assessment

Abstract

Talk to us

Similar Papers

More From: IFAC PapersOnLine

Lead the way for us

Similar Papers

Machine learning algorithm for feature space clustering of mixed data with missing information based on molecule similarity
K Balaji
Journal of Biomedical Informatics | VOL. 125
K BalajiK Balaji
15 Nov 2021
Journal of Biomedical Informatics | VOL. 125

DyClee-C: a clustering algorithm for qualitative data based diagnosis

-

27 Aug 2020
27 Aug 2020

The use of autoencoders for training neural networks with mixed categorical and numerical features
Łukasz Delong ... Anna Kozak
ASTIN Bulletin | VOL. 53
Łukasz Delong, et. al.Łukasz Delong ... Anna Kozak
24 Apr 2023
ASTIN Bulletin | VOL. 53

Impact of categorical and numerical features in ensemble machine learning frameworks for heart disease prediction
Chandan Pan ... Ajoy Kumar Ray
Biomedical Signal Processing and Control | VOL. 76
Chandan Pan, et. al.Chandan Pan ... Ajoy Kumar Ray
05 Apr 2022
Biomedical Signal Processing and Control | VOL. 76

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DyClee-N&C: a clustering algorithm for heterogeneous data based situation assessment

Abstract

Talk to us

Similar Papers

More From: IFAC PapersOnLine