Abstract
Data-driven approaches applied to large and complex data sets are intriguing; however, the results must be reviewed with a critical attitude. For example, a diagnostic tool may provide hints of a serious disease, or of anomalous conditions potentially indicating an impending natural hazard. The demand for a high rate of identified anomalies (true positives) comes together with the request for a low rate of false positives; indeed, a high rate of false positives can ruin the diagnostics. Receiver Operating Characteristic (ROC) curves allow us to find a reasonable compromise between the need for diagnostic accuracy and robustness with respect to false alerts.

In multiclass problems, success is commonly measured by how well the calculated classification of patterns matches the target classification. A high score does not automatically mean that a method is truly effective; its value becomes questionable when a random guess leads to a high score as well. The so-called kappa statistic is an elegant way to assess the quality of a classification scheme. We present some case studies demonstrating how such a posteriori analysis helps corroborate the results.

Sometimes an approach does not lead to the desired success. In these cases, a sound a posteriori analysis of the reasons for the failure often provides interesting insights into the problem. Those problems may reside in an inappropriate definition of the targets, inadequate features, etc. Often the problems can be fixed simply by adjusting some choices; failing that, a change of strategy may be necessary in order to achieve a more satisfying result. In the applications presented here, we highlight the pitfalls arising in particular from ill-defined targets and unsuitable feature selections.

The validation of unsupervised learning is still a matter of debate. Some formal criteria (e.g. the Davies-Bouldin index, the silhouette index, and others) are available for centroid-based clustering, where a unique metric valid for all clusters can be defined. Difficulties arise when metrics are defined individually for each single cluster (for instance, Gaussian-model clusters or adaptive criteria), as well as in schemes where centroids are essentially meaningless, as is the case in density-based clustering. In all these cases, users are better off asking themselves whether a clustering is meaningful for the problem in physical terms. In our presentation we discuss the problem of choosing a suitable number of clusters in cases where formal criteria are not applicable, and we demonstrate how the identification of groups of patterns helps identify elements with a clear physical meaning, even when strict rules for assessing the clustering are not available.
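For concreteness, below is a minimal sketch of how an operating point on a ROC curve might be chosen. It assumes scikit-learn is available; the synthetic scores are hypothetical, and Youden's J statistic is just one common compromise criterion, not necessarily the one used in the case studies.

```python
import numpy as np
from sklearn.metrics import roc_curve

rng = np.random.default_rng(0)

# Hypothetical scores: anomalies (label 1) tend to score higher than normal cases.
y_true = np.concatenate([np.zeros(500, dtype=int), np.ones(50, dtype=int)])
y_score = np.concatenate([rng.normal(0.0, 1.0, 500), rng.normal(1.5, 1.0, 50)])

# roc_curve sweeps the decision threshold and returns one (FPR, TPR) pair per step.
fpr, tpr, thresholds = roc_curve(y_true, y_score)

# Youden's J = TPR - FPR: a simple compromise between sensitivity and false alerts.
best = np.argmax(tpr - fpr)
print(f"threshold={thresholds[best]:.2f}  TPR={tpr[best]:.2f}  FPR={fpr[best]:.2f}")
```

Lowering the threshold raises the true-positive rate but drags the false-positive rate up with it; the ROC curve makes that trade-off explicit instead of hiding it in a single accuracy number.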
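The kappa statistic mentioned above is usually taken to be Cohen's kappa, which measures how much the observed agreement exceeds the agreement a random guess would already achieve:

```latex
\kappa = \frac{p_o - p_e}{1 - p_e},
\qquad
p_e = \sum_{k} p_{k}^{\mathrm{calc}} \, p_{k}^{\mathrm{target}}
```

Here p_o is the observed fraction of patterns on which the calculated and target classifications agree, and p_e is the chance agreement computed from the marginal class frequencies. A value of kappa near 0 flags a high raw score that a random guess would reach as well, which is precisely the situation the abstract warns about.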
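As a minimal sketch of the formal criteria mentioned above, the fragment below scans the number of clusters for a centroid-based method (k-means) and reports the silhouette and Davies-Bouldin indices; scikit-learn, the synthetic data, and the scanned range are assumptions for illustration. For density-based clustering, where centroids are essentially meaningless, these indices cannot be trusted in the same way, which is exactly the difficulty the abstract discusses.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score, davies_bouldin_score

# Synthetic data with a known group structure, for illustration only.
X, _ = make_blobs(n_samples=300, centers=4, random_state=0)

for k in range(2, 8):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    # Higher silhouette is better; lower Davies-Bouldin is better.
    print(f"k={k}: silhouette={silhouette_score(X, labels):.3f}, "
          f"DB={davies_bouldin_score(X, labels):.3f}")
```

When such indices disagree or are not applicable at all, the fallback advocated in the abstract remains: ask whether the resulting groups of patterns are meaningful for the problem in physical terms.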