Nearest-neighbor Classifier Research Articles

We explored the use of broadband colors to classify stars, galaxies, and quasi-stellar objects (QSOs). Specifically, we applied sharpened dimensionality reduction (SDR)-aided classification to this problem, with the aim of enhancing cluster separation in the projections of high-dimensional data clusters to allow for better classification performance and more informative projections. The main objective of this work was to apply SDR to large sets of broadband colors derived from the CPz catalog to obtain projections with clusters of star, galaxy, and QSO data that exhibit a high degree of separation. The SDR method achieves this by combining density-based clustering with conventional dimensionality-reduction techniques. To make SDR scalable and have the ability to project samples using the earlier-computed projection, we used a deep neural network trained to reproduce the SDR projections. Subsequently classification was done by applying a $k$-nearest neighbors ($k$-NN) classifier to the sharpened projections. Based on a qualitative and quantitative analysis of the embeddings produced by SDR, we find that SDR consistently produces accurate projections with a high degree of cluster separation. A number of projection performance metrics are used to evaluate this separation, including the trustworthiness, continuity, Shepard goodness, and distribution consistency metrics. Using the $k$-NN classifier and consolidating the results of various data sets, we obtain precisions of 99.7<!PCT!>, 98.9<!PCT!>, and 98.5<!PCT!> for classifying stars, galaxies, and QSOs, respectively. Furthermore, we achieve completenesses of 97.8<!PCT!>, 99.3<!PCT!>, and 86.8<!PCT!>, respectively. In addition to classification, we explore the structure of the embeddings produced by SDR by cross-matching with data from Gaia DR3, Galaxy Zoo 1, and a catalog of specific star formation rates, stellar masses, and dust luminosities. We discover that the embeddings reveal astrophysical information, which allows one to understand the structure of the high-dimensional broadband color data in greater detail. We find that SDR-aided star, galaxy, and QSO classification performs comparably to another unsupervised learning method using hierarchical density-based spatial clustering of applications with noise (HDBSCAN) but offers advantages in terms of scalability and interpretability. Furthermore, it outperforms traditional color selection methods in terms of QSO classification performance. Overall, we demonstrate the potential of SDR-aided classification to provide an accurate and physically insightful classification of astronomical objects based on their broadband colors.

Read full abstract

The alarming rates of biodiversity loss has draw attention of the scientific community in developing quantitative characterization methods of the state of life variety, ecosystem and habitats. One way to address such aspects involves acoustically monitoring the bird population, given its relation with the health and status of the ecosystem. Automated computational tools can be adopted to enhance the processing and interpretation of birdsongs. The performance and reliability of the system heavily depends on the input features extracted from the acoustic recordings. Numerous acoustical features for characterizing birdsongs are reported in the literature, however the determination of the most relevant ones remains elusive. Moreover, literature evidences a marked focus on classification or detection performance, providing limited details on the set of features that contribute to high performance. This study investigates the problem of discerning relevant acoustical features when dealing with the automatic classification of eight Colombian bird species. We adopt different audio signal processing techniques, namely temporal, spectral, cepstral and chroma analyses, to construct a heterogeneous set of features. Feature selection is implemented using principal components analysis, ReliefF feature ranking, and genetic algorithms. By considering nearest neighbors classifiers, support vector machines, neural networks and bayesian classifiers, 49 distinct machine learning models are tested. Selection schemes are fine-tuned looking for maximizing the classification performance. Our results demonstrate that effective machine learning classification of bird species can be achieved by refining a heterogeneous set of features through feature selection. Consistently, our work identifies Mel frequency Cepstral coefficients, spectral decrease, and spectral rolloff point as the most recurrent features across various feature selection schemes. By incorporating these features in a heterogenous subset totaling a minimum of 19 features, classification performances above 95% can be achieved using a nearest neighbor classifier with the Manhattan distance. These findings are particularly notable as they are derived from processing field-noise corrupted acoustic recordings. The outcomes of this study yield potential acoustical features suitable for the characterization of birdsongs. In addition, our methodological design may serve as a reference for extending the assessment of new features and machine learning models, thereby contributing to the development of effective and feasible conservation tools.

Read full abstract

Nearest-neighbor Classifier Research Articles

Related Topics

Articles published on Nearest-neighbor Classifier

Multilevel hybrid handcrafted feature extraction based depression recognition method using speech

Assessment of spatial distribution of schools in the northern area of Riyadh city

Ensemble Fusion Models Using Various Strategies and Machine Learning for EEG Classification.

A statistical evaluation of the sexual dimorphism of the acetabulum in an Iberian population.

Simultaneous Instance and Attribute Selection for Noise Filtering

A multi-average based pseudo nearest neighbor classifier

Supervised star, galaxy, and QSO classification with sharpened dimensionality reduction

REVOLUTIONIZING POWER TRANSFORMER FAULT DIAGNOSIS THROUGH COGNITIVE ARTIFICIAL INTELLIGENCE AND DISSOLVED GAS ANALYSIS INTEGRATION

Unveiling relevant acoustic features for bird species automatic classification

Research on a transmission line defect identification method based on voiceprint recognition

EARLY-STAGE DIABETES RISK PREDICTION USING MACHINE LEARNING TECHNIQUES BASED ON ENSEMBLE APPROACH

Comparative Study among Term Frequency-Inverse Document Frequency and Count Vectorizer towards K Nearest Neighbor and Decision Tree Classifiers for Text Dataset

Prediction of effective parameters for 3D printing of poly lactic acid-carbon fibre composites using intelligent frameworks based on mechanical response

An improved Differential evolution with Sailfish optimizer (DESFO) for handling feature selection problem

Tiny Machine Learning for Concept Drift.

A geometric algebra-based approach for myoelectric pattern recognition control and faster prosthesis recalibration

PIRAP: A Study on Optimized Multi-Language Classification and Text Categorization Using Supervised Hybrid Machine Learning Approaches

Interpretable metric learning in comparative metagenomics: The adaptive Haar-like distance.

Enhancing Speaker Recognition Models with Noise-Resilient Feature Optimization Strategies

Enhancing Nearest Neighbor Classification Performance through Dynamic Time Warping with Progressive Constraint

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Nearest-neighbor Classifier Research Articles

Related Topics

Articles published on Nearest-neighbor Classifier

Multilevel hybrid handcrafted feature extraction based depression recognition method using speech

Assessment of spatial distribution of schools in the northern area of Riyadh city

Ensemble Fusion Models Using Various Strategies and Machine Learning for EEG Classification.

A statistical evaluation of the sexual dimorphism of the acetabulum in an Iberian population.

Simultaneous Instance and Attribute Selection for Noise Filtering

A multi-average based pseudo nearest neighbor classifier

Supervised star, galaxy, and QSO classification with sharpened dimensionality reduction

REVOLUTIONIZING POWER TRANSFORMER FAULT DIAGNOSIS THROUGH COGNITIVE ARTIFICIAL INTELLIGENCE AND DISSOLVED GAS ANALYSIS INTEGRATION

Unveiling relevant acoustic features for bird species automatic classification

Research on a transmission line defect identification method based on voiceprint recognition

EARLY-STAGE DIABETES RISK PREDICTION USING MACHINE LEARNING TECHNIQUES BASED ON ENSEMBLE APPROACH

Comparative Study among Term Frequency-Inverse Document Frequency and Count Vectorizer towards K Nearest Neighbor and Decision Tree Classifiers for Text Dataset

Prediction of effective parameters for 3D printing of poly lactic acid-carbon fibre composites using intelligent frameworks based on mechanical response

An improved Differential evolution with Sailfish optimizer (DESFO) for handling feature selection problem

Tiny Machine Learning for Concept Drift.

A geometric algebra-based approach for myoelectric pattern recognition control and faster prosthesis recalibration

PIRAP: A Study on Optimized Multi-Language Classification and Text Categorization Using Supervised Hybrid Machine Learning Approaches

Interpretable metric learning in comparative metagenomics: The adaptive Haar-like distance.

Enhancing Speaker Recognition Models with Noise-Resilient Feature Optimization Strategies

Enhancing Nearest Neighbor Classification Performance through Dynamic Time Warping with Progressive Constraint