Constant Q Cepstral coefficients for classification of normal vs. Pathological infant cry

Hemant A Patil,Aastha Kachhi,Ankur T Patil

doi:10.1109/icassp43922.2022.9746946

Abstract

Classification of normal vs. pathological infant cry is an interesting and technologically challenging research problem due to quasi-periodic sampling of vocal tract spectrum by high pitch-source harmonics resulting in extremely poor spectral resolution for commonly used spectral features, such as Mel Frequency Cepstral Coefficients (MFCC). To that effect, in this paper, we propose a new approach of feature extraction based on Constant Q Transform (CQT) that is known to have variable spectro-temporal resolution w.r.t Heisenberg’s un-certainty principle in signal processing framework. Further, CQT is also known to preserve form-invariance property (than its Short-Time Fourier Transform (STFT) counterpart)-a desirable attribute of feature descriptors to be invariant w.r.t shape, shift, rotation, and scaling. CQT- based features are then transformed to the cepstral-domain to derive Constant Q Cepstral Coefficients (CQCC), which are then fed to statistical and discriminative classifiers, namely, Gaussian Mixture Model (GMM) and Support Vector Machine (SVM) respectively. CQCC-GMM and CQCC-SVM systems gave relatively better results than MFCC for various experimental evaluation factors for infant cry classification task on widely used and statistically meaningful Baby Chilanto Database. Relatively best performance, in particular, 99.82% accuracy (0.44% EER), is observed for CQCC-GMM system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Constant Q Cepstral coefficients for classification of normal vs. Pathological infant cry

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Data Augmentation for Infant Cry Classification
Aastha Kachhi ... Hemant A Patil
-
Aastha Kachhi, et. al.Aastha Kachhi ... Hemant A Patil
11 Dec 2022
11 Dec 2022

On significance of constant-Q transform for pop noise detection
Kuldeep Khoria ... Hemant A Patil
Computer Speech & Language | VOL. 77
Kuldeep Khoria, et. al.Kuldeep Khoria ... Hemant A Patil
11 Jun 2022
Computer Speech & Language | VOL. 77

Analysis of normal and pathological infant cries using bispectrum features derived using HOSVD
Anshu Chittora ... Hemant A Patil
-
Anshu Chittora, et. al.Anshu Chittora ... Hemant A Patil
01 May 2015
01 May 2015

Significance of Higher-Order Spectral Analysis in Infant Cry Classification
Anshu Chittora ... Hemant A Patil
Circuits, Systems, and Signal Processing | VOL. 37
Anshu Chittora, et. al.Anshu Chittora ... Hemant A Patil
04 Apr 2017
Circuits, Systems, and Signal Processing | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Constant Q Cepstral coefficients for classification of normal vs. Pathological infant cry

Abstract

Talk to us

Similar Papers