PITCH ESTIMATION FRAMEWORK FOR SPEECH SEGREGATION USING COCHLEAGRAM MORPHING

M.j Khan M.J Khan

doi:10.57041/pjs.v67i4.605

Abstract

Computational auditory scene analysis (CASA) has significant role in speech segregation from monaural audio mixtures and generally a measure for performance of speech recognition systems. Pitch estimation has a substantial role in performance of CASA systems. This study presents a novel pitch estimation framework for speech segregation from monaural audio mixtures using cochleagram morphing. The proposed framework takes the rough estimation of target pitch from given audio mixtures containing speech and background interferences. Discrete set consisting morphed versions of cochleagram is obtained using k-Means clustering. The estimated pitch values are improved by validating and smoothing them to morphed cochleagram. Measure of refined estimated pitch contours along with harmonicity and temporal continuity are used to segregate target speech. The proposed framework produced 83.13% accuracy for MIR-1k dataset which is considerably higher than the existing methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PITCH ESTIMATION FRAMEWORK FOR SPEECH SEGREGATION USING COCHLEAGRAM MORPHING

Abstract

Talk to us

Similar Papers

More From: Pakistan Journal of Science

Lead the way for us

Journal: Pakistan Journal of Science	Publication Date: Jan 4, 2023
License type: CC BY-SA 4.0

Similar Papers

Computational Auditory Scene Analysis: Principles, Algorithms and Applications
Chris Darwin
The Journal of the Acoustical Society of America | VOL. 124
Chris DarwinChris Darwin
01 Jul 2008
The Journal of the Acoustical Society of America | VOL. 124

Monaural speech segregation based on pitch tracking and amplitude modulation
Guoning Hu ... Deliang Wang
-
Guoning Hu, et. al.Guoning Hu ... Deliang Wang
01 May 2002
01 May 2002

A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
Guoning Hu ... Deliang Wang
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 18
Guoning Hu, et. al. Guoning Hu ... Deliang Wang
01 Nov 2010
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 18

Time scale modification and vocal tract length normalization for improving the performance of Tamil speech recognition system implemented using language independent segmentation algorithm
S Saraswathi ... T V Geetha
International Journal of Speech Technology | VOL. 9
S Saraswathi, et. al.S Saraswathi ... T V Geetha
01 Dec 2006
International Journal of Speech Technology | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PITCH ESTIMATION FRAMEWORK FOR SPEECH SEGREGATION USING COCHLEAGRAM MORPHING

Abstract

Talk to us

Similar Papers

More From: Pakistan Journal of Science