Unsupervised speech segmentation: An analysis of the hypothesized phone boundaries

Odette Scharenborg,Mirjam Ernestus,Vincent Wan

doi:10.1121/1.3277194

Abstract

Despite using different algorithms, most unsupervised automatic phone segmentation methods achieve similar performance in terms of percentage correct boundary detection. Nevertheless, unsupervised segmentation algorithms are not able to perfectly reproduce manually obtained reference transcriptions. This paper investigates fundamental problems for unsupervised segmentation algorithms by comparing a phone segmentation obtained using only the acoustic information present in the signal with a reference segmentation created by human transcribers. The analyses of the output of an unsupervised speech segmentation method that uses acoustic change to hypothesize boundaries showed that acoustic change is a fairly good indicator of segment boundaries: over two-thirds of the hypothesized boundaries coincide with segment boundaries. Statistical analyses showed that the errors are related to segment duration, sequences of similar segments, and inherently dynamic phones. In order to improve unsupervised automatic speech segmentation, current one-stage bottom-up segmentation methods should be expanded into two-stage segmentation methods that are able to use a mix of bottom-up information extracted from the speech signal and automatically derived top-down information. In this way, unsupervised methods can be improved while remaining flexible and language-independent.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Journal of the Acoustical Society of America	Publication Date: Feb 1, 2010
Citations: 58	License type: other-oa

R Discovery Prime

R Discovery Prime

Unsupervised speech segmentation: An analysis of the hypothesized phone boundaries

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Unsupervised Hierarchical Semantic Segmentation with Multiview Cosegmentation and Clustering Transformers
Tsung-Wei Ke ... Jyh-Jing Hwang
-
Tsung-Wei Ke, et. al.Tsung-Wei Ke ... Jyh-Jing Hwang
01 Jun 2022
01 Jun 2022

Unsupervised Segmentation of RGB-D Images
Zhuo Deng ... Longin Jan Latecki
-
Zhuo Deng, et. al.Zhuo Deng ... Longin Jan Latecki
01 Jan 2015
01 Jan 2015

Unsupervised Segmentation Method for Color Image Based on MRF
Yimin Hou ... Wei Meng
-
Yimin Hou, et. al.Yimin Hou ... Wei Meng
01 Jun 2009
01 Jun 2009

Segmentation Based Interest Points and Evaluation of Unsupervised Image Segmentation Methods
Piotr Koniusz ... Krystian Mikolajczyk
-
Piotr Koniusz, et. al.Piotr Koniusz ... Krystian Mikolajczyk
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unsupervised speech segmentation: An analysis of the hypothesized phone boundaries

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America