High quality recordings and transcriptions of speech via remote platforms

Jonathan Wright,Christopher Cieri,Neville Ryant,James Fiumara,Jeremy Zehr,Robert Parker,Mark Liberman

doi:10.1121/10.0008577

Abstract

High quality recordings and transcriptions of speech are important to a wide variety of disciplines from linguistics and human language technology to biomedical screening, diagnosis and tracking. The wide availability of internet connections and powerful mobile devices offers low cost opportunities for collecting speech data at scale. But even in the era of open source software, numerous challenges remain. For example, video call applications such as Zoom have become widespread and allow for recording, but only provide lossy codecs with low frame rates, and often contain missing, repeated, or interpolated frames, as well as freezes, longer dropouts and other audio artefacts. LDC has recently developed a suite of tools to allow high quality internet-based audio recordings and transcription, with a premium on portability and flexibility, using secure cloud computing services for storage and back-end processing. We present here the current design and capabilities of our software, as well as availability in terms of open source code and app distribution. We also discuss future plans; planned capabilities include 2 + sided conversational recordings, connecting participants via the internet, a modern extension of Conversational Telephone Speech (CTS) collections.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

High quality recordings and transcriptions of speech via remote platforms

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system
S Matsoukas ... Bing Xiang
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14
S Matsoukas, et. al.S Matsoukas ... Bing Xiang
01 Sep 2006
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14

Automatic transcription of conversational telephone speech
T Hain ... D Povey
IEEE Transactions on Speech and Audio Processing | VOL. 13
T Hain, et. al.T Hain ... D Povey
01 Nov 2005
IEEE Transactions on Speech and Audio Processing | VOL. 13

Comparing Human and Machine Errors in Conversational Speech Transcription
Andreas Stolcke ... Jasha Droppo
-
Andreas Stolcke, et. al.Andreas Stolcke ... Jasha Droppo
20 Aug 2017
20 Aug 2017

New features in the CU-HTK system for transcription of conversational telephone speech
T Hain ... P.C Woodland
-
T Hain, et. al.T Hain ... P.C Woodland
01 May 2001
01 May 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

High quality recordings and transcriptions of speech via remote platforms

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America