A development of a speech data transcription tool for building a spoken corpus

Yeonguk You, Yongjin Kwak, Yunsoo Kim, Jaeeun Park, Hyangrae Noh, Yoonjoong Kim

doi:10.1109/ictc.2018.8539450

Publication Date: Oct 1, 2018

Citations: 1

Affiliation: Hanbat National University

#Speech Transcription #Transcription Process

Abstract

In this study, we developed a speech data transcription tool that integrates speech segmentation, speaker classification, speech transcription, and editing in order to shorten the time needed to transcribe audio data. The system converts speech data into standardized transcription data that serves as input to a spoken corpus construction system. The speech segmentation and speaker classification processes were developed using deep learning techniques, and the transcription process uses the Google API. In an experiment comparing the tool with an existing workflow based on ELAN and a notepad tool, the proposed tool cut the processing time in half.
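
As a rough illustration of how the stages described in the abstract could fit together, the following is a minimal sketch rather than the authors' implementation. It assumes the segmentation model emits one speech/non-speech flag per fixed-length frame, and it treats speaker classification and transcription as pluggable callables. All names here (`segment`, `run_pipeline`, `to_records`, `Utterance`) and the tab-separated record layout are hypothetical; the abstract does not specify the tool's standardized format.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Utterance:
    start: float   # segment start, in seconds
    end: float     # segment end, in seconds
    speaker: str   # label from the (assumed) speaker classifier
    text: str      # output of the (assumed) transcription backend


def segment(speech_flags: List[bool], frame_sec: float) -> List[Tuple[float, float]]:
    """Merge runs of consecutive speech frames into (start, end) spans."""
    spans = []
    run_start = None
    for i, is_speech in enumerate(speech_flags):
        if is_speech and run_start is None:
            run_start = i
        elif not is_speech and run_start is not None:
            spans.append((run_start * frame_sec, i * frame_sec))
            run_start = None
    if run_start is not None:  # audio ended while still in speech
        spans.append((run_start * frame_sec, len(speech_flags) * frame_sec))
    return spans


def run_pipeline(
    spans: List[Tuple[float, float]],
    classify_speaker: Callable[[float, float], str],
    transcribe: Callable[[float, float], str],
) -> List[Utterance]:
    """Label and transcribe each span; both callables are stand-ins."""
    return [
        Utterance(start, end, classify_speaker(start, end), transcribe(start, end))
        for start, end in spans
    ]


def to_records(utterances: List[Utterance]) -> List[str]:
    """Serialize to a hypothetical tab-separated transcription record."""
    return [f"{u.start:.2f}\t{u.end:.2f}\t{u.speaker}\t{u.text}" for u in utterances]


if __name__ == "__main__":
    flags = [False, True, True, False, True]  # placeholder model output
    spans = segment(flags, frame_sec=0.5)
    utts = run_pipeline(
        spans,
        classify_speaker=lambda s, e: "S1" if s < 1.0 else "S2",  # stub
        transcribe=lambda s, e: "hello",                          # stub
    )
    print("\n".join(to_records(utts)))
```

In a real system the two stubs would be replaced by a trained speaker classifier and a call to a speech recognition service, and the editing step would operate on the resulting records before they are handed to the corpus construction system.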
