Transcript Of Lecture Research Articles

The potential of structural classification methods for automatic speech recognition (ASR) has been attracting the speech community since they can realize the unified modeling of acoustic and linguistic aspects of recognizers. However, the structural classification approaches involve well-known tradeoffs between the richness of features and the computational efficiency of decoders. If we are to employ, for example, a frame-synchronous one-pass decoding technique, features considered to calculate the likelihood of each hypothesis must be restricted to the same form as the conventional acoustic and language models. This paper tackles this limitation directly by exploiting the structure of the weighted finite-state transducers (WFSTs) used for decoding. Although WFST arcs provide rich contextual information, close integration with a computationally efficient decoding technique is still possible since most decoding techniques only require that their likelihood functions are factorizable for each decoder arc and time frame. In this paper, we compare two methods for structural classification with the WFST-based features; the structured perceptron and conditional random field (CRF) techniques. To analyze the advantages of these two classifiers, we present experimental results for the TIMIT continuous phoneme recognition task, the WSJ transcription task, and the MIT lecture transcription task. We confirmed that the proposed approach improved the ASR performance without sacrificing the computational efficiency of the decoders, even though the baseline systems are already trained with discriminative training techniques (e.g., MPE).

Latent Dirichlet allocation (LDA) is a new paradigm of topic model which is powerful to capture the latent topic information from natural language. However, the topic information in text streams, e.g. meeting recording, lecture transcription and conversational dialogue, are inherently heterogeneous and nonstationary without explicit boundaries. It is difficult to train a precise topic model from the observed text streams. Furthermore, the usage of words in different paragraphs within a document is varied with different composition styles. In this paper, we present a new hierarchical segmentation model (HSM) where the heterogeneous topic information in stream level and the word variations in document level are characterized. We incorporate the contextual topic information in stream-level segmentation. The topic similarity between sentences is used to form a beta distribution reflecting the prior knowledge of document boundaries in a text stream. The distribution of segmentation variable is adaptively updated to achieve flexible segmentation and is used to group coherent sentences into a topic-specific document. For each pseudo-document, we further use a Markov chain to detect the stylistic segments within a document. The words in a segment are accordingly generated by the same composition style, which differs from the style of the next segment. Each segment is represented by a Markov state, and so the word variations within a document are compensated. The whole model is trained by a variational Bayesian EM procedure and is evaluated on using TDT2 corpus. Experimental results show benefits by using the proposed HSM in terms of perplexity, segmentation error, detection accuracy and F measure.

Transcript Of Lecture Research Articles

Related Topics

Articles published on Transcript Of Lecture

Frances Fox Piven – Plenary Lecture UK Social Policy Association Annual Conference 8th July 2014

Sequential Linefeed Insertion into Lecture Transcriptions for Real‐Time Captioning

Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training

Was hat Husserl in Wien außerhalb von Brentanos Philosophie gelernt? Über die Einflüsse auf den frühen Husserl jenseits von Brentano und Bolzano

Two-dimensional packing problems in telecommunications

Using speech recognition for real-time captioning and lecture transcription in the classroom

Sequential Linefeed Insertion into Lecture Transcription for Real-time Captioning

Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition

22. Transcribe Your Class: Using Speech Recognition to Improve Access for At-Risk Students

Saying things that hurt

State Aid and Restrictions on Free Movement: Two Sides of the Same Coin?

Justice John Marshall Harlan: Lectures on Constitutional Law, 1897-98

Topic-Based Hierarchical Segmentation

大学講義におけるパソコン通訳の訳出率に及ぼす通訳者要因の影響

Comparative Use of Podcasts vs. Lecture Transcripts as Learning Aids for Dental Students

Hegel's Contested Legacy: Rethinking the Relation between Art History and Philosophy

Fire in 1788: The Closest Ally

行学院日朝の法華経談義書について

The use of okay, right and yeah in academic lectures by native speaker lecturers: Their ‘anticipated’ and ‘real’ meanings

Theology lectures as lexical environments: A case study of technical vocabulary use

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Transcript Of Lecture Research Articles

Related Topics

Articles published on Transcript Of Lecture

Frances Fox Piven – Plenary Lecture UK Social Policy Association Annual Conference 8th July 2014

Sequential Linefeed Insertion into Lecture Transcriptions for Real‐Time Captioning

Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training

Was hat Husserl in Wien außerhalb von Brentanos Philosophie gelernt? Über die Einflüsse auf den frühen Husserl jenseits von Brentano und Bolzano

Two-dimensional packing problems in telecommunications

Using speech recognition for real-time captioning and lecture transcription in the classroom

Sequential Linefeed Insertion into Lecture Transcription for Real-time Captioning

Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition

22. Transcribe Your Class: Using Speech Recognition to Improve Access for At-Risk Students

Saying things that hurt

State Aid and Restrictions on Free Movement: Two Sides of the Same Coin?

Justice John Marshall Harlan: Lectures on Constitutional Law, 1897-98

Topic-Based Hierarchical Segmentation

大学講義におけるパソコン通訳の訳出率に及ぼす通訳者要因の影響

Comparative Use of Podcasts vs. Lecture Transcripts as Learning Aids for Dental Students

Hegel's Contested Legacy: Rethinking the Relation between Art History and Philosophy

Fire in 1788: The Closest Ally

行学院日朝の法華経談義書について

The use of okay, right and yeah in academic lectures by native speaker lecturers: Their ‘anticipated’ and ‘real’ meanings

Theology lectures as lexical environments: A case study of technical vocabulary use