Neural Network Based Language Models Research Articles

Transportation agency personnel gain valuable knowledge through their work, but such knowledge is lost if it is not documented properly after the worker leaves the organization. The risk of losing institutional knowledge is a current problem at state departments of transportation, including the North Carolina Department of Transportation (NCDOT), due to high personnel turnover. State transportation agencies have implemented knowledge repositories in the form of lessons learned/best practices databases to address this problem. However, motivating end-users to use such databases is challenging. This paper addresses this challenge through novel artificial intelligence technology whereby a neural network–based language model is implemented as part of the NCDOT’s new knowledge management program: Communicate Lessons, Exchange Advice, Record (CLEAR). The CLEAR program encompasses a database of lessons learned/best practices and a website to access and search the database. The developed methodology involves training a language model on transportation construction texts and using that trained model in a novel algorithm enabling users to search the CLEAR database easily. The developed language-processing model provides an easily accessible interface to suggest the most relevant CLEAR data based on the end-user’s searched keywords. The model learns an inference model of construction domain–specific vocabulary extracted from various sources, such as contract documents, textbooks, and specifications, to make meaningful connections between lessons learned/best practices in the CLEAR database and project-specific knowledge. The developed model has been validated by project managers for projects at various life cycle stages. The automation of information retrieval is intended to encourage NCDOT personnel to use and embrace the CLEAR program as part of their routine work to improve project workflow. In the long run, the NCDOT will benefit from consistent usage of the CLEAR program and its high quality content, thereby leading to enhanced institutional knowledge and organizational innovation.

We propose an integrated end-to-end automatic speech recognition (ASR) paradigm by joint learning of the front-end speech signal processing and back-end acoustic modeling. We believe that “only good signal processing can lead to top ASR performance” in challenging acoustic environments. This notion leads to a unified deep neural network (DNN) framework for distant speech processing that can achieve both high-quality enhanced speech and high-accuracy ASR simultaneously. Our goal is accomplished by two techniques, namely: (i) a reverberation-time-aware DNN based speech dereverberation architecture that can handle a wide range of reverberation times to enhance speech quality of reverberant and noisy speech, followed by (ii) DNN-based multicondition training that takes both clean-condition and multicondition speech into consideration, leveraging upon an exploitation of the data acquired and processed with multichannel microphone arrays, to improve ASR performance. The final end-to-end system is established by a joint optimization of the speech enhancement and recognition DNNs. The recent REverberant Voice Enhancement and Recognition Benchmark (REVERB) Challenge task is used as a test bed for evaluating our proposed framework. We first report on superior objective measures in enhanced speech to those listed in the 2014 REVERB Challenge Workshop on the simulated data test set. Moreover, we obtain the best single-system word error rate (WER) of 13.28% on the 1-channel REVERB simulated data with the proposed DNN-based pre-processing algorithm and clean-condition training. Leveraging upon joint training with more discriminative ASR features and improved neural network based language models, a low single-system WER of 4.46% is attained. Next, a new multi-channel-condition joint learning and testing scheme delivers a state-of-the-art WER of 3.76% on the 8-channel simulated data with a single ASR system. Finally, we also report on a preliminary yet promising experimentation with the REVERB real test data.

Neural Network Based Language Models Research Articles

Related Topics

Articles published on Neural Network Based Language Models

A Comparative Study of Khasi Speech Recognition Systems with Recurrent Neural Network-Based Language Model

Grammatical versus Spelling Error Correction: An Investigation into the Responsiveness of Transformer-Based Language Models Using BART and MarianMT

Exploring Social Biases of Large Language Models in a College Artificial Intelligence Course

Methodical Systematic Review of Abstractive Summarization and Natural Language Processing Models for Biomedical Health Informatics: Approaches, Metrics and Challenges

Developing a Construction Domain–Specific Artificial Intelligence Language Model for NCDOT’s CLEAR Program to Promote Organizational Innovation and Institutional Knowledge

Demystifying GPT and GPT-3: How they can support innovators to develop new digital accessibility solutions and assistive technologies?

Do Semantic Vectors Contain Traces of Biophilic Connections Between Nature and Mental Health?

Anonymization of German financial documents using neural network-based language models with contextual word representations

Sentiment Analysis with Cognitive Attention Supervision

English

Neural candidate-aware language models for speech recognition

HLHLp: Quantized Neural Networks Training for Reaching Flat Minima in Loss Surface

Analysis of Neural Network Based Language Modeling

Analysis of Neural Network Based Language Modeling

Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition.

User identification via neural network based language models

Enhancing recurrent neural network-based language models by word tokenization

An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition

Automatic estimation of extra-linguistic information in speech and its integration into recurrent neural network-based language models for speech recognition

워드 임베딩과 품사 태깅을 이용한 클래스 언어모델 연구

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Neural Network Based Language Models Research Articles

Related Topics

Articles published on Neural Network Based Language Models

A Comparative Study of Khasi Speech Recognition Systems with Recurrent Neural Network-Based Language Model

Grammatical versus Spelling Error Correction: An Investigation into the Responsiveness of Transformer-Based Language Models Using BART and MarianMT

Exploring Social Biases of Large Language Models in a College Artificial Intelligence Course

Methodical Systematic Review of Abstractive Summarization and Natural Language Processing Models for Biomedical Health Informatics: Approaches, Metrics and Challenges

Developing a Construction Domain–Specific Artificial Intelligence Language Model for NCDOT’s CLEAR Program to Promote Organizational Innovation and Institutional Knowledge

Demystifying GPT and GPT-3: How they can support innovators to develop new digital accessibility solutions and assistive technologies?

Do Semantic Vectors Contain Traces of Biophilic Connections Between Nature and Mental Health?

Anonymization of German financial documents using neural network-based language models with contextual word representations

Sentiment Analysis with Cognitive Attention Supervision

English

Neural candidate-aware language models for speech recognition

HLHLp: Quantized Neural Networks Training for Reaching Flat Minima in Loss Surface

Analysis of Neural Network Based Language Modeling

Analysis of Neural Network Based Language Modeling

Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition.

User identification via neural network based language models

Enhancing recurrent neural network-based language models by word tokenization

An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition

Automatic estimation of extra-linguistic information in speech and its integration into recurrent neural network-based language models for speech recognition

워드 임베딩과 품사 태깅을 이용한 클래스 언어모델 연구