Advances In Natural Language Processing Research Articles

Recent advances in natural language processing (NLP) have heightened the interest of the medical community in its application to health care in general, in particular to stroke, a medical emergency of great impact. In this rapidly evolving context, it is necessary to learn and understand the experience already accumulated by the medical and scientific community. The aim of this scoping review was to explore the studies conducted in the last 10 years using NLP to assist the management of stroke emergencies so as to gain insight on the state of the art, its main contexts of application, and the software tools that are used. Data were extracted from Scopus and Medline through PubMed, using the keywords "natural language processing" and "stroke." Primary research questions were related to the phases, contexts, and types of textual data used in the studies. Secondary research questions were related to the numerical and statistical methods and the software used to process the data. The extracted data were structured in tables and their relative frequencies were calculated. The relationships between categories were analyzed through multiple correspondence analysis. Twenty-nine papers were included in the review, with the majority being cohort studies of ischemic stroke published in the last 2 years. The majority of papers focused on the use of NLP to assist in the diagnostic phase, followed by the outcome prognosis, using text data from diagnostic reports and in many cases annotations on medical images. The most frequent approach was based on general machine learning techniques applied to the results of relatively simple NLP methods with the support of ontologies and standard vocabularies. Although smaller in number, there has been an increasing body of studies using deep learning techniques on numerical and vectorized representations of the texts obtained with more sophisticated NLP tools. Studies focused on NLP applied to stroke show specific trends that can be compared to the more general application of artificial intelligence to stroke. The purpose of using NLP is often to improve processes in a clinical context rather than to assist in the rehabilitation process. The state of the art in NLP is represented by deep learning architectures, among which Bidirectional Encoder Representations from Transformers has been found to be especially widely used in the medical field in general, and for stroke in particular, with an increasing focus on the processing of annotations on medical images.

Read full abstract

With the growing volume and complexity of laboratory repositories, it has become tedious to parse unstructured data into structured and tabulated formats for secondary uses such as decision support, quality assurance, and outcome analysis. However, advances in natural language processing (NLP) approaches have enabled efficient and automated extraction of clinically meaningful medical concepts from unstructured reports. In this study, we aimed to determine the feasibility of using the NLP model for information extraction as an alternative approach to a time-consuming and operationally resource-intensive handcrafted rule-based tool. Therefore, we sought to develop and evaluate a deep learning-based NLP model to derive knowledge and extract information from text-based laboratory reports sourced from a provincial laboratory repository system. The NLP model, a hierarchical multilabel classifier, was trained on a corpus of laboratory reports covering testing for 14 different respiratory viruses and viral subtypes. The corpus includes 87,500 unique laboratory reports annotated by 8 subject matter experts (SMEs). The classification task involved assigning the laboratory reports to labels at 2 levels: 24 fine-grained labels in level 1 and 6 coarse-grained labels in level 2. A "label" also refers to the status of a specific virus or strain being tested or detected (eg, influenza A is detected). The model's performance stability and variation were analyzed across all labels in the classification task. Additionally, the model's generalizability was evaluated internally and externally on various test sets. Overall, the NLP model performed well on internal, out-of-time (pre-COVID-19), and external (different laboratories) test sets with microaveraged F1-scores >94% across all classes. Higher precision and recall scores with less variability were observed for the internal and pre-COVID-19 test sets. As expected, the model's performance varied across categories and virus types due to the imbalanced nature of the corpus and sample sizes per class. There were intrinsically fewer classes of viruses being detected than those tested; therefore, the model's performance (lowest F1-score of 57%) was noticeably lower in the detected cases. We demonstrated that deep learning-based NLP models are promising solutions for information extraction from text-based laboratory reports. These approaches enable scalable, timely, and practical access to high-quality and encoded laboratory data if integrated into laboratory information system repositories.

Read full abstract

Advances In Natural Language Processing Research Articles

Related Topics

Articles published on Advances In Natural Language Processing

Structuring Information from Plant Morphological Descriptions using Open Information Extraction

Advances in Natural Language Processing A Thorough Examination

Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey

Applications of Natural Language Processing for the Management of Stroke Disorders: Scoping Review.

How ChatGPT (AI) is likely to become a Potential Threat (or not) to Human Imagination and Creativity?

Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

An empiric validation of linguistic features in machine learning models for fake news detection

Automated Patient Note Grading: Examining Scoring Reliability and Feasibility.

Exploring the Role of Emotions in Arabic Rumor Detection in Social Media

Masked Language Modeling for Resource Constrained Biological Natural Language Processing.

Collective events and individual affect shape autobiographical memory

Deep lexical hypothesis: Identifying personality structure in natural language.

Children in-conflict chatbot system using natural language processing technique

Solving Math Word Problems concerning Systems of Equations with GPT-3

The Potential of Visual ChatGPT for Remote Sensing

Expanding the methodological toolbox: Machine-based item desirability ratings as an alternative to human-based ratings

Predicting implicit attitudes with natural language data

Natural Language Processing for Clinical Laboratory Data Repository Systems: Implementation and Evaluation for Respiratory Viruses.

Catch Me If You Can: Deceiving Stance Detection and Geotagging Models to Protect Privacy of Individuals on Twitter

Research and implementation of visual question and answer system based on deep learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Advances In Natural Language Processing Research Articles

Related Topics

Articles published on Advances In Natural Language Processing

Structuring Information from Plant Morphological Descriptions using Open Information Extraction

Advances in Natural Language Processing A Thorough Examination

Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey

Applications of Natural Language Processing for the Management of Stroke Disorders: Scoping Review.

How ChatGPT (AI) is likely to become a Potential Threat (or not) to Human Imagination and Creativity?

Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

An empiric validation of linguistic features in machine learning models for fake news detection

Automated Patient Note Grading: Examining Scoring Reliability and Feasibility.

Exploring the Role of Emotions in Arabic Rumor Detection in Social Media

Masked Language Modeling for Resource Constrained Biological Natural Language Processing.

Collective events and individual affect shape autobiographical memory

Deep lexical hypothesis: Identifying personality structure in natural language.

Children in-conflict chatbot system using natural language processing technique

Solving Math Word Problems concerning Systems of Equations with GPT-3

The Potential of Visual ChatGPT for Remote Sensing

Expanding the methodological toolbox: Machine-based item desirability ratings as an alternative to human-based ratings

Predicting implicit attitudes with natural language data

Natural Language Processing for Clinical Laboratory Data Repository Systems: Implementation and Evaluation for Respiratory Viruses.

Catch Me If You Can: Deceiving Stance Detection and Geotagging Models to Protect Privacy of Individuals on Twitter

Research and implementation of visual question and answer system based on deep learning