Text Segmentation Research Articles

Machine reading comprehension (MRC) refers to the process of instructing machines to comprehend and respond to inquiries based on a provided text. There are two primary methodologies for achieving this: extracting answers directly from the text or predicting them. Extracting answers involves predicting the specific segment of text containing the answer, pinpointed by its starting and ending indices within the paragraph. Despite the increasing interest in MRC, research on Arabic remains limited by several challenges. A significant impediment is the scarcity of Arabic textual resources, which hinders the development of effective models. Furthermore, the inherent intricacies of Arabic, manifested in its diverse linguistic forms (classical, modern standard, and colloquial), present distinctive hurdles for language comprehension tasks. This paper proposes an enhanced version of the bidirectional attention flow (BIDAF) model for Arabic MRC, built on the Arabic Span-Extraction-based Reading Comprehension Benchmark (ASER). ASER comprises 10,000 sets of questions, answers, and passages, partitioned into a training set constituting 90% of the data and a testing set making up the remaining 10%. Introducing a new input feature based on parts-of-speech (POS) word embeddings and replacing the bidirectional long short-term memory (bi-LSTM) with a bidirectional gated recurrent unit yielded significant improvements. Eight different POS word embeddings were generated using both Continuous Bag of Words (CBOW) and Skip-gram methods, with varying dimensionalities. Evaluation metrics, including exact match (EM) and F1-measure, were used to assess model performance, with emphasis on the latter for its accuracy. The proposed enhanced BIDAF model achieved an accuracy of 75.22% on the ASER dataset, demonstrating its efficacy in Arabic MRC tasks. Additionally, statistical evaluation using a two-tailed paired samples t-test further validated the findings, highlighting the significance of the proposed enhancements in advancing Arabic language processing capabilities.
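
The abstract describes answers as spans identified by start and end indices, scored with exact match (EM) and F1. The following is a minimal sketch of how such scoring is commonly done for span-extraction tasks, not the paper's own implementation. It assumes whitespace tokenization and token-overlap F1; the function names are hypothetical, and any Arabic-specific normalization the authors may apply is not shown because the abstract does not specify it.

```python
from collections import Counter


def extract_span(passage_tokens: list[str], start: int, end: int) -> str:
    """Return the answer text for an inclusive (start, end) token span."""
    return " ".join(passage_tokens[start:end + 1])


def exact_match(prediction: str, reference: str) -> int:
    """Return 1 if both answers contain the same tokens in the same order."""
    return int(prediction.split() == reference.split())


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = prediction.split()
    ref_tokens = reference.split()
    if not pred_tokens or not ref_tokens:
        # Two empty answers agree; one empty answer scores zero.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


passage = "the model was trained on ten thousand examples".split()
predicted = extract_span(passage, 5, 6)   # "ten thousand"
reference = extract_span(passage, 5, 7)   # "ten thousand examples"
print(exact_match(predicted, reference), token_f1(predicted, reference))
```

A partially correct span scores zero on EM but earns partial credit on F1, which is one reason span-extraction work often reports both.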

Incidental findings of aortic aneurysms (AAs) often go unreported, and established patients are frequently lost to follow-up. Natural language processing (NLP) offers a promising solution to address these issues. While rule-based NLP methods have shown some success, recent advancements in transformer-based large language models (LLMs) remain underutilized. This study has three aims: (1) to evaluate the effectiveness of our transformer-based NLP pipeline for AA detection; (2) to detail the clinical impact by quantifying the number of patients who could benefit from such technology; and (3) to use this information to help coordinate appointments with patients, ensuring proper monitoring and management. A total of 3,229 radiology reports were divided into three batches with varying class balance. Each entry was processed through our NLP pipeline, where it was fragmented using regular expression (regex) functions to isolate relevant textual segments. These segments were subsequently processed through our "question and find" (Q&F) function, powered by Google's BERT, a well-established transformer LLM. This Q&F function extracted aortic diameter measurements, flagging measurements that exceeded a predefined threshold. Following detection, we conducted comprehensive chart reviews and contacted primary care providers (PCPs) and patients to categorize aneurysms as "known" or "incidental." We also assessed whether patients with known aneurysms were adhering to regular yearly screenings and coordinated follow-up appointments. Evaluation of the three batches showed high F1 scores: 99.4% (95% CI [98.5-100]), 96.7% (95% CI [95.0-98.2]), and 98.9% (95% CI [98.0-99.6]). Overall measurement accuracy was 98.9% (95% CI [97.6-100]), 99.6% (95% CI [99.3-99.9]), and 98.1% (95% CI [96.8-99.4]). Compared to manual chart reviews, the NLP system demonstrated superior accuracy, with fewer errors in each batch: 12 vs. 22 (p=0.084), 47 vs. 98 (p=0.000021), and 31 vs. 53 (p=0.015). Of the 412 patients investigated, 58 (14.1%) involved incidental findings, 54 patients (15.3%) were lost to follow-up, 39 patients (55.7%) were successfully contacted, and 37 follow-up appointments (12.1%) were successfully coordinated. The high-performance metrics from our study demonstrate that transformer-based NLP can enhance aortic aneurysm surveillance. Our subsequent comprehensive patient profiling highlighted the need for such a system as a safety net within the electronic medical record (EMR), systematically reviewing radiology reports to detect incidental findings and patients lost to follow-up. This ensures appropriate referrals and monitoring, improving patient outcomes and healthcare efficiency through timely clinical interventions.
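
The pipeline described above fragments each report with regex, extracts aortic diameters, and flags values above a threshold. The sketch below illustrates only that overall flow and makes several assumptions: the study extracts measurements with a BERT-based Q&F function, whereas this example substitutes a plain regex as a stand-in; the 3.0 cm threshold is a hypothetical placeholder because the abstract does not state the cutoff; and all function and pattern names are invented for illustration.

```python
import re

# Hypothetical cutoff in centimetres; the study's threshold is not stated.
THRESHOLD_CM = 3.0

AORTA_PATTERN = re.compile(r"aort", re.IGNORECASE)
# Stand-in for the BERT-based Q&F step: a number followed by cm or mm.
MEASUREMENT_PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*(cm|mm)\b", re.IGNORECASE)


def aortic_segments(report: str) -> list[str]:
    """Split a report into sentences and keep those that mention the aorta."""
    sentences = re.split(r"(?<=[.!?])\s+", report.strip())
    return [s for s in sentences if AORTA_PATTERN.search(s)]


def diameters_cm(segment: str) -> list[float]:
    """Return every measurement in the segment, converted to centimetres."""
    values = []
    for number, unit in MEASUREMENT_PATTERN.findall(segment):
        value = float(number)
        values.append(value / 10 if unit.lower() == "mm" else value)
    return values


def flag_report(report: str, threshold_cm: float = THRESHOLD_CM) -> list[float]:
    """Return aortic measurements in the report that exceed the threshold."""
    return [
        diameter
        for segment in aortic_segments(report)
        for diameter in diameters_cm(segment)
        if diameter > threshold_cm
    ]


report = (
    "Lungs are clear. The abdominal aorta measures 4.2 cm in diameter. "
    "Liver lesion of 12 mm noted."
)
print(flag_report(report))
```

Restricting extraction to sentences that mention the aorta keeps unrelated measurements, such as the liver lesion in the sample report, from being flagged.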

Articles published on Text Segmentation

Improved bidirectional attention flow (BIDAF) model for Arabic machine reading comprehension

Restoration and Segmentation of Old Jawi Manuscripts using Variational Image Inpainting and Active Contour Models

Application of large language models to intelligently analyze long construction contract texts

Functional Analysis of Thematic Approach to the Division of Surahs in Two English Contemporary Translations of the Qur’an

Journalism Versus Churnalism: How News Factors in Press Releases Affect Journalistic Processing of Ocean Plastic Research in Newspapers Globally

Enhancing Aortic Aneurysm Surveillance: Transformer NLP for Flagging and Measuring in Radiology Reports

Deep handwritten diagram segmentation

Quantifying decision support level of explainable automatic classification of diagnoses in Spanish medical records

User Stress Detection Using Social Media Text: A Novel Machine Learning Approach

A novel masking model for Buddhist literature understanding by using Generative Adversarial Networks

A Comprehensive Natural Language Processing Pipeline for the Chronic Lupus Disease.

Enhancing privacy policy comprehension through Privacify: A user-centric approach using advanced language models

Optical Character Recognition of Balochi Script

Immersive e-learning application in intelligent teaching of English composition based on neural network algorithm

ANALYSIS OF FUNCTIONAL SPEECH ADAPTATION STRATEGIES IN MILITARY TRANSLATION

What do Contemporary Publications Report about the Generation of Urban Solid Waste (MSW) and/or Consumption from the Perspective of Chemistry Teaching?

AraFast: Developing and Evaluating a Comprehensive Modern Standard Arabic Corpus for Enhanced Natural Language Processing

Street office and care practices: emerging conceptions and reflexivities

Clinical research text summarization method based on fusion of domain knowledge

Application of elementary probability models for text homogeneity and segmentation: A case study of Bible
