The ever-growing volume of digital information calls for innovative search strategies that retrieve the necessary data efficiently and economically. The urgency of the problem is underscored by the increasing complexity of information landscapes and the need for fast data-extraction methodologies. In natural language processing, named entity recognition (NER) is an essential task for extracting useful information from unstructured text and classifying it into predefined categories. Nevertheless, conventional methods frequently struggle when labeled data is scarce, which poses challenges in real-world scenarios where obtaining substantial annotated datasets is difficult or costly. To address the problem of domain-specific NER with limited data, this work investigates NER techniques that can overcome these constraints by continually learning from newly collected data on top of pre-trained models. Several techniques are also applied to make the most of the limited labeled data, such as active learning, exploiting unlabeled data, and integrating domain knowledge. Using domain-specific datasets with different levels of annotation scarcity, we investigate the fine-tuning of pre-trained models, including transformer-based (TRF) and Tok2Vec (token-to-vector) models. The results show that, in general, expanding the volume of training data improves most models' NER performance, particularly for models with sufficient learning capacity. Depending on the model architecture and the complexity of the entity label being learned, the effect of additional data on performance can vary. After the training data is increased by 20%, the LT2V model shows the most balanced gains overall, improving accuracy by 11% to recognize 73% of entities while maintaining processing speed. Meanwhile, the transformer-based (TRF) model, with consistent processing speed and the highest F1-score, shows promise for effective learning from less data, achieving 74% successful predictions and a 7% increase in performance after the training data is expanded to 81%. Our results pave the way for more resilient and efficient NER systems suited to specialized domains and advance the field of domain-specific NER with sparse data. We also shed light on the relative merits of various NER models and training strategies, and offer perspectives for future research.
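The abstract does not name the tooling, but the TRF and Tok2Vec component names match spaCy's pipeline terminology. The sketch below is therefore an illustrative assumption of the fine-tuning step, not the paper's actual setup: it updates the NER component of a pre-trained spaCy pipeline on a small domain-specific dataset. The model name, the FACILITY label, and the single training example are placeholders.

```python
import random
import spacy
from spacy.training import Example

# Load a pre-trained pipeline; the small/medium/large English models use a
# CNN/tok2vec architecture, while "en_core_web_trf" is transformer-based.
nlp = spacy.load("en_core_web_sm")

# Hypothetical domain-specific annotations:
# (text, {"entities": [(start_char, end_char, label)]})
TRAIN_DATA = [
    ("Order 42 ships from the Hamburg depot.",
     {"entities": [(24, 37, "FACILITY")]}),
]

# Register any new entity labels with the existing NER component.
ner = nlp.get_pipe("ner")
for _, ann in TRAIN_DATA:
    for _, _, label in ann["entities"]:
        ner.add_label(label)

# Update only the NER component, keeping the rest of the pipeline frozen.
with nlp.select_pipes(enable=["ner"]):
    optimizer = nlp.resume_training()
    for epoch in range(10):
        random.shuffle(TRAIN_DATA)
        losses = {}
        for text, ann in TRAIN_DATA:
            example = Example.from_dict(nlp.make_doc(text), ann)
            nlp.update([example], sgd=optimizer, losses=losses)
        print(f"epoch {epoch}: losses={losses}")
```

In the active-learning setting the abstract describes, a loop of this kind would be repeated, with the most uncertain unlabeled examples selected for annotation before each update round.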