Named Entity Recognition Methods Research Articles

In recent years, as cybersecurity threats have become increasingly severe and cyberattacks have occurred frequently, higher requirements have been put forward for cybersecurity protection. Therefore, the Named Entity Recognition (NER) technique, which is the cornerstone of Cyber Threat Intelligence (CTI) analysis, is particularly important. However, most existing NER studies are limited to recognizing single-layer flat entities, ignoring the possible nested entities in CTI. On the other hand, most of the existing studies focus on English CTIs, and the existing models performed poorly in a limited number of Chinese CTI studies. Given the above challenges, we propose in this paper a novel unified model, RBTG, which aims to identify flat and nested entities in Chinese CTI effectively. To overcome the difficult boundary recognition problem and the direction-dependent and distance-dependent properties in Chinese CTI NER, we use Global Pointer as the decoder and TENER as the encoder layer, respectively. Specifically, the Global Pointer layer solves the problem of the insensitivity of general NER methods to entity boundaries by utilizing the relative position information and the multiplicative attention mechanism. The TENER layer adapts to the Chinese CTI NER task by introducing an attention mechanism with direction awareness and distance awareness. Meanwhile, to cope with the complex feature capture of hierarchical structure and dependencies among Chinese CTI nested entities, the TENER layer solves the problem by following the structure of multiple self-attention layers and feed-forward network layers superimposed on each other in the Transformer. In addition, to fill the gap in the Chinese CTI nested entity dataset, we further apply the Large Language Modeling (LLM) technique and domain knowledge to construct a high-quality Chinese CTI nested entity dataset, CDTinee, which consists of six entity types selected from STIX, including nearly 4000 entity types extracted from more than 3000 threatening sentences. In the experimental session, we conduct extensive experiments on multiple datasets, and the results show that the proposed model RBTG outperforms the baseline model in both flat NER and nested NER.

Read full abstract

Materials science is an interdisciplinary field that studies the properties, structures, and behaviors of different materials. A large amount of scientific literature contains rich knowledge in the field of materials science, but manually analyzing these papers to find material-related data is a daunting task. In information processing, named entity recognition (NER) plays a crucial role as it can automatically extract entities in the field of materials science, which have significant value in tasks such as building knowledge graphs. The typically used sequence labeling methods for traditional named entity recognition in material science (MatNER) tasks often fail to fully utilize the semantic information in the dataset and cannot effectively extract nested entities. Herein, we proposed to convert the sequence labeling task into a machine reading comprehension (MRC) task. MRC method effectively can solve the challenge of extracting multiple overlapping entities by transforming it into the form of answering multiple independent questions. Moreover, the MRC framework allows for a more comprehensive understanding of the contextual information and semantic relationships within materials science literature, by integrating prior knowledge from queries. State-of-the-art (SOTA) performance was achieved on the Matscholar, BC4CHEMD, NLMChem, SOFC, and SOFC-Slot datasets, with F1-scores of 89.64%, 94.30%, 85.89%, 85.95%, and 71.73%, respectively in MRC approach. By effectively utilizing semantic information and extracting nested entities, this approach holds great significance for knowledge extraction and data analysis in the field of materials science, and thus accelerating the development of material science.Scientific contributionWe have developed an innovative NER method that enhances the efficiency and accuracy of automatic entity extraction in the field of materials science by transforming the sequence labeling task into a MRC task, this approach provides robust support for constructing knowledge graphs and other data analysis tasks.

Read full abstract

Named Entity Recognition Methods Research Articles

Related Topics

Articles published on Named Entity Recognition Methods

Biomedical named entity recognition using improved green anaconda-assisted Bi-GRU-based hierarchical ResNet model

Template-Free Prompting for Few-Shot Named Entity Recognition via Semantic-Enhanced Contrastive Learning.

Vocabulary-Enhanced Named Entity Recognition and its Application on Distribution Network Maintenance

PromptCNER: A Segmentation-based Method for Few-shot Chinese NER with Prompt-tuning

A Chinese named entity recognition method for landslide geological disasters based on deep learning

A Unified Model for Chinese Cyber Threat Intelligence Flat Entity and Nested Entity Recognition

A multimodal approach for few-shot biomedical named entity recognition in low-resource languages

Dual Contrastive Learning for Cross-Domain Named Entity Recognition

HiNER: Hierarchical feature fusion for Chinese named entity recognition

WITHDRAWN: TourismNER: A tourism named entity recognition method based on entity boundary joint prediction

Chinese EMR Named Entity Recognition Using Fused Label Relations Based on Machine Reading Comprehension Framework.

Improving dictionary-based named entity recognition with deep learning.

Transformer-based Named Entity Recognition for Clinical Cancer Drug Toxicity by Positive-unlabeled Learning and KL Regularizers

Class-Imbalanced-Aware Distantly Supervised Named Entity Recognition.

A novel prompting method for few-shot NER via LLMs

A New Chinese Named Entity Recognition Method for Pig Disease Domain Based on Lexicon-Enhanced BERT and Contrastive Learning

Research on named entity recognition method for high-speed railway technology transformation project text

Application of machine reading comprehension techniques for named entity recognition in materials science

Improved XLNet modeling for Chinese named entity recognition of edible fungus.

Chinese Named Entity Recognition method based on multi-feature fusion and biaffine

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Named Entity Recognition Methods Research Articles

Related Topics

Articles published on Named Entity Recognition Methods

Biomedical named entity recognition using improved green anaconda-assisted Bi-GRU-based hierarchical ResNet model

Template-Free Prompting for Few-Shot Named Entity Recognition via Semantic-Enhanced Contrastive Learning.

Vocabulary-Enhanced Named Entity Recognition and its Application on Distribution Network Maintenance

PromptCNER: A Segmentation-based Method for Few-shot Chinese NER with Prompt-tuning

A Chinese named entity recognition method for landslide geological disasters based on deep learning

A Unified Model for Chinese Cyber Threat Intelligence Flat Entity and Nested Entity Recognition

A multimodal approach for few-shot biomedical named entity recognition in low-resource languages

Dual Contrastive Learning for Cross-Domain Named Entity Recognition

HiNER: Hierarchical feature fusion for Chinese named entity recognition

WITHDRAWN: TourismNER: A tourism named entity recognition method based on entity boundary joint prediction

Chinese EMR Named Entity Recognition Using Fused Label Relations Based on Machine Reading Comprehension Framework.

Improving dictionary-based named entity recognition with deep learning.

Transformer-based Named Entity Recognition for Clinical Cancer Drug Toxicity by Positive-unlabeled Learning and KL Regularizers

Class-Imbalanced-Aware Distantly Supervised Named Entity Recognition.

A novel prompting method for few-shot NER via LLMs

A New Chinese Named Entity Recognition Method for Pig Disease Domain Based on Lexicon-Enhanced BERT and Contrastive Learning

Research on named entity recognition method for high-speed railway technology transformation project text

Application of machine reading comprehension techniques for named entity recognition in materials science

Improved XLNet modeling for Chinese named entity recognition of edible fungus.

Chinese Named Entity Recognition method based on multi-feature fusion and biaffine