Chinese Named Entity Recognition Research Articles

In the rapidly evolving field of cybersecurity, the integration of multi-source, heterogeneous, and fragmented data into a coherent knowledge graph has garnered considerable attention. Such a graph elucidates semantic interconnections, thereby facilitating sophisticated analytical decision support. Central to the construction of a cybersecurity knowledge graph is Named Entity Recognition (NER), a critical technology that converts unstructured text into structured data. The efficacy of NER is pivotal, as it directly influences the integrity of the knowledge graph. The task of NER in cybersecurity, particularly within the Chinese linguistic context, presents distinct challenges. Chinese text lacks explicit space delimiters and features complex contextual dependencies, exacerbating the difficulty in discerning and categorizing named entities. These linguistic characteristics contribute to errors in word segmentation and semantic ambiguities, impeding NER accuracy. This paper introduces a novel NER methodology tailored for the Chinese cybersecurity corpus, termed CSBERT-IDCNN-BiLSTM-CRF. This approach harnesses Iterative Dilated Convolutional Neural Networks (IDCNN) for extracting local features, and Bi-directional Long Short-Term Memory networks (BiLSTM) for contextual understanding. It incorporates CSBERT, a pre-trained model adept at processing few-shot data, to derive input feature representations. The process culminates with Conditional Random Fields (CRF) for precise sequence labeling. To compensate for the scarcity of publicly accessible Chinese cybersecurity datasets, this paper synthesizes a bespoke dataset, authenticated by data from the China National Vulnerability Database, processed via the YEDDA annotation tool. Empirical analysis affirms that the proposed CSBERT-IDCNN-BiLSTM-CRF model surpasses existing Chinese NER frameworks, with an F1-score of 87.30% and a precision rate of 85.89%. This marks a significant advancement in the accurate identification of cybersecurity entities in Chinese text, reflecting the model’s robust capability to address the unique challenges presented by the language’s structural intricacies.

Read full abstract

The automatic extraction of key entities in mechanics problems is an important means to automatically solve mechanics problems. Nevertheless, for standard Chinese, compared with the open domain, mechanics problems have a large number of specialized terms and composite entities, which leads to a low recognition capability. Although recent research demonstrates that external information and pre-trained language models can improve the performance of Chinese Named Entity Recognition (CNER), few efforts have been made to combine the two to explore high-performance algorithms for extracting mechanics entities. Therefore, this article proposes a Multi-Meta Information Embedding Enhanced Bidirectional Encoder Representation from Transformers (MMIEE-BERT) for recognizing entities in mechanics problems. The proposed method integrates lexical information and radical information into BERT layers directly by employing an information adapter layer (IAL). Firstly, according to the characteristics of Chinese, a Multi-Meta Information Embedding (MMIE) including character embedding, lexical embedding, and radical embedding is proposed to enhance Chinese sentence representation. Secondly, an information adapter layer (IAL) is proposed to fuse the above three embeddings into the lower layers of the BERT. Thirdly, a Bidirectional Long Short-Term Memory (BiLSTM) network and a Conditional Random Field (CRF) model are applied to semantically encode the output of MMIEE-BERT and obtain each character’s label. Finally, extensive experiments were carried out on the dataset built by our team and widely used datasets. The results demonstrate that the proposed method has more advantages than the existing models in the entity recognition of mechanics problems, and the precision, recall, and F1 score were improved. The proposed method is expected to provide an automatic means for extracting key information from mechanics problems.

Read full abstract

Chinese Named Entity Recognition Research Articles

Related Topics

Articles published on Chinese Named Entity Recognition

Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network

Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer

A Chinese named entity recognition model incorporating recurrent cell and information state recursion

ATBBC: Named entity recognition in emergency domains based on joint BERT-BILSTM-CRF adversarial training

A Robust Chinese Named Entity Recognition Method Based on Integrating Dual-Layer Features and CSBERT

A template augmented distant supervision framework for Chinese named entity recognition

A Weakly Supervised Chinese Named Entity Recognition Method Combining First-Order Logic

Chinese named entity recognition method based on multiscale feature fusion

PGD-GP: A Chinese Named Entity Recognition Model for Constructing Food Safety Standard Knowledge Graph

Local or global? A novel transformer for Chinese named entity recognition based on multi-view and sliding attention

Chinese Named Entity Recognition Based on Boundary Enhancement with Multi-Class Information

EPIC: An epidemiological investigation of COVID-19 dataset for Chinese named entity recognition

MFF-CNER: A Multi-feature Fusion Model for Chinese Named Entity Recognition in Finance Securities

Multi-Meta Information Embedding Enhanced BERT for Chinese Mechanics Entity Recognition

CLART: A cascaded lattice-and-radical transformer network for Chinese medical named entity recognition

Chinese Named Entity Recognition in Football Based on ALBERT-BiLSTM Model

Improving Low-Resource Chinese Named Entity Recognition Using Bidirectional Encoder Representation from Transformers and Lexicon Adapter

A novel Data and Model Centric artificial intelligence based approach in developing high-performance Named Entity Recognition for Bengali Language.

MCA-NER: Multi-Contextualized Adversarial-Based Attentional Deep Neural Network for Named Entity Recognition

Chinese Named Entity Recognition Augmented with Lexicon Memory

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Chinese Named Entity Recognition Research Articles

Related Topics

Articles published on Chinese Named Entity Recognition

Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network

Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer

A Chinese named entity recognition model incorporating recurrent cell and information state recursion

ATBBC: Named entity recognition in emergency domains based on joint BERT-BILSTM-CRF adversarial training

A Robust Chinese Named Entity Recognition Method Based on Integrating Dual-Layer Features and CSBERT

A template augmented distant supervision framework for Chinese named entity recognition

A Weakly Supervised Chinese Named Entity Recognition Method Combining First-Order Logic

Chinese named entity recognition method based on multiscale feature fusion

PGD-GP: A Chinese Named Entity Recognition Model for Constructing Food Safety Standard Knowledge Graph

Local or global? A novel transformer for Chinese named entity recognition based on multi-view and sliding attention

Chinese Named Entity Recognition Based on Boundary Enhancement with Multi-Class Information

EPIC: An epidemiological investigation of COVID-19 dataset for Chinese named entity recognition

MFF-CNER: A Multi-feature Fusion Model for Chinese Named Entity Recognition in Finance Securities

Multi-Meta Information Embedding Enhanced BERT for Chinese Mechanics Entity Recognition

CLART: A cascaded lattice-and-radical transformer network for Chinese medical named entity recognition

Chinese Named Entity Recognition in Football Based on ALBERT-BiLSTM Model

Improving Low-Resource Chinese Named Entity Recognition Using Bidirectional Encoder Representation from Transformers and Lexicon Adapter

A novel Data and Model Centric artificial intelligence based approach in developing high-performance Named Entity Recognition for Bengali Language.

MCA-NER: Multi-Contextualized Adversarial-Based Attentional Deep Neural Network for Named Entity Recognition

Chinese Named Entity Recognition Augmented with Lexicon Memory