Named Entity Extraction Research Articles

The volume of text available electronically makes it difficult for many users to find and access the right information within an acceptable time. Research communities in natural language processing (NLP) are developing tools and techniques to alleviate this problem and help users exploit these vast resources. These techniques include Information Retrieval (IR) and Information Extraction (IE). The work described in this thesis concerns IE and, more specifically, named entity extraction in English. The English language is of significant interest to the NLP community, mainly because of its political and economic significance but also because of its interesting characteristics. Text usually contains all kinds of names, such as person names, company names, city and country names, sports teams, chemicals, and many other names from specific domains. These names are called Named Entities (NEs). Named Entity Recognition (NER), one of the main tasks of IE systems, seeks to locate these names automatically and classify them into predefined categories. NER systems are developed for different applications and can benefit other information management technologies: an NER system can be built on top of an IR system or used as the base module of a data mining application. In this thesis we propose an efficient and effective framework for extracting Arabic NEs from text using a rule-based approach. Our approach makes use of English contextual and morphological information to extract named entities. The context is represented by words that serve as clues for each named entity type. Morphological information is used to detect the part of speech of each word given to the morphological analyzer. We then developed and implemented rules to recognize the position of each named entity. Finally, we present our system implementation, evaluation metrics, and experimental results. In this paper we present our methodology, which uses a hybrid approach of NLP and machine learning. This is a review paper that also introduces our methodology. Key Terms: Natural Language Processing, Machine Translation, Named Entity Recognition, Different Languages.
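
The clue-word idea in the abstract above can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the clue lists, the entity type names, and the function names are hypothetical, and capitalization is used as a crude English-only stand-in for the part-of-speech information that the abstract obtains from a morphological analyzer.

```python
import re
from typing import List, Tuple

# Hypothetical clue words; the abstract does not enumerate the real ones.
CLUES_BEFORE = {"mr": "PERSON", "mrs": "PERSON", "dr": "PERSON", "president": "PERSON"}
CLUES_AFTER = {"inc": "ORGANIZATION", "ltd": "ORGANIZATION", "corp": "ORGANIZATION"}


def tokenize(text: str) -> List[str]:
    """Split text into alphabetic tokens; punctuation is dropped."""
    return re.findall(r"[A-Za-z]+", text)


def is_name_like(token: str) -> bool:
    """Assumed stand-in for a morphological check: capitalized and not itself a clue."""
    key = token.lower()
    return token[0].isupper() and key not in CLUES_BEFORE and key not in CLUES_AFTER


def extract_entities(text: str) -> List[Tuple[str, str]]:
    """Return (entity text, entity type) pairs found next to clue words."""
    tokens = tokenize(text)
    entities = []
    for i, token in enumerate(tokens):
        key = token.lower()
        if key in CLUES_BEFORE:
            # Rule 1: the entity is the run of name-like tokens after the clue.
            j = i + 1
            while j < len(tokens) and is_name_like(tokens[j]):
                j += 1
            if j > i + 1:
                entities.append((" ".join(tokens[i + 1:j]), CLUES_BEFORE[key]))
        elif key in CLUES_AFTER:
            # Rule 2: the entity is the run of name-like tokens before the clue.
            k = i
            while k > 0 and is_name_like(tokens[k - 1]):
                k -= 1
            if k < i:
                entities.append((" ".join(tokens[k:i]), CLUES_AFTER[key]))
    return entities
```

Calling `extract_entities("Dr. Jane Smith joined Acme Widgets Inc. last year.")` yields one PERSON span and one ORGANIZATION span. A fuller system along the lines the abstract describes would replace `is_name_like` with real part-of-speech output and use a separate clue list for each entity type.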

Background: Named Entity (NE) extraction is one of the most fundamental and important tasks in biomedical information extraction. It involves identifying certain entities in text and classifying them into predefined categories. In the biomedical community there is as yet no general consensus on NE annotation, so it is very difficult to compare existing systems because of corpus incompatibilities. For the same reason, we also cannot exploit the advantages of using different corpora together. In the present work we address the issues of corpus compatibility and use a single objective optimization (SOO) based classifier ensemble technique that exploits the search capability of a genetic algorithm (GA) for NE extraction in biomedicine. We hypothesize that the reliability of each classifier's predictions differs among the various output classes. We use Conditional Random Field (CRF) and Support Vector Machine (SVM) frameworks to build a number of models that depend on the various representations of the set of features and/or feature templates. Note that we tried to extract the features without using any deep domain knowledge and/or resources.

Results: To assess the challenges of corpus compatibility, we experiment with different benchmark datasets and their various combinations. Comparisons with existing approaches demonstrate the efficacy of the technique. The GA-based ensemble achieves performance improvements of around 2% over the individual classifiers. The degradation in performance on the integrated corpus clearly shows the difficulty of the task.

Conclusions: In summary, our ensemble-based approach attains state-of-the-art performance levels for entity extraction on three different kinds of biomedical datasets. The possible reasons for the better performance of our approach are (i) the use of a rich variety of features, as described in Subsection “Features for named entity extraction”; and (ii) the use of a GA-based classifier ensemble technique to combine the outputs of multiple classifiers.
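
As a rough illustration of how a genetic algorithm can search for ensemble weights, the sketch below evolves one vote weight per classifier per output class, in line with the hypothesis that a classifier's reliability differs between classes. The encoding, the operators (truncation selection with elitism, uniform crossover, random-reset mutation), the parameter values, and all function names are assumptions made for illustration; the abstract does not specify them.

```python
import random
from typing import List

Weights = List[List[float]]  # weights[classifier][class]


def vote(weights: Weights, predictions: List[int], n_classes: int) -> int:
    """Weighted vote for one token: each classifier adds its weight for the class it predicts."""
    scores = [0.0] * n_classes
    for clf, label in enumerate(predictions):
        scores[label] += weights[clf][label]
    return max(range(n_classes), key=scores.__getitem__)


def fitness(weights: Weights, all_predictions: List[List[int]],
            gold: List[int], n_classes: int) -> float:
    """Single objective: accuracy of the weighted vote on held-out data."""
    correct = sum(vote(weights, preds, n_classes) == label
                  for preds, label in zip(all_predictions, gold))
    return correct / len(gold)


def evolve(all_predictions: List[List[int]], gold: List[int],
           n_classifiers: int, n_classes: int, pop_size: int = 20,
           generations: int = 30, mutation_rate: float = 0.1,
           seed: int = 0) -> Weights:
    """Search for vote weights with a small GA (assumed operators, see lead-in)."""
    rng = random.Random(seed)

    def random_chromosome() -> Weights:
        return [[rng.random() for _ in range(n_classes)] for _ in range(n_classifiers)]

    def score(weights: Weights) -> float:
        return fitness(weights, all_predictions, gold, n_classes)

    # Seed the population with plain majority voting so the result is never worse than it.
    uniform = [[1.0] * n_classes for _ in range(n_classifiers)]
    population = [uniform] + [random_chromosome() for _ in range(pop_size - 1)]

    for _ in range(generations):
        population.sort(key=score, reverse=True)
        survivors = population[:pop_size // 2]  # elitism: the best half is kept unchanged
        children = []
        while len(survivors) + len(children) < pop_size:
            parent_a, parent_b = rng.sample(survivors, 2)
            # Uniform crossover: each weight comes from one parent at random.
            child = [[a if rng.random() < 0.5 else b for a, b in zip(row_a, row_b)]
                     for row_a, row_b in zip(parent_a, parent_b)]
            # Mutation: occasionally reset a weight to a fresh random value.
            for row in child:
                for c in range(n_classes):
                    if rng.random() < mutation_rate:
                        row[c] = rng.random()
            children.append(child)
        population = survivors + children

    return max(population, key=score)
```

Here `all_predictions[t]` holds the class index that each base classifier (for example, a CRF or SVM model) predicts for token `t`, and `gold[t]` is the reference label. Because the uniform-weight chromosome is in the initial population and the best half always survives, the returned weights score at least as well as unweighted voting on the data used for fitness; whether that gain carries over to unseen data has to be checked separately.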

Articles published on Named Entity Extraction

Statistical Arabic Name Entity Recognition Approaches: A Survey

Boosted Web Named Entity Recognition via Tri-Training

Issues and Challenges in Marathi Named Entity Recognition

Optimized Multi Class SVM Classifier for Named Entity Extraction for Workflow Scheduling in Cloud

Semantic Enrichment for Recommendation of Primary Studies in a Systematic Literature Review

TwitterNEED: A hybrid approach for named entity extraction and disambiguation for tweet

Exploiting Linked Data for Open and Configurable Named Entity Extraction

NER-FL: A Novel Named Entity Recognizer of Farsi Language using the Web-Based Natural Language Processors and Semantic Annotations

Tibetan-Chinese Named Entity Extraction Based on Comparable Corpus

Name Entity Recognition by New Framework Using Machine Learning Algorithm

Razpoznavanje imenskih entitet v slovenskem besedilu (Named Entity Recognition in Slovene Text)

Biomedical named entity extraction: some issues of corpus compatibilities

A Rule Based Answer Extraction System with Stemming & Anaphora Resolution

An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge

ROSeAnn

Analysis of Textual Data Based on Inductive Learning Techniques

Arabic Semantic Web Applications – A Survey

ANEEC: A Quasi-Automatic System for Massive Named Entity Extraction and Categorization

The Study on Enlarging Specific Extractor for Technology-Related Named Entity Extraction from Text Collections of Applied Mechanics Field

A new multiword expression metric and its applications
