Named Entity Extraction via Automatic Labeling and Tri-training: Comparison of Selection Methods

Chien-Lung Chou,Chia-Hui Chang

doi:10.1007/978-3-319-12844-3_21

Abstract

AbstractDetecting named entities from documents is one of the most important tasks in knowledge engineering. Previous studies rely on annotated training data, which is quite expensive to obtain large training data sets, limiting the effectiveness of recognition. In this research, we propose a semi-supervised learning approach for named entity recognition (NER) via automatic labeling and tritraining which make use of unlabeled data and structured resources containing known named entities. By modifying tri-training for sequence labeling and deriving proper initialization, we can train a NER model for Web news articles automatically with satisfactory performance. In the task of Chinese personal name extraction from 8,672 news articles on the Web (with 364,685 sentences and 54,449 (11,856 distinct) person names), an F-measure of 90.4% can be achieved.KeywordsNamed entity extractionco-labeling methodtri-training

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Named Entity Extraction via Automatic Labeling and Tri-training: Comparison of Selection Methods

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Semi-supervised Sequence Labeling for Named Entity Extraction based on Tri-Training: Case Study on Chinese Person Name Extraction
Chien-Lung Chou ... Chia-Hui Chang
-
Chien-Lung Chou, et. al.Chien-Lung Chou ... Chia-Hui Chang
01 Jan 2014
01 Jan 2014

Boosted Web Named Entity Recognition via Tri-Training
Chien-Lung Chou ... Ya-Yun Huang
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 16
Chien-Lung Chou, et. al.Chien-Lung Chou ... Ya-Yun Huang
14 Oct 2016
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 16

Thai personal named entity extraction without using word segmentation or POS tagging
P Sutheebanjard ... W Premchaiswadi
-
P Sutheebanjard, et. al.P Sutheebanjard ... W Premchaiswadi
01 Oct 2009
01 Oct 2009

Improving Named Entity Extraction Accuracy using Unlabeled Data and Several Extractors (pp. 29-38)
Tomoya Iwakura ... Seishi Okamoto
Polibits | VOL. 40
Tomoya Iwakura, et. al.Tomoya Iwakura ... Seishi Okamoto
31 Dec 2009
Improving Named Entity Extraction Accuracy using Unlabeled Data and Several Extractors (pp. 29-38)
Tomoya Iwakura ... Seishi Okamoto

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Named Entity Extraction via Automatic Labeling and Tri-training: Comparison of Selection Methods

Abstract

Talk to us

Similar Papers