Semantic Drift in Espresso-style Bootstrapping: Graph-theoretic Analysis and Evaluation in Word Sense Disambiguation

Mamoru Komachi,Yuji Matsumoto,Taku Kudo,Masashi Shimbo

doi:10.1527/tjsai.25.233

Semantic Drift in Espresso-style Bootstrapping: Graph-theoretic Analysis and Evaluation in Word Sense Disambiguation

Mamoru Komachi, Yuji Matsumoto + Show 2 more

Open Access

https://doi.org/10.1527/tjsai.25.233

Copy DOI

Journal: Transactions of the Japanese Society for Artificial Intelligence	Publication Date: Jan 1, 2010
Citations: 7	License type: free

Affiliation: Nara Institute of Science and Technology

#Semantic Drift #Task Of Word Sense Disambiguation + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Bootstrapping has a tendency, called semantic drift, to select instances unrelated to the seed instances as the iteration proceeds. We demonstrate the semantic drift of Espresso-style bootstrapping has the same root as the topic drift of Kleinberg's HITS, using a simplified graph-based reformulation of bootstrapping. We confirm that two graph-based algorithms, the von Neumann kernels and the regularized Laplacian, can reduce the effect of semantic drift in the task of word sense disambiguation (WSD) on Senseval-3 English Lexical Sample Task. Proposed algorithms achieve superior performance to Espresso and previous graph-based WSD methods, even though the proposed algorithms have less parameters and are easy to calibrate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Transactions of the Japanese Society for Artificial Intelligence

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.