Abstract
This paper proposes an unsupervised word sense disambiguation method for the biomedical domain. In this paper, a network representation of co-occurrence data is first defined to represent both word senses and word contexts. The representation expresses textual context observed around a certain term as a network, where nodes are terms and edges are the number of co-occurrences between connected terms. A graph kernel is adopted as a similarity measure between terms and senses represented in networks. Candidate senses and ambiguous contexts are then compared directly in the representation space to resolve the word sense. It only needs the sense definitions and a large amount of unlabeled texts. The experiments in the biomedical domain show that the method outperforms a baseline method of vector representation. The performance of the proposed method is comparable to the state-of-the-art unsupervised word sense disambiguation methods.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.