Abstract

Identifying expansion forms for acronyms is beneficial to many natural language processing and information retrieval tasks. As a step to this goal, we study the problem of finding expansions in texts for given acronym queries by modeling the problem as a sequence labeling task. However, it is challenging for traditional sequence labeling models like the Conditional Random Fields (CRF) because of the complexity of the input sentences and the substructure of the categories. In this paper, we propose the neural conditional random fields model with latent variables (LNCRF) to deal with the aforementioned challenge. On one hand, we extend the CRF by coupling it with nonlinear hidden layers to learn better representations of the input data under the framework of the CR-F. On the other hand, we introduce latent variables to capture the fine-grained information from the intrinsic substructures within each label implicitly. Experimental results on real data show that our model achieves the best performance against the state-of-the-art baselines including the Support Vector Machines and the standard Conditional Random Fields.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.