Identifying Abbreviations in Biomedical Literature Based on Maximum Entropy with Web Features

Jing Peng,Hong Min Sun,Yan Wang

doi:10.4028/www.scientific.net/amr.998-999.1024

Identifying Abbreviations in Biomedical Literature Based on Maximum Entropy with Web Features

Jing Peng, Hong Min Sun + Show 1 more

https://doi.org/10.4028/www.scientific.net/amr.998-999.1024

Copy DOI

Export

Save

Cite

Journal: Advanced Materials Research

Publication Date: Jul 1, 2014

Affiliation: Northeast Agricultural University

#Biomedical Literature Mining #Gold Standard Corpus #Larger Test Data #Maximum Entropy Classifier #Maximum Entropy #Machine Learning Framework #Biomedical Literature #Web Features #Knowledge Source #Full Literatures

Abstract
Full-Text
Similar Papers

Abstract

Listen

The number of biomedical literatures is growing rapidly, and biomedical literature mining is becoming essential. A learning classifier based on maximum entropy (ME) for identifying abbreviations is proposed. Two innovative Web-based features for extracting additional semantic information are developed. The study shows the Web as a knowledge source can be incorporated effectively in the machine learning framework and significantly improves its performance. The ME classifier achieves 95% precision and 89% recall on the gold standard corpus “Medstract” and 91% precision and 84% recall on the larger test data that includes 128 full text literatures.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Advanced Materials Research

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Identifying Abbreviations in Biomedical Literature Based on Maximum Entropy with Web Features