Abstract

The existing volumes of biomedical texts available online drive the increasing need for automated techniques to analyze and extract knowledge from these information repositories. Recognizing and classifying biomedical terms in these texts is an important step for developing efficient techniques for knowledge discovery and information extraction from the literature. This paper presents a new technique for biomedical term classification in biomedical texts. The method is based on combing successful feature selection techniques (MI, X2) with machine learning (SVM) for biomedical term classification. We utilize the advances in feature selection techniques in IR and use them to select the key features for term identification and classification. We evaluated the method using Genia 3.0 corpus with about 3,000 to more than 34,000 biomedical term instances. The technique is effective, achieving impressive accuracy, precision, and recall; and with F-score approaching ~ 90%, the method is superior or very competitive with the recently published results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.