Abstract

The accumulation of biomedical literature makes it increasingly difficult for scientists to keep up with scientific advancements, requiring the development of text mining tools to collect and integrate data in a high-throughput fashion. A major challenge in biomedical text mining is how to recognize genes sensitively and accurately, and translate them to their official gene symbols. Gene symbols and their commonly used aliases and synonyms usually derive from an abbreviation of the gene's description. However, many gene symbols and alias exactly match other abbreviations commonly used in scientific literature that do not refer to genes. A systematic study on the abbreviations used in biomedical literatures should help improve the accuracy of gene recognition during text mining.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.