Abstract

Chemical information mining has turned into a well-established scientific area over the last five years. Several software solutions exist that are able to identify and extract names of chemical compounds in text documents and convert them into chemical structure-searchable information. Likewise, several programs exist which recognize chemical structures from images and translate them into the computer-readable format, the connection table. However, a still unsolved issue is the automatic abstraction of generic compounds (Markush structures). These usually consist of a core structure image and variable groups specified in the text, in additional images or in tables. This presentation describes our hybrid approach to extract generic structure information from documents by using combining information science, cheminformatics, computational linguistics and pattern recognition techniques. Experiences with the envisaged methodology and the first results are presented. This research project is funded by the German Ministry of Economics and Technology. It is part of the THESEUS research programme which has the goal to facilitate access to information, combine data to form new kinds of knowledge and lay the groundwork for new services on the Internet.

Highlights

  • Chemical information mining has turned into a well-established scientific area over the last five years

  • A still unsolved issue is the automatic abstraction of generic compounds (Markush structures)

  • These usually consist of a core structure image and variable groups specified in the text, in additional images or in tables

Read more

Summary

Introduction

Chemical information mining has turned into a well-established scientific area over the last five years. ChemProspector and generic structures: advanced mining and searching of chemical content Valentina Eigner-Pitto*, Josef Eiblmaier, Hans Kraut, Larisa Isenko, Heinz Saller, Peter Loew

Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.