Abstract

The Crystallography Open Database (COD, http://www.crystallography.net/) is the largest curated open-access collection to date of crystal structures with small to medium-sized unit cells [1,2]. Over 11 years of development, COD has accumulated more than a quarter of a million structures from the peer-reviewed press and personal communications. COD has an automated data submission Web site, performs routine automatic quality checks on all incoming structures, and is now recommended as a database for crystallographic deposition by several scientific journals. To facilitate automatic use and discoverability of COD data, and to make the database more useful for chemists, two steps have been taken. First, COD has been supplemented with software and data from the CrystalEye data aggregator. The new software permits extracting chemical data and presenting them as structural formulae, unique moieties, and chemically significant fragments. Second, we have implemented a search of crystal structures by the structural chemical formulae of the target compounds. The search is performed first among 70 000 hand-curated chemical structure descriptors and can be extended to automatically generated descriptors. To facilitate data curation, a new software platform for data review is being developed. All COD structures will be evaluated against statistical distributions of observed geometrical and chemical properties (bond lengths, angles, dihedrals, planarities). The most statistically unusual structures will be forwarded to a COD reviewer Internet forum, where qualified reviewers will be asked whether they find the evidence provided for a particular structure convincing. In this way, a set of human review indicators (convincing/unconvincing) will be available alongside the match against the bulk of the data (usual/unusual structure). Such indicators would be especially useful for deciding which COD records require special attention and which subsets of COD should be selected for reliable scientific inferences.
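
The statistical screening step described above can be illustrated with a minimal sketch. It is not the COD implementation, and the abstract does not specify which statistic is used. The sketch assumes a modified z-score based on the median and the median absolute deviation (MAD), with the conventional cutoff of 3.5. The function names, structure identifiers, and bond lengths are hypothetical and made up for illustration.

```python
from statistics import median


def robust_z_scores(values):
    """Return modified z-scores based on the median and the MAD."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # All values (or most of them) are identical; nothing can be ranked.
        return [0.0 for _ in values]
    # 0.6745 rescales the MAD so that the scores are comparable to
    # ordinary z-scores when the data are normally distributed.
    return [0.6745 * (v - med) / mad for v in values]


def flag_unusual(observations, threshold=3.5):
    """Return the IDs whose value is a statistical outlier.

    observations: list of (structure_id, bond_length) pairs for ONE
    bond type (an assumed input layout, not a COD data format).
    """
    scores = robust_z_scores([length for _, length in observations])
    return [sid for (sid, _), z in zip(observations, scores)
            if abs(z) > threshold]


# Made-up bond lengths in angstroms for a single hypothetical bond type.
observations = [
    ("s1", 1.52), ("s2", 1.53), ("s3", 1.54),
    ("s4", 1.53), ("s5", 1.51), ("s6", 1.95),
]
unusual = flag_unusual(observations)
print(unusual)
```

In a workflow like the one outlined in the abstract, the identifiers returned by such a filter would be the candidates forwarded to human reviewers. A median-based score is used here because a few extreme values shift it less than they would shift a mean and standard deviation.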
