Density Functional Theory (DFT) is extensively used in theoretical and computational chemistry to study molecular and crystal properties across diverse fields, including quantum chemistry, materials physics, catalysis, biochemistry, and surface science. Despite advances in DFT hardware and software for optimized geometries, achieving consensus in molecular structure comparisons with experimental counterparts remains a challenge. This difficulty is exacerbated by the lack of automated bond length comparison tools, resulting in labor-intensive and error-prone manual processes. To address these challenges, we propose MolGC, a Molecular Geometry Comparator algorithm that automates the comparison of optimized geometries from different theoretical levels. MolGC calculates the mean absolute error (MAE) of bond lengths by integrating data from various DFT software. It provides interactive and customizable visualization of geometries, enabling users to explore different views for enhanced analysis. In addition, it saves MAE computations for further analysis and offers a comprehensive statistical summary of the results. MolGC effectively addresses complex graph labeling challenges, ensuring accurate identification and categorization of bonds in diverse chemical structures. It achieves a 98.91% average rate in correct bond label assignments on an antibiotics dataset, showcasing its effectiveness for comparing molecular bond lengths across geometries of varying complexity and size. The executable file and software resources for running MolGC can be downloaded from https://github.com/AbimaelGP/MolGC/tree/main .
Read full abstract