MPOCSR: optical chemical structure recognition based on multi-path Vision Transformer

Fan Lin,Jianhua Li

doi:10.1007/s40747-024-01561-6

Fan Lin, Jianhua Li

Open Access

https://doi.org/10.1007/s40747-024-01561-6

Copy DOI

Export

Save

Cite

Journal: Complex & Intelligent Systems	Publication Date: Jul 22, 2024
License type: CC BY 4.0

Abstract
Full-Text
Similar Papers

Abstract

Listen

AbstractOptical chemical structure recognition (OCSR) is a fundamental and crucial task in the field of chemistry, which aims at transforming intricate chemical structure images into machine-readable formats. Current deep learning-based OCSR methods typically use image feature extractors to extract visual features and employ encoder-decoder architectures for chemical structure recognition. However, the performance of these methods is limited by their image feature extractors and the class imbalance of elements in chemical structure representation. This paper proposes MPOCSR (multi-path optical chemical structure recognition), which introduces the multi-path Vision Transformer (MPViT) and the class-balanced (CB) loss function to address these two challenges. MPOCSR uses MPViT as an image feature extractor, combining the advantages of convolutional neural networks and Vision Transformers. This strategy enables the provision of richer visual information for subsequent decoding processes. Furthermore, MPOCSR incorporates CB loss function to rebalance the loss weights among different categories. For training and validation of our method, we constructed a dataset that includes both Markush and non-Markush structures. Experimental results show that MPOCSR achieves an accuracy of 90.95% on the test set, surpassing other existing methods.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

MPOCSR: optical chemical structure recognition based on multi-path Vision Transformer

Abstract

Published Version

Talk to us

Similar Papers

More From: Complex & Intelligent Systems

Lead the way for us

Similar Papers

DECIMER.ai: an open platform for automated optical chemical structure identification, segmentation and recognition in scientific publications
Kohulan Rajan ... Christoph Steinbeck
Nature Communications | VOL. 14
Kohulan Rajan, et. al.Kohulan Rajan ... Christoph Steinbeck
19 Aug 2023
Nature Communications | VOL. 14

Advancements in hand-drawn chemical structure recognition through an enhanced DECIMER architecture
Kohulan Rajan ... Christoph Steinbeck
Journal of Cheminformatics | VOL. 16
Kohulan Rajan, et. al.Kohulan Rajan ... Christoph Steinbeck
05 Jul 2024
Journal of Cheminformatics | VOL. 16

Chemical Structure Recognition (CSR) System: Automatic Analysis of 2D Chemical Structures in Document Images
Syed Saqib Bukhari ... Andreas Dengel
-
Syed Saqib Bukhari, et. al.Syed Saqib Bukhari ... Andreas Dengel
01 Sep 2019
01 Sep 2019

SwinOCSR: end-to-end optical chemical structure recognition using a Swin Transformer
Zhanpeng Xu ... Jianhua Li
Journal of Cheminformatics | VOL. 14
Zhanpeng Xu, et. al.Zhanpeng Xu ... Jianhua Li
01 Jul 2022
Journal of Cheminformatics | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

MPOCSR: optical chemical structure recognition based on multi-path Vision Transformer

Abstract

Published Version

Talk to us

Similar Papers

More From: Complex &amp; Intelligent Systems

More From: Complex & Intelligent Systems