"Found in Translation": predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models.

Philippe Schwaller,Dávid Lányi,Théophile Gaudin,Teodoro Laino,Costas Bekas

doi:10.1039/c8sc02339e

Philippe Schwaller, Dávid Lányi + Show 3 more

Open Access

https://doi.org/10.1039/c8sc02339e

Copy DOI

Journal: Chemical Science	Publication Date: Jan 1, 2018
Citations: 332	License type: CC BY-NC 3.0

Affiliation: IBM Research - Zurich

Abstract

There is an intuitive analogy of an organic chemist's understanding of a compound and a language speaker's understanding of a word. Based on this analogy, it is possible to introduce the basic concepts and analyze potential impacts of linguistic analysis to the world of organic chemistry. In this work, we cast the reaction prediction task as a translation problem by introducing a template-free sequence-to-sequence model, trained end-to-end and fully data-driven. We propose a tokenization, which is arbitrarily extensible with reaction information. Using an attention-based model borrowed from human language translation, we improve the state-of-the-art solutions in reaction prediction on the top-1 accuracy by achieving 80.3% without relying on auxiliary knowledge, such as reaction templates or explicit atomic features. Also, a top-1 accuracy of 65.4% is reached on a larger and noisier dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

"Found in Translation": predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models.

Abstract

Talk to us

Similar Papers

More From: Chemical Science

Lead the way for us

Similar Papers

What do Neural Machine Translation Models Learn about Morphology?
Yonatan Belinkov ... Nadir Durrani
-
Yonatan Belinkov, et. al.Yonatan Belinkov ... Nadir Durrani
01 Jan 2017
01 Jan 2017

Neural Machine Translation model for University Email Application
Sandhya Aneja ... Nagender Aneja
-
Sandhya Aneja, et. al.Sandhya Aneja ... Nagender Aneja
11 Jul 2020
11 Jul 2020

A Study on Chinese-English Machine Translation Based on Migration Learning and Neural Networks
Fan Ying
International Journal on Artificial Intelligence Tools | VOL. 31
Fan YingFan Ying
01 Aug 2022
International Journal on Artificial Intelligence Tools | VOL. 31

Research on the Construction of a Bidirectional Neural Network Machine Translation Model Fused with Attention Mechanism
Guangming Zuo
Mathematical Problems in Engineering | VOL. 2022
Guangming ZuoGuangming Zuo
19 Aug 2022
Mathematical Problems in Engineering | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

"Found in Translation": predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models.

Abstract

Talk to us

Similar Papers

More From: Chemical Science