Learning the Language of Chemical Reactions using Transformer Models

webinar

Thu, 23 Sep 2021, 16:00 CEST (Berlin)

Dr. Philippe Schwaller, IBM Research Zurich, Switzerland

Learning the Language of Chemical Reactions using Transformer Models

In organic chemistry, we are currently witnessing a rise in artificial intelligence (AI) approaches, which show great potential for improving molecular design, facilitating synthesis and accelerating the discovery of novel molecules. Based on an analogy between written language and organic chemistry, Philippe et al. built linguistics-inspired transformer neural network models for chemical reaction prediction, synthesis planning, and the prediction of experimental actions. They extended the models to chemical reaction classification and fingerprints. By finding a mapping from discrete reactions to continuous vectors, they enabled efficient chemical reaction space exploration. Moreover, the group specialized similar models for reaction yield predictions. Intrigued by the remarkable performance of chemical language models, they discovered that the models can capture how atoms rearrange during a reaction, without supervision or human labelling, leading to the development of the open-source atom-mapping tool RXNMapper. These advances led to the developments of IBM RXN for Chemistry and RoboRXN, the prototype of an AI-driven, cloud-connected, and automated synthesis platform. RoboRXN will enable chemists to execute chemical syntheses from the comfort of their home. Philippe will provide an overview of the different contributions that are at the base of this digital synthetic chemistry revolution.

Image taken from: http://rxnmapper.ai/index.html

Current news

AMBrosia Just Got Bigger: 126 Billion Compound Update Now Available
June 11, 2025 13:21 CEST
We are thrilled to announce a major update to the AMBrosia Chemical Space in collaboration with Ambinter! The state-of-the-art source for diverse drug-like scaffolds now boasts an incredible 126 billion of synthetically accessible compounds for drug discovery projects. What does the update feature? Expanded to an astounding 125,924,571,692 compounds (1.26...
Read on
category
Software
Mode Highlight: SeeSAR's Idea Factory
May 28, 2025 11:06 CEST
We’re excited to spotlight the Inspirator Mode in SeeSAR – a feature designed to supercharge your creativity during hit and lead discovery. Whether you’re a medicinal chemist or computational expert, Inspirator highlights the most promising molecular decorations across hundreds of thousands of conformations. With just a few clicks, it guides...
Read on
category
Challenge
Winners of the Scientific Challenge Spring 2024: Diana Assis And Vicente Ledesma with Molecular Photoswitches Targeting The Orexin System In Alzheimer's Disease
May 6, 2025 14:56 CEST
We are happy to announce that Diana Assis and Vicente Ledesma from the Faculty of Pharmacy, University of Lisbon, Portugal, are the winners of the BioSolveIT Scientific Challenge Spring 2024. Their research explores the potential of molecular photoswitches targeting orexin receptors 1 and 2 (OX1R and OX2R) as a novel...
Read on