HiRXN: Hierarchical Attention-Based Representation Learning for Chemical Reaction.

J Chem Inf Model

School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China.

Published: February 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In recent years, natural language processing (NLP) techniques, including large language modeling (LLM), have contributed significantly to advancements in organic chemistry research. Chemical reaction representations provide a link between NLP models and chemistry prediction tasks and enable the translation of complex chemical processes into a format that NLP models can understand and learn from. However, previous representation methods fail to adequately consider the hierarchical and structural information inherent in chemical reactions. Here, we propose a tool named HiRXN to learn the comprehensive representation of chemical reactions based on their hierarchical structure. In order to significantly enhance feature engineering for machine learning (ML) models, HiRXN develops an effective tokenization method called RXNTokenizer to capture atomic microenvironment features with multiradius. Then, the hierarchical attention network is used to integrate information from atomic microenvironment-level and molecule-level to accurately understand chemical reactions. The experimental results show that HiRXN is capable of representing chemical reactions and achieves remarkable performance in terms of reaction regression and classification prediction tasks. A web server has been developed to provide a specialized service that accepts Reaction SMILES as input and provides predicted results. The Web site is accessible at http://bdatju.com.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.4c01787DOI Listing

Publication Analysis

Top Keywords

chemical reactions
16
chemical reaction
8
nlp models
8
prediction tasks
8
chemical
7
hirxn
4
hirxn hierarchical
4
hierarchical attention-based
4
attention-based representation
4
representation learning
4

Similar Publications

Carbon aerogels and xerogels, with their 3D porous architectures, ultralow density, high surface area, and excellent conductivity, have emerged as multifunctional materials for energy and environmental applications. This review highlights recent advances in the synthesis of these materials polymerisation, drying, and carbonisation, as well as the role of novel precursors such as graphene, carbon nanotubes, and biomass. Emphasis is also placed on doped and metal-decorated carbon gels as efficient electrocatalysts for oxygen reduction reactions, enabling four- and two-electron pathways for energy conversion and the production of green HO, respectively.

View Article and Find Full Text PDF

Photocatalytic cyclization reaction of 2-vinylarylamines with CFSONa and arylaldehydes to access 3-(2,2,2-trifluoroethyl)-3-indoles.

Chem Commun (Camb)

September 2025

College of Chemistry, Pingyuan Laboratory, Henan Key Laboratory of Chemical Biology and Organic Chemistry, State Key Laboratory of Coking Coal Resources Green Exploitation, Zhengzhou University, Zhengzhou 450052, P. R. China.

A visible-light-catalyzed three-component cyclization reaction of 2-vinylarylamines with CFSONa and arylaldehydes is developed to build a series of 3-(2,2,2-trifluoroethyl)-3-indoles. This protocol features mild reaction conditions using an 18 W blue LED as the light source at room temperature. The desired 3-indole products can be successfully transformed into valuable tetrahydroindole scaffolds through either reduction or cross-coupling reactions.

View Article and Find Full Text PDF

Surface photocatalysis holds significant promise for converting solar energy into chemical fuels and addressing environmental challenges. While calculations provide critical insights into the thermodynamic and kinetic aspects of catalytic reactions, applying these methods to surface photocatalysis remains challenging. In this work, we discuss the key challenges that need to be addressed when using calculations to understand surface photocatalytic processes, the reasons behind these challenges, and the potential directions and opportunities for overcoming them in the future.

View Article and Find Full Text PDF

Stochastic model analysis of luminescence switching in hybrid systems of CdSe/CdS quantum dots-spiropyran molecular photoswitches.

Phys Chem Chem Phys

September 2025

Department of Chemistry, Graduate School of Science and Technology, Kwansei Gakuin University, 1 Gakuen Uegahara, Sanda, Hyogo 669-1330, Japan.

Hybrid systems (HSs) of quantum dots (QDs) and molecular photoswitches exhibit luminescence switching of QDs based on energy transfer and have garnered attention for their potential applications in sensors and optical memories. In HSs, the chemical composition, such as the number of attached ligands, is inherently distributed, posing challenges for extracting the energy transfer process from the QDs to a single acceptor molecule. The stochastic model, assuming a Poisson distribution for the number of acceptors, proves to be an effective approach for extracting the process.

View Article and Find Full Text PDF

Presented herein is a protocol for the iron-catalyzed [4 + 1] cycloadditions of -Boc-imines with β-ketosulfoxonum ylides through a succession of nucleophilic additions and intramolecular annulation, giving access to functionalized oxazolidine-2-ones in generally good yields. In contrast to the renown Corey-Chaykovsky reaction via N attack to furnish aziridines, this method afforded oxazolidine-2-ones via an O-nucleophilic attack. Features of this strategy include readily accessible starting materials, sustainable catalyst, post-functionalizations of complex molecules, scalability, and exceptional control over diastereoselectivity.

View Article and Find Full Text PDF