Graph-sequence enhanced transformer for template-free prediction of natural product biosynthesis.

Patterns (N Y)

Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, P.R. China.

Published: August 2025


Citations: 20

Article Abstract

Natural products (NPs) play a vital role in drug discovery, with many FDA-approved drugs derived from these compounds. Despite their significance, the biosynthetic pathways of NPs remain poorly characterized due to their inherent complexity and the limitations of traditional retrosynthesis methods in predicting such intricate reactions. While template-free machine learning models have demonstrated promise in organic synthesis, their application to biosynthetic pathways is still in its infancy. Addressing this gap, we propose the graph-sequence enhanced transformer (GSETransformer), which leverages both graph structural information and sequential dependencies to achieve superior performance in addressing the complexity of biosynthetic data. When evaluated on benchmark datasets, GSETransformer achieves state-of-the-art performance in single- and multi-step retrosynthesis tasks. These results highlight its effectiveness in computational biosynthesis and its potential to facilitate the design of NP-based therapeutics.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12365517
DOI: http://dx.doi.org/10.1016/j.patter.2025.101259

Similar Publications

Named Entity Recognition (NER) is a natural language processing task for recognizing named entities in a given sentence. Chinese NER is difficult due to the lack of delimited spaces and conventional features for determining named entity boundaries and categories. This study proposes the ME-MGNN (Multiple Embeddings enhanced Multi-Graph Neural Networks) model for Chinese NER in the healthcare domain.
