Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Chemical structure segmentation constitutes a pivotal task in cheminformatics, involving the extraction and abstraction of structural information of chemical compounds from text-based sources, including patents and scientific articles. This study introduces a deep learning approach to chemical structure segmentation, employing a Vision Transformer (ViT) to discern the structural patterns of chemical compounds from their graphical representations. The Chemistry-Segment Anything Model (ChemSAM) achieves state-of-the-art results on publicly available benchmark datasets and real-world tasks, underscoring its effectiveness in accurately segmenting chemical structures from text-based sources. Moreover, this deep learning-based approach obviates the need for handcrafted features and demonstrates robustness against variations in image quality and style. During the detection phase, a ViT-based encoder-decoder model is used to identify and locate chemical structure depictions on the input page. This model generates masks to ascertain whether each pixel belongs to a chemical structure, thereby offering a pixel-level classification and indicating the presence or absence of chemical structures at each position. Subsequently, the generated masks are clustered based on their connectivity, and each mask cluster is updated to encapsulate a single structure in the post-processing workflow. This two-step process facilitates the effective automatic extraction of chemical structure depictions from documents. By utilizing the deep learning approach described herein, it is demonstrated that effective performance on low-resolution and densely arranged molecular structural layouts in journal articles and patents is achievable.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10935819PMC
http://dx.doi.org/10.1186/s13321-024-00823-2DOI Listing

Publication Analysis

Top Keywords

chemical structure
20
structure segmentation
12
chemical
9
chemical compounds
8
text-based sources
8
deep learning
8
learning approach
8
chemical structures
8
structure depictions
8
structure
7

Similar Publications

The electron-deficient oxidant 2,3-dichloro-5,6-dicyano-1,4-benzoquinone (DDQ) has recently emerged as a promising visible-light photoredox catalyst. However, its excited-state behavior remains poorly understood. Here, we investigate the ultrafast dynamics of photoexcited DDQ in acetonitrile using transient electronic and infrared absorption spectroscopy, supported by quantum chemical calculations.

View Article and Find Full Text PDF

Nuclear receptors (NRs) are a superfamily of ligand-activated transcription factors that regulate gene expression in response to metabolic, hormonal, and environmental signals. These receptors play a critical role in metabolic homeostasis, inflammation, immune function, and disease pathogenesis, positioning them as key therapeutic targets. This review explores the mechanistic roles of NRs such as PPARs, FXR, LXR, and thyroid hormone receptors (THRs) in regulating lipid and glucose metabolism, energy expenditure, cardiovascular health, and neurodegeneration.

View Article and Find Full Text PDF

This study investigated the impact of dietary zeolite supplementation on growth, cecal microbiota and digesta viscosity, digestive enzymes, carcass traits, blood constituents, and antioxidant parameters of broilers. A completely randomized design was used with 240 one-day-old broiler chicks randomly assigned to three dietary treatments (0%, 1.5%, and 3% zeolite as a feed additive) with four replicates of 20 chicks each.

View Article and Find Full Text PDF

Background: Avenanthramides (AVAs) and Avenacosides (AVEs) are unique to oats (Avena Sativa) and may serve as biomarkers of oat intake. However, information regarding their validity as food intake biomarkers is missing. We aimed to investigate critical validation parameters such as half-lives, dose-response, matrix effects, relative bioavailability under single dose, and in relation to the abundance of Feacalibacterium prausnitzii, and under repeated dosing, to understand the potential applications of AVAs and AVEs as biomarkers of oat intake.

View Article and Find Full Text PDF

Background And Aim: Synthetic dyes in the textile industry pose risks to human health and environmental safety. The current study aims to examine the efficacy of a novel esterase derived from an endophyte fungus in decolorizing diverse dyes, focusing on its production, purification, optimization, and characterization.

Results: Trichoderma afroharzianum AUMC16433, a novel fungal endophyte with esterase-producing ability, was first detected from the cladodes of Opuntia ficus indica by ITS-rRNA sequencing.

View Article and Find Full Text PDF