Site-of-Metabolism Prediction with Aleatoric and Epistemic Uncertainty Quantification.

J Chem Inf Model

Department of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, Faculty of Life Sciences, University of Vienna, Josef-Holaubek-Platz 2, Vienna 1090, Austria.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In silico metabolism prediction models have become indispensable tools to optimize the metabolic properties of xenobiotics while preserving their intended biological activity. Among these, site-of-metabolism (SOM) prediction models are particularly valuable for pinpointing metabolically labile atomic positions. However, the practical utility of these models depends not only on their ability to deliver accurate predictions but also on their capacity to provide reliable estimates of predictive uncertainty. In this work, we introduce aweSOM, a graph neural network (GNN)-based SOM prediction model that leverages deep ensembling to model the total predictive accuracy and partition it into its aleatoric and epistemic components. We conduct a comprehensive evaluation of aweSOM's uncertainty estimates on a high-quality data set, identifying key challenges that currently constrain the performance of SOM prediction models. Based on these findings, we propose actionable insights to drive progress in the field of metabolism prediction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12381852PMC
http://dx.doi.org/10.1021/acs.jcim.5c00762DOI Listing

Publication Analysis

Top Keywords

prediction models
12
som prediction
12
aleatoric epistemic
8
metabolism prediction
8
prediction
5
site-of-metabolism prediction
4
prediction aleatoric
4
epistemic uncertainty
4
uncertainty quantification
4
quantification silico
4

Similar Publications

Purpose: The present study aimed to develop a noninvasive predictive framework that integrates clinical data, conventional radiomics, habitat imaging, and deep learning for the preoperative stratification of MGMT gene promoter methylation in glioma.

Materials And Methods: This retrospective study included 410 patients from the University of California, San Francisco, USA, and 102 patients from our hospital. Seven models were constructed using preoperative contrast-enhanced T1-weighted MRI with gadobenate dimeglumine as the contrast agent.

View Article and Find Full Text PDF

Coronary heart disease (CHD) is a leading cause of morbidity and mortality; patients with type 2 diabetes mellitus (T2DM) are at particularly high risk, highlighting the need for reliable biomarkers for early detection and risk stratification. We investigated whether combining the stress hyperglycemia ratio (SHR) and systemic inflammation response index (SIRI) improves CHD detection in T2DM. In this retrospective cohort of 943 T2DM patients undergoing coronary angiography, associations of SHR and SIRI with CHD were evaluated using multivariable logistic regression and restricted cubic splines; robustness was examined with subgroup and sensitivity analyses.

View Article and Find Full Text PDF

Designing Buchwald-Hartwig Reaction Graph for Yield Prediction.

J Org Chem

September 2025

State Key Laboratory of Fine Chemicals, School of Chemical Engineering, Ocean and Life Sciences, Dalian University of Technology, Panjin 124221, P. R. China.

The Buchwald-Hartwig (B-H) reaction graph, a novel graph for deep learning models, is designed to simulate the interactions among multiple chemical components in the B-H reaction by representing each reactant as an individual node within a custom-designed reaction graph, thereby capturing both single-molecule and intermolecular relationship features. Trained on a high-throughput B-H reaction data set, B-H Reaction Graph Neural Network (BH-RGNN) achieves near-state-of-the-art performance with an score of 0.971 while maintaining low computational costs.

View Article and Find Full Text PDF

Background: Labor shortages in health care pose significant challenges to sustaining high-quality care for people with intellectual disabilities. Social robots show promise in supporting both people with intellectual disabilities and their health care professionals; yet, few are fully developed and embedded in productive care environments. Implementation of such technologies is inherently complex, requiring careful examination of facilitators and barriers influencing sustained use.

View Article and Find Full Text PDF

The calculation of the highest occupied molecular orbital-lowest unoccupied molecular orbital (HOMO-LUMO) gap for chemical molecules is computationally intensive using quantum mechanics (QM) methods, while experimental determination is often costly and time-consuming. Machine Learning (ML) offers a cost-effective and rapid alternative, enabling efficient predictions of HOMO-LUMO gap values across large data sets without the need for extensive QM computations or experiments. ML models facilitate the screening of diverse molecules, providing valuable insights into complex chemical spaces and integrating seamlessly into high-throughput workflows to prioritize candidates for experimental validation.

View Article and Find Full Text PDF