Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Despite the increasing availability of tandem mass spectrometry (MS/MS) community spectral libraries for untargeted metabolomics over the past decade, the majority of acquired MS/MS spectra remain uninterpreted. To further aid in interpreting unannotated spectra, we created a nearest neighbor suspect spectral library, consisting of 87,916 annotated MS/MS spectra derived from hundreds of millions of MS/MS spectra originating from published untargeted metabolomics experiments. Entries in this library, or "suspects," were derived from unannotated spectra that could be linked in a molecular network to an annotated spectrum. Annotations were propagated to unknowns based on structural relationships to reference molecules using MS/MS-based spectrum alignment. We demonstrate the broad relevance of the nearest neighbor suspect spectral library through representative examples of propagation-based annotation of acylcarnitines, bacterial and plant natural products, and drug metabolism. Our results also highlight how the library can help to better understand an Alzheimer's brain phenotype. The nearest neighbor suspect spectral library is openly available for download or for data analysis through the GNPS platform to help investigators hypothesize candidate structures for unknown MS/MS spectra in untargeted metabolomics data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10733301PMC
http://dx.doi.org/10.1038/s41467-023-44035-yDOI Listing

Publication Analysis

Top Keywords

nearest neighbor
16
neighbor suspect
16
suspect spectral
16
spectral library
16
untargeted metabolomics
16
ms/ms spectra
16
unannotated spectra
8
library
6
spectra
6
spectral
5

Similar Publications

This study investigates the relationship between dietary antioxidants and heart failure (HF) risk using nationally representative National Health and Nutrition Examination Survey data (2005-2018). It aims to identify key dietary antioxidants and develop a machine-learning-based predictive model for HF. Among 9279 participants (434 HF cases), 44 dietary antioxidant variables were extracted from two 24-h dietary recalls.

View Article and Find Full Text PDF

This research examines the impact of environmental (dis)amenities on residential rental values in the urban areas of Rawalpindi and Islamabad, Pakistan. Using a unique dataset of 849 households and geospatial data on 35 irregular dumpsites, we quantify how proximity to environmental disamenities depresses rental prices. Specifically, results confirm that irregular dumpsites significantly depress rental values, especially for properties situated near the closest distance rings.

View Article and Find Full Text PDF

Objective: The aim of this study is to analyse the factors affecting medical burnout in hospitals, identify the characteristics of staff experiencing high levels of burnout and devise a practical and sustainable prediction mechanism.

Methods: A survey was conducted to access the current situation, followed by a regression analysis using data from the Maslach Burnout Inventory General Survey, demographic information related to healthcare personnel and employee job satisfaction metrics from the hospitals under study. Subsequently, four predictive models-logistic regression, K-nearest neighbour, decision tree and random forest (RF)-were employed to predict the degree of healthcare burnout.

View Article and Find Full Text PDF

Physicochemical Property Models for Poly- and Perfluorinated Alkyl Substances and Other Chemical Classes.

J Chem Inf Model

September 2025

United States Environmental Protection Agency, Center for Computational Toxicology and Exposure, 109 TW Alexander Dr., Research Triangle Park, North Carolina 27711, United States.

To assess environmental fate, transport, and exposure for PFAS (per- and polyfluoroalkyl substances), predictive models are needed to fill experimental data gaps for physicochemical properties. In this work, quantitative structure-property relationship (QSPR) models for octanol-water partition coefficient, water solubility, vapor pressure, boiling point, melting point, and Henry's law constant are presented. Over 200,000 experimental property value records were extracted from publicly available data sources.

View Article and Find Full Text PDF

Drug-induced hepatotoxicity (DIH), characterized by diverse phenotypes and complex mechanisms, remains a critical challenge in drug discovery. To systematically decode this diversity and complexity, we propose a multi-dimensional computational framework integrating molecular structure analysis with disease pathogenesis exploration, focusing on drug-induced intrahepatic cholestasis (DIIC) as a representative DIH subtype. First, a graph-based modularity maximization algorithm identified DIIC risk genes, forming a DIIC module and eight disease pathogenesis clusters.

View Article and Find Full Text PDF