Data processing solutions to render metabolomics more quantitative: case studies in food and clinical metabolomics using Metabox 2.0.

Gigascience

Siriraj Center of Research Excellence in Metabolomics and Systems Biology (SiCORE-MSB), Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand.

Published: January 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In classic semiquantitative metabolomics, metabolite intensities are affected by biological factors and other unwanted variations. A systematic evaluation of the data processing methods is crucial to identify adequate processing procedures for a given experimental setup. Current comparative studies are mostly focused on peak area data but not on absolute concentrations. In this study, we evaluated data processing methods to produce outputs that were most similar to the corresponding absolute quantified data. We examined the data distribution characteristics, fold difference patterns between 2 metabolites, and sample variance. We used 2 metabolomic datasets from a retail milk study and a lupus nephritis cohort as test cases. When studying the impact of data normalization, transformation, scaling, and combinations of these methods, we found that the cross-contribution compensating multiple standard normalization (ccmn) method, followed by square root data transformation, was most appropriate for a well-controlled study such as the milk study dataset. Regarding the lupus nephritis cohort study, only ccmn normalization could slightly improve the data quality of the noisy cohort. Since the assessment accounted for the resemblance between processed data and the corresponding absolute quantified data, our results denote a helpful guideline for processing metabolomic datasets within a similar context (food and clinical metabolomics). Finally, we introduce Metabox 2.0, which enables thorough analysis of metabolomic data, including data processing, biomarker analysis, integrative analysis, and data interpretation. It was successfully used to process and analyze the data in this study. An online web version is available at http://metsysbio.com/metabox.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10941642PMC
http://dx.doi.org/10.1093/gigascience/giae005DOI Listing

Publication Analysis

Top Keywords

data processing
16
data
15
food clinical
8
clinical metabolomics
8
processing methods
8
corresponding absolute
8
absolute quantified
8
quantified data
8
metabolomic datasets
8
milk study
8

Similar Publications

Minoritized racial, ethnic, sexual, and gender communities and populations face profound health disparities and their engagement in research remains low. In a randomized controlled trial, our community-based participatory research partnership tested the efficacy of ChiCAS, an HIV prevention intervention designed to increase pre-exposure prophylaxis use among Spanish-speaking transgender Latinas. Of 161 eligible Spanish-speaking transgender Latinas screened, we enrolled 144, achieving an 89% participation rate, and retained 94% at 6-month follow-up.

View Article and Find Full Text PDF

Long COVID and Food Insecurity in US Adults, 2022-2023.

JAMA Netw Open

September 2025

Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia.

Importance: Long COVID (ie, post-COVID-19 condition) is a substantial public health concern, and its association with health-related social needs, such as food insecurity, remains poorly understood. Identifying modifiable risk factors like food insecurity and interventions like food assistance programs is critical for reducing the health burden of long COVID.

Objective: To investigate the association of food insecurity with long COVID and to assess the modifying factors of Supplemental Nutrition Assistance Program (SNAP) participation and employment status.

View Article and Find Full Text PDF

Achermann, BB, Drewek, A, and Lorenzetti, SR. Acute effect of the bounce squat on ground reaction force at the turning point and barbell kinematics. J Strength Cond Res XX(X): 000-000, 2025-The free-weight back squat is a key exercise for developing lower-body strength, with variations that influence muscle activation and performance.

View Article and Find Full Text PDF

Cancer, with its inherent heterogeneity, is commonly categorized into distinct subtypes based on unique traits, cellular origins, and molecular markers specific to each type. However, current studies primarily rely on complete multi-omics datasets for predicting cancer subtypes, often overlooking predictive performance in cases where some omics data may be missing and neglecting implicit relationships across multiple layers of omics data integration. This paper introduces Multi-Layer Matrix Factorization (MLMF), a novel approach for cancer subtyping that employs multi-omics data clustering.

View Article and Find Full Text PDF

Essentials of the System of Radiological Protection.

J Radiol Prot

September 2025

Centre for Radiation Protection Research, Stockholm University, Svante Arrheniusväg 20C, 106 91 Stockholm, Sweden.

The System of Radiological Protection (the "System") developed by the International Commission on Radiological Protection (ICRP) is built on nearly a century of efforts of numerous scientists and practitioners working together internationally. It rests on three enduring pillars: science, ethics, and experience. These pillars support the three fundamental principles that shape radiological protection strategies: justification, optimisation, and application of dose limits.

View Article and Find Full Text PDF