Single sample pathway analysis in metabolomics: performance evaluation and application.

BMC Bioinformatics

Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, UK.

Published: November 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Single sample pathway analysis (ssPA) transforms molecular level omics data to the pathway level, enabling the discovery of patient-specific pathway signatures. Compared to conventional pathway analysis, ssPA overcomes the limitations by enabling multi-group comparisons, alongside facilitating numerous downstream analyses such as pathway-based machine learning. While in transcriptomics ssPA is a widely used technique, there is little literature evaluating its suitability for metabolomics. Here we provide a benchmark of established ssPA methods (ssGSEA, GSVA, SVD (PLAGE), and z-score) alongside the evaluation of two novel methods we propose: ssClustPA and kPCA, using semi-synthetic metabolomics data. We then demonstrate how ssPA can facilitate pathway-based interpretation of metabolomics data by performing a case-study on inflammatory bowel disease mass spectrometry data, using clustering to determine subtype-specific pathway signatures.

Results: While GSEA-based and z-score methods outperformed the others in terms of recall, clustering/dimensionality reduction-based methods provided higher precision at moderate-to-high effect sizes. A case study applying ssPA to inflammatory bowel disease data demonstrates how these methods yield a much richer depth of interpretation than conventional approaches, for example by clustering pathway scores to visualise a pathway-based patient subtype-specific correlation network. We also developed the sspa python package (freely available at https://pypi.org/project/sspa/ ), providing implementations of all the methods benchmarked in this study.

Conclusion: This work underscores the value ssPA methods can add to metabolomic studies and provides a useful reference for those wishing to apply ssPA methods to metabolomics data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664704PMC
http://dx.doi.org/10.1186/s12859-022-05005-1DOI Listing

Publication Analysis

Top Keywords

pathway analysis
12
sspa methods
12
metabolomics data
12
sspa
9
single sample
8
sample pathway
8
analysis sspa
8
methods
8
inflammatory bowel
8
bowel disease
8

Similar Publications

Background: We investigated circulating protein profiles and molecular pathways among various chronic kidney disease (CKD) etiologies to study its underlying molecular heterogeneity.

Methods: We conducted a proteomic biomarker analysis in the DAPA-CKD trial recruiting adults with and without type 2 diabetes with an eGFR of 25 to 75 mL/min/1.73m2 and a UACR of 200 to 5000 mg/g.

View Article and Find Full Text PDF

Pentameric ligand-gated ion channels control synaptic neurotransmission via an allosteric mechanism, whereby agonist binding induces global protein conformational changes that open an ion-conducting pore. For the proton-activated bacterial () ligand-gated ion channel (GLIC), high-resolution structures are available in multiple conformational states. We used a library of atomistic molecular dynamics (MD) simulations to study conformational changes and to perform dynamic network analysis to elucidate the communication pathways underlying the gating process.

View Article and Find Full Text PDF

Background And Aims: Inflammatory bowel disease (IBD), including Crohn's disease (CD) and ulcerative colitis (UC), remain heterogeneous disorders with variable response to biologics. Post-operative recurrence in CD is common despite surgery and prophylactic biotherapies. Understanding the inflammatory mediators associated with recurrence and treatment response could pave the way for personalized strategies.

View Article and Find Full Text PDF

Motivation: Graph Neural Network (GNN) models have emerged in many fields and notably for biological networks constituted by genes or proteins and their interactions. The majority of enrichment study methods apply over-representation analysis and gene/protein set scores according to the existing overlap between pathways. Such methods neglect knowledges coming from the interactions between the gene/protein sets.

View Article and Find Full Text PDF

The emergence of antimicrobial resistance (AMR) Escherichia coli in poultry farming is a growing global public health concern, particularly in Bangladesh, where the use of antibiotics remains largely unregulated. This study aimed to determine the prevalence and AMR patterns of E. coli isolated from broiler chickens in Sylhet district of Bangladesh and to investigate the network of coexisting resistance traits among the isolates.

View Article and Find Full Text PDF