Benchmarking of dimensionality reduction methods to capture drug response in transcriptome data.

Sci Rep

Department of Pharmacy, College of Pharmacy and Research Institute for Drug Development, Pusan National University, Busan, Republic of Korea.

Published: September 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Drug-induced transcriptomic data are crucial for understanding molecular mechanisms of action (MOAs), predicting drug efficacy, and identifying off-target effects. However, their high dimensionality presents challenges for analysis and interpretation. Dimensionality reduction (DR) methods simplify such data, enabling efficient analysis and visualization. Despite their importance, few studies have evaluated the performance of DR methods specifically for drug-induced transcriptomic data. We tested the DR methods across four distinct experimental conditions using data from the Connectivity Map (CMap) dataset, which includes different cell lines, drugs, MOA, and drug dosages. t-distributed Stochastic Neighbor Embedding (t-SNE), Uniform Manifold Approximation and Projection (UMAP), Pairwise Controlled Manifold Approximation (PaCMAP), and TRIMAP outperformed other methods in preserving both local and global biological structures, particularly in separating distinct drug responses and grouping drugs with similar molecular targets. However, most methods struggled with detecting subtle dose-dependent transcriptomic changes, where Spectral, Potential of Heat-diffusion for Affinity-based Trajectory Embedding (PHATE), and t-SNE showed stronger performance. Standard parameter settings limited the optimal performance of DR methods, highlighting the need for further exploration of hyperparameter optimization. Our study provides valuable insights into the strengths and limitations of various DR methods for analyzing drug-induced transcriptomic data. While t-SNE, UMAP, and PaCMAP are well-suited for studying discrete drug responses, further refinement is needed for detecting subtle dose-dependent changes. This study highlights the importance of selecting the DR method to accurately analyze drug-induced transcriptomic data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12402208PMC
http://dx.doi.org/10.1038/s41598-025-12021-7DOI Listing

Publication Analysis

Top Keywords

drug-induced transcriptomic
16
transcriptomic data
16
dimensionality reduction
8
methods
8
reduction methods
8
performance methods
8
manifold approximation
8
drug responses
8
detecting subtle
8
subtle dose-dependent
8

Similar Publications

Integration of transcriptomics and metabolomics to reveal crizotinib-induced liver injury in mice.

Eur J Pharmacol

September 2025

Department of Pharmacy, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, P. R. China; Henan Key Laboratory of Precision Clinical Pharmacy, Zhengzhou, P. R. China. Electronic address:

Drug-induced liver injury is a major cause of acute liver failure. Crizotinib is a first-line treatment for patients with cellular-mesenchymal epithelial transition factor (c-MET), anaplastic lymphoma kinase (ALK), and ROS proto-oncogene 1 (ROS1)-positive non-small cell lung cancer. Although some patients treated with crizotinib experience hepatic adverse effects, the underlying mechanisms remain unclear.

View Article and Find Full Text PDF

Benchmarking of dimensionality reduction methods to capture drug response in transcriptome data.

Sci Rep

September 2025

Department of Pharmacy, College of Pharmacy and Research Institute for Drug Development, Pusan National University, Busan, Republic of Korea.

Drug-induced transcriptomic data are crucial for understanding molecular mechanisms of action (MOAs), predicting drug efficacy, and identifying off-target effects. However, their high dimensionality presents challenges for analysis and interpretation. Dimensionality reduction (DR) methods simplify such data, enabling efficient analysis and visualization.

View Article and Find Full Text PDF

This study evaluated the utility of an integrated drug discovery strategy that combines three emerging data-driven approaches: real-world data analysis, in silico screening, and network pharmacology. First, transcriptomic data from public gene expression databases and adverse event reports were analyzed to address myocarditis induced by immune checkpoint inhibitors. The findings suggested a preventive effect of non-steroidal anti-inflammatory drugs, particularly those targeting the arachidonic acid metabolism pathway.

View Article and Find Full Text PDF

Modeling drug-induced transcriptional responses at the single-cell level is essential for advancing human healthcare, particularly in understanding disease mechanisms, assessing therapeutic efficacy, and anticipating adverse effects. However, existing approaches often impose a rigid constraint by enforcing pointwise alignment of latent representations to a standard normal prior, which limits expressiveness and results in biologically uninformative embeddings, especially in complex biological systems. Additionally, many methods inadequately address the challenges of unpaired data, typically relying on naive averaging strategies that ignore cell-type specificity and intercellular heterogeneity.

View Article and Find Full Text PDF

: Liver fibrosis, a consequence of various chronic liver diseases, is characterized by excessive accumulation of extracellular matrix (ECM), leading to impaired liver function and potentially progressing to cirrhosis or hepatocellular carcinoma. The molecular mechanisms underlying liver fibrosis are complex and not fully understood. In vivo experiments are essential for studying the molecular mechanisms of the disease.

View Article and Find Full Text PDF