Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

High-content image-based phenotypic screens (HCSs) provide a scalable approach to characterize biological functions of compounds. The widespread adoption of HCS has led to a growing body of available profile datasets. However, study-specific experimental and computational choices lead to profile datasets that cannot be directly combined. A critical, long-standing challenge is how to integrate these rich but currently isolated HCS dataset resources. Here we introduce a contrastive, deep-learning framework that leverages sparse sets of overlapping profiles as fiducials to align heterogeneous HCS profile datasets in a shared latent space. We demonstrate that this alignment facilitates accurate 'transitive' predictions, whereby the function of an uncharacterized compound screened in one dataset can be predicted through comparison with characterized compounds already profiled in other datasets. In silico alignment of HCS resources provides a path to unify fast-growing HCS resources and accelerate early drug discovery efforts.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41587-025-02729-2DOI Listing

Publication Analysis

Top Keywords

profile datasets
12
hcs resources
8
hcs
5
transitive prediction
4
prediction small-molecule
4
small-molecule function
4
function alignment
4
alignment high-content
4
high-content screening
4
resources
4

Similar Publications

Integrative profiling of lung cancer biomarkers EGFR, ALK, KRAS, and PD-1 with emphasis on nanomaterials-assisted immunomodulation and targeted therapy.

Front Immunol

September 2025

Department of Thoracic Surgery, Shenzhen People's Hospital (The First Affiliated Hospital, Southern University of Science and Technology; The Second Clinical Medical College, Jinan University), Shenzhen, Guangdong, China.

Background: Lung cancer remains the leading cause of cancer-related mortality globally, primarily due to late-stage diagnosis, molecular heterogeneity, and therapy resistance. Key biomarkers such as EGFR, ALK, KRAS, and PD-1 have revolutionized precision oncology; however, comprehensive structural and clinical validation of these targets is crucial to enhance therapeutic efficacy.

Methods: Protein sequences for EGFR, ALK, KRAS, and PD-1 were retrieved from UniProt and modeled using SWISS-MODEL to generate high-confidence 3D structures.

View Article and Find Full Text PDF

The morphological patterns of lung adenocarcinoma (LUAD) are recognized for their prognostic significance, with ongoing debate regarding the optimal grading strategy. This study aimed to develop a clinical-grade, fully quantitative, and automated tool for pattern classification/quantification (PATQUANT), to evaluate existing grading strategies, and determine the optimal grading system. PATQUANT was trained on a high-quality dataset, manually annotated by expert pathologists.

View Article and Find Full Text PDF

Risk analysis of microorganisms in common food spices.

Front Microbiol

August 2025

Shanghai Kangshi Food Science and Technology Co., Ltd., Shanghai, China.

Unlabelled: Spices play a ubiquitous role in the food industry, imparting distinctive flavors and exhibiting antibacterial properties. Nonetheless, microbial contamination and residues present persistent challenges to food safety within this sector. The systematic tracking of microbial input datasets in spices holds significant guiding implications and economic value for microbial control and risk assessment in the food industry.

View Article and Find Full Text PDF

Background And Aim: Granulosa cells (GCs) are crucial mediators of follicular development and oocyte competence in goats, with their gene expression profiles serving as potential biomarkers of fertility. However, the lack of a standardized, quantifiable method to assess GC quality using transcriptomic data has limited the translation of such findings into reproductive applications. This study aimed to develop a hybrid deep learning model integrating one-dimensional convolutional neural networks (1DCNNs) and gated recurrent units (GRUs) to classify GCs as fertility-supporting (FS) or non-fertility-supporting (NFS) using single-cell RNA sequencing (scRNA-seq) data.

View Article and Find Full Text PDF

Background: Most RNA-seq datasets harbor genes with extreme expression levels in some samples. Such extreme outliers are usually treated as technical errors and are removed from the data before further statistical analysis. Here we focus on the patterns of such outlier gene expression to investigate whether they provide insights into the underlying biology.

View Article and Find Full Text PDF