Multimodal contrastive learning for spatial gene expression prediction using histology images.

Brief Bioinform

Medical Big Data, Shenzhen Research Institute of Big Data, Longxiang Boulevard, Longgang District, Shenzhen 518172, Guangdong, China.

Published: September 2024


Citations: 20

Article Abstract

In recent years, spatial transcriptomics (ST) technology has opened new opportunities for studying gene expression patterns within complex biological systems. Despite this potential, the high cost of ST remains a significant barrier to its adoption in large-scale studies. A more cost-effective alternative is to use artificial intelligence to predict gene expression levels from readily accessible whole-slide images stained with hematoxylin and eosin (H&E). However, existing methods have yet to fully exploit the multimodal information provided by H&E images and ST data with spatial location. In this paper, we propose mclSTExp, a multimodal contrastive learning framework with Transformer and DenseNet-121 encoders for Spatial Transcriptomics Expression prediction. We treat each spot as a "word", integrating its intrinsic features with spatial context through the self-attention mechanism of a Transformer encoder. This representation is further enriched with image features via contrastive learning, which improves the predictive capability of the model. We evaluated highly variable genes in two breast cancer datasets and a skin squamous cell carcinoma dataset, and the results demonstrate that mclSTExp exhibits superior performance in predicting spatial gene expression. Moreover, mclSTExp shows promise in interpreting cancer-specific overexpressed genes, elucidating immune-related genes, and identifying specialized spatial domains annotated by pathologists. Our source code is available at https://github.com/shizhiceng/mclSTExp.
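
The contrastive step described in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the mclSTExp repository: the symmetric, CLIP-style InfoNCE objective, the temperature value, the function names, and the random embeddings standing in for the DenseNet-121 image features and the Transformer spot features are all assumptions made for illustration. The idea shown is that the image patch and the expression embedding of the same spot form a positive pair, while every other spot in the batch acts as a negative.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def symmetric_contrastive_loss(image_emb, spot_emb, temperature=0.1):
    """Symmetric InfoNCE loss (an assumed objective, not the authors' exact one).

    image_emb: (n_spots, d) embeddings of H&E patches (hypothetical encoder output).
    spot_emb:  (n_spots, d) embeddings of spot expression plus spatial context.
    Row i of both arrays must describe the same spot.
    """
    img = l2_normalize(image_emb)
    spot = l2_normalize(spot_emb)
    logits = img @ spot.T / temperature  # pairwise similarities, shape (n, n)
    diag = np.arange(logits.shape[0])    # matching pairs sit on the diagonal
    loss_image_to_spot = -log_softmax(logits, axis=1)[diag, diag].mean()
    loss_spot_to_image = -log_softmax(logits, axis=0)[diag, diag].mean()
    return 0.5 * (loss_image_to_spot + loss_spot_to_image)


# Toy data: 8 spots with 16-dimensional embeddings (illustrative values only).
rng = np.random.default_rng(0)
spot_emb = rng.normal(size=(8, 16))
image_emb = spot_emb + 0.01 * rng.normal(size=(8, 16))  # nearly aligned pairs

aligned_loss = symmetric_contrastive_loss(image_emb, spot_emb)
shuffled_loss = symmetric_contrastive_loss(image_emb[::-1], spot_emb)  # mismatched pairs
```

In this toy setup the loss for aligned pairs is lower than the loss for mismatched pairs, which is the signal that would pull the two modalities into a shared embedding space during training.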


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11952928
DOI: http://dx.doi.org/10.1093/bib/bbae551

Publication Analysis

Top Keywords (occurrence count)

gene expression: 16
contrastive learning: 12
multimodal contrastive: 8
spatial gene: 8
expression prediction: 8
spatial transcriptomics: 8
spatial: 7
expression: 5
learning spatial: 4
gene: 4

Similar Publications

Targeting the gut-liver axis with dietary polyphenols to ameliorate metabolic dysfunction-associated steatotic liver disease: advances in molecular mechanisms.

Crit Rev Food Sci Nutr

September 2025

Hunan Key Laboratory of Deep Processing and Quality Control of Cereals and Oils, State Key Laboratory of Utilization of Woody Oil Resource, College of Food Science and Engineering, Central South University of Forestry and Technology, Changsha, Hunan, China.

Metabolic dysfunction-associated steatotic liver disease (MASLD) is a condition that results from metabolic disorders. In addition to genetic factors, irregular and high-energy diets may also significantly contribute to its pathogenesis. Dietary habits can profoundly alter the composition of gut microbiota and metabolites.


Selenium is an essential trace element in many organisms but becomes toxic at elevated concentrations. At moderately increased, non-lethal levels, selenite triggers both selenium utilization and stress responses in microorganisms. However, the thresholds of such responses in archaea remain poorly understood.


Glycocins are a growing family of ribosomally synthesized and posttranslationally modified peptides (RiPPs) that are O- and/or S-glycosylated. Using a sequence similarity network of putative glycosyltransferases, the thg biosynthetic gene cluster was identified in the genome of Thermoanaerobacterium thermosaccharolyticum. Heterologous expression in Escherichia coli showed that the glycosyltransferase (ThgS) encoded in the biosynthetic gene cluster (BGC) adds N-acetyl-glucosamine (GlcNAc) to Ser and Cys residues of ThgA.


It is helpful for diagnostic purposes to improve our current knowledge of gut development and serum biochemistry in young piglets. This study investigated serum biochemistry, and gut site-specific patterns of short-chain fatty acids (SCFA) and expression of genes related to barrier function, innate immune response, antioxidative status and sensing of fatty and bile acids in suckling and newly weaned piglets. The experiment consisted of two replicate batches with 10 litters each.


Expression analysis of C-FOS and XRCC3 Thr241Met polymorphism in gastric cancer.

Cell Mol Biol (Noisy-le-grand)

September 2025

Department of Biology, College of Education for Pure Sciences, University of Kerbala, Kerbala, Iraq.

Gastric cancer is one of the causes of cancer-related death worldwide, and genetic and environmental factors are the most prominent causes of its pathogenesis. This paper examines the expression of the C-FOS gene.
