Causal Representation Learning from Multi-modal Biomedical Observations.

Yuewen Sun, Lingjing Kong, Guangyi Chen, Loka Li, Gongxu Luo, Zijian Li, Yixuan Zhang, Yujia Zheng, Mengyue Yang, Petar Stojanov, Eran Segal, Eric P Xing, Kun Zhang

ArXiv

Mohamed bin Zayed University of Artificial Intelligence.

Published: March 2025

Prevalent in biomedical applications (e.g., human phenotype research), multi-modal datasets can provide valuable insights into the underlying physiological mechanisms. However, current machine learning (ML) models designed to analyze these datasets often lack interpretability and identifiability guarantees, which are essential for biomedical research. Recent advances in causal representation learning have shown promise in identifying interpretable latent causal variables with formal theoretical guarantees. Unfortunately, most current work on multi-modal distributions either relies on restrictive parametric assumptions or yields only coarse identification results, which limits its applicability to biomedical research that favors a detailed understanding of the mechanisms. In this work, we aim to develop flexible identification conditions for multi-modal data and principled methods to facilitate the understanding of biomedical datasets. Theoretically, we consider a nonparametric latent distribution (cf. parametric assumptions in previous work) that allows for causal relationships across potentially different modalities. We establish identifiability guarantees for each latent component, extending the subspace identification results from previous work. Our key theoretical contribution is the structural sparsity of causal connections between modalities, which, as we will discuss, is natural for a large collection of biomedical systems. Empirically, we present a practical framework to instantiate our theoretical insights. We demonstrate the effectiveness of our approach through extensive experiments on both numerical and synthetic datasets. Results on a real-world human phenotype dataset are consistent with established biomedical research, validating our theoretical and methodological framework.

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11952583

Top Keywords: causal representation; representation learning; human phenotype; identifiability guarantees; parametric assumptions; previous work; biomedical; causal; learning multi-modal; multi-modal biomedical

Similar Publications

A guide to Bayesian networks software for structure and parameter learning, with a focus on causal discovery tools.

Front Syst Biol

August 2025

Minutia.AI Pte. Ltd., Singapore, Singapore.

Francesco Canonaco, Joverlyn Gaudillo, Nicole Astrologo, Fabio Stella, Enzo Acerbi

A representation of the cause-effect mechanism is needed to enable artificial intelligence to represent how the world works. Bayesian Networks (BNs) have proven to be an effective and versatile tool for this task. BNs require constructing a structure of dependencies among variables and learning the parameters that govern these relationships.

Combinatorial prediction of therapeutic perturbations using causally inspired neural networks.

Nat Biomed Eng

September 2025

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Guadalupe Gonzalez, Xiang Lin, Isuru Herath, Kirill Veselkov, Michael Bronstein

Phenotype-driven approaches identify disease-counteracting compounds by analysing the phenotypic signatures that distinguish diseased from healthy states. Here we introduce PDGrapher, a causally inspired graph neural network model that predicts combinatorial perturbagens (sets of therapeutic targets) capable of reversing disease phenotypes. Unlike methods that learn how perturbations alter phenotypes, PDGrapher solves the inverse problem and predicts the perturbagens needed to achieve a desired response by embedding disease cell states into networks, learning a latent representation of these states, and identifying optimal combinatorial perturbations.

Common neural choice signals reflect accumulated evidence, not confidence.

Cereb Cortex

August 2025

Brain and Cognition, KU Leuven, Tiensestraat 102, 3000 Leuven, Belgium.

Kobe Desender, Andi Smet, Deniz Erdil, Esna Mualla Gunay, Yvonne F Visser

Centro-parietal electroencephalogram signals (centro-parietal positivity and error positivity) correlate with the reported level of confidence. According to recent computational work, these signals reflect evidence that feeds into the computation of confidence rather than confidence itself. To test this prediction, we causally manipulated prior beliefs to selectively affect confidence, while leaving objective task performance unaffected.

Multi-layer matrix factorization for cancer subtyping using full and partial multi-omics dataset.

Brief Bioinform

August 2025

School of Computer Science, Xi'an Polytechnic University, 710048, Xi'an, China.

Yingxuan Ren, Fengtao Ren, Bo Yang

Cancer, with its inherent heterogeneity, is commonly categorized into distinct subtypes based on unique traits, cellular origins, and molecular markers specific to each type. However, current studies primarily rely on complete multi-omics datasets for predicting cancer subtypes, often overlooking predictive performance in cases where some omics data may be missing and neglecting implicit relationships across multiple layers of omics data integration. This paper introduces Multi-Layer Matrix Factorization (MLMF), a novel approach for cancer subtyping that employs multi-omics data clustering.

Development of the SCI-BodyMap-Measuring Mental Body Representations in Adults With Spinal Cord Injury: Protocol for Item Generation, Reliability, and Validity Testing.

JMIR Res Protoc

September 2025

Division of Physical Therapy and Rehabilitation Science, Department of Family Medicine and Community Health, Medical School, University of Minnesota-Twin Cities, Minneapolis, MN, United States.

Sydney Carpentier, Sara Bottale, Nicole Cenci, Mauro Cracchiolo, Daniele De Patre

Background: Approximately 69% of Americans with spinal cord injury (SCI) have neuropathic pain. Research suggests that impairments in mental body representations (MBRs; ie, representations of the body in the brain) likely contribute to neuropathic pain. Clinical trials in adults with SCI, focused on restoring MBR, led to improvements in sensation and movement as well as neuropathic pain relief.
