Article Abstract

Prevalent in biomedical applications (e.g., human phenotype research), multi-modal datasets can provide valuable insights into the underlying physiological mechanisms. However, current machine learning (ML) models designed to analyze these datasets often lack interpretability and identifiability guarantees, which are essential for biomedical research. Recent advances in causal representation learning have shown promise in identifying interpretable latent causal variables with formal theoretical guarantees. Unfortunately, most current work on multi-modal distributions either relies on restrictive parametric assumptions or yields only coarse identification results, limiting its applicability to biomedical research that favors a detailed understanding of the mechanisms. In this work, we aim to develop flexible identification conditions for multi-modal data and principled methods to facilitate the understanding of biomedical datasets. Theoretically, we consider a nonparametric latent distribution (cf. parametric assumptions in previous work) that allows for causal relationships across potentially different modalities. We establish identifiability guarantees for each latent component, extending the subspace identification results from previous work. Our key theoretical contribution is the structural sparsity of causal connections between modalities, which, as we will discuss, is natural for a large collection of biomedical systems. Empirically, we present a practical framework to instantiate our theoretical insights. We demonstrate the effectiveness of our approach through extensive experiments on both numerical and synthetic datasets. Results on a real-world human phenotype dataset are consistent with established biomedical research, validating our theoretical and methodological framework.

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11952583

Publication Analysis

Top Keywords

causal representation: 8
representation learning: 8
human phenotype: 8
identifiability guarantees: 8
parametric assumptions: 8
previous work: 8
biomedical: 7
causal: 5
learning multi-modal: 4
multi-modal biomedical: 4

Similar Publications

A representation of the cause-effect mechanism is needed to enable artificial intelligence to represent how the world works. Bayesian Networks (BNs) have proven to be an effective and versatile tool for this task. BNs require constructing a structure of dependencies among variables and learning the parameters that govern these relationships.

Phenotype-driven approaches identify disease-counteracting compounds by analysing the phenotypic signatures that distinguish diseased from healthy states. Here we introduce PDGrapher, a causally inspired graph neural network model that predicts combinatorial perturbagens (sets of therapeutic targets) capable of reversing disease phenotypes. Unlike methods that learn how perturbations alter phenotypes, PDGrapher solves the inverse problem and predicts the perturbagens needed to achieve a desired response by embedding disease cell states into networks, learning a latent representation of these states, and identifying optimal combinatorial perturbations.

Common neural choice signals reflect accumulated evidence, not confidence.

Cereb Cortex

August 2025

Brain and Cognition, KU Leuven, Tiensestraat 102, 3000 Leuven, Belgium.

Centro-parietal electroencephalogram signals (centro-parietal positivity and error positivity) correlate with the reported level of confidence. According to recent computational work, these signals reflect the evidence that feeds into the computation of confidence rather than confidence itself. To test this prediction, we causally manipulated prior beliefs to selectively affect confidence while leaving objective task performance unaffected.

Cancer, with its inherent heterogeneity, is commonly categorized into distinct subtypes based on unique traits, cellular origins, and molecular markers specific to each type. However, current studies primarily rely on complete multi-omics datasets for predicting cancer subtypes, often overlooking predictive performance in cases where some omics data may be missing and neglecting implicit relationships across multiple layers of omics data integration. This paper introduces Multi-Layer Matrix Factorization (MLMF), a novel approach for cancer subtyping that employs multi-omics data clustering.

Development of the SCI-BodyMap-Measuring Mental Body Representations in Adults With Spinal Cord Injury: Protocol for Item Generation, Reliability, and Validity Testing.

JMIR Res Protoc

September 2025

Division of Physical Therapy and Rehabilitation Science, Department of Family Medicine and Community Health, Medical School, University of Minnesota-Twin Cities, Minneapolis, MN, United States.

Background: Approximately 69% of Americans with spinal cord injury (SCI) have neuropathic pain. Research suggests that impairments in mental body representations (MBRs; ie, representations of the body in the brain) likely contribute to neuropathic pain. Clinical trials in adults with SCI, focused on restoring MBR, led to improvements in sensation and movement as well as neuropathic pain relief.
