Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Observational zero-inflated count data arise in a wide range of areas such as genomics. One of the common research questions is to identify causal relationships by learning the structure of a sparse directed acyclic graph (DAG). While structure learning of DAGs has been an active research area, existing methods do not adequately account for excessive zeros and therefore are not suitable for modeling zero-inflated count data. Moreover, it is often interesting to study differences in the causal networks for data collected from two experimental groups (control vs treatment). To explicitly account for zero-inflation and identify differential causal networks, we propose a novel Bayesian differential zero-inflated negative binomial DAG (DAG0) model. We prove that the causal relationships under the proposed DAG0 are fully identifiable from purely observational, cross-sectional data, using a general proof technique that is applicable beyond the proposed model. Bayesian inference based on parallel-tempered Markov chain Monte Carlo is developed to efficiently explore the multi-modal posterior landscape. We demonstrate the utility of the proposed DAG0 by comparing it with state-of-the-art alternative methods through extensive simulations. An application in a single-cell RNA-sequencing dataset generated under two experimental groups finds some interesting results that appear to be consistent with existing knowledge. A user-friendly R package that implements DAG0 is available at https://github.com/junsoukchoi/BayesDAG0.git.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12395422PMC
http://dx.doi.org/10.1214/25-aoas2042DOI Listing

Publication Analysis

Top Keywords

bayesian differential
8
differential causal
8
directed acyclic
8
observational zero-inflated
8
zero-inflated count
8
count data
8
causal relationships
8
causal networks
8
experimental groups
8
proposed dag0
8

Similar Publications

Population genetics plays a critical role in creating policies for managing fisheries, conservation, and development of aquaculture. The golden snapper, Lutjanus johnii (Bloch, 1792), is a highly commercial and aquaculture important snapper species. This study used mitochondrial markers D-loop (151 specimens) and Cytochrome b (Cyt-b, 120 specimens) from 10 populations, including populations from the east South China Sea, the west South China Sea and the Strait of Malacca to investigate the genetic diversity, population connectivity, and historical demography of L.

View Article and Find Full Text PDF

Most of the United States (US) population resides in cities, where they are subjected to the urban heat island effect. In this study, we develop a method to estimate hourly air temperatures at resolution, improving exposure assessment of US population when compared to existing gridded products. We use an extensive network of personal weather stations to capture the intra-urban variability.

View Article and Find Full Text PDF

Lentinula edodes (shiitake mushroom) is a widely cultivated edible and medicinal fungus, valued for its bioactive compounds. While East Asian strains have been well studied, Indian populations remain under-characterized. This study explores the genetic and functional diversity of five Indian-origin L.

View Article and Find Full Text PDF

Motivation: The advent of next-generation sequencing-based spatially resolved transcriptomics (SRT) techniques has reshaped genomic studies by enabling high-throughput gene expression profiling while preserving spatial and morphological context. Understanding gene functions and interactions in different spatial domains is crucial, as it can enhance our comprehension of biological mechanisms, such as cancer-immune interactions and cell differentiation in various regions. It is necessary to cluster tissue regions into distinct spatial domains and identify discriminating genes that elucidate the clustering result, referred to as spatial domain-specific discriminating genes (DGs).

View Article and Find Full Text PDF

The Ancestral Recombination Graph (ARG), which describes the genealogical history of a sample of genomes, is a vital tool in population genomics and biomedical research. Recent advancements have substantially increased ARG reconstruction scalability, but they rely on approximations that can reduce accuracy, especially under model misspecification. Moreover, they reconstruct only a single ARG topology and cannot quantify the considerable uncertainty associated with ARG inferences.

View Article and Find Full Text PDF