98%
921
2 minutes
20
Observational zero-inflated count data arise in a wide range of areas such as genomics. One of the common research questions is to identify causal relationships by learning the structure of a sparse directed acyclic graph (DAG). While structure learning of DAGs has been an active research area, existing methods do not adequately account for excessive zeros and therefore are not suitable for modeling zero-inflated count data. Moreover, it is often interesting to study differences in the causal networks for data collected from two experimental groups (control vs treatment). To explicitly account for zero-inflation and identify differential causal networks, we propose a novel Bayesian differential zero-inflated negative binomial DAG (DAG0) model. We prove that the causal relationships under the proposed DAG0 are fully identifiable from purely observational, cross-sectional data, using a general proof technique that is applicable beyond the proposed model. Bayesian inference based on parallel-tempered Markov chain Monte Carlo is developed to efficiently explore the multi-modal posterior landscape. We demonstrate the utility of the proposed DAG0 by comparing it with state-of-the-art alternative methods through extensive simulations. An application in a single-cell RNA-sequencing dataset generated under two experimental groups finds some interesting results that appear to be consistent with existing knowledge. A user-friendly R package that implements DAG0 is available at https://github.com/junsoukchoi/BayesDAG0.git.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12395422 | PMC |
http://dx.doi.org/10.1214/25-aoas2042 | DOI Listing |
Genetica
September 2025
Faculty of Fisheries and Aquaculture Sciences, Universiti Malaysia Terengganu, Kuala Nerus, Terengganu, Malaysia.
Population genetics plays a critical role in creating policies for managing fisheries, conservation, and development of aquaculture. The golden snapper, Lutjanus johnii (Bloch, 1792), is a highly commercial and aquaculture important snapper species. This study used mitochondrial markers D-loop (151 specimens) and Cytochrome b (Cyt-b, 120 specimens) from 10 populations, including populations from the east South China Sea, the west South China Sea and the Strait of Malacca to investigate the genetic diversity, population connectivity, and historical demography of L.
View Article and Find Full Text PDFMost of the United States (US) population resides in cities, where they are subjected to the urban heat island effect. In this study, we develop a method to estimate hourly air temperatures at resolution, improving exposure assessment of US population when compared to existing gridded products. We use an extensive network of personal weather stations to capture the intra-urban variability.
View Article and Find Full Text PDFComput Biol Chem
September 2025
Department of Biotechnology, Deenbandhu Chhotu Ram University of Science & Technology, Murthal, Haryana 131039, India. Electronic address:
Lentinula edodes (shiitake mushroom) is a widely cultivated edible and medicinal fungus, valued for its bioactive compounds. While East Asian strains have been well studied, Indian populations remain under-characterized. This study explores the genetic and functional diversity of five Indian-origin L.
View Article and Find Full Text PDFBioinformatics
September 2025
Department of Mathematical Sciences, The University of Texas at Dallas, TX United States.
Motivation: The advent of next-generation sequencing-based spatially resolved transcriptomics (SRT) techniques has reshaped genomic studies by enabling high-throughput gene expression profiling while preserving spatial and morphological context. Understanding gene functions and interactions in different spatial domains is crucial, as it can enhance our comprehension of biological mechanisms, such as cancer-immune interactions and cell differentiation in various regions. It is necessary to cluster tissue regions into distinct spatial domains and identify discriminating genes that elucidate the clustering result, referred to as spatial domain-specific discriminating genes (DGs).
View Article and Find Full Text PDFNat Genet
September 2025
Department of Statistics, University of California, Berkeley, CA, USA.
The Ancestral Recombination Graph (ARG), which describes the genealogical history of a sample of genomes, is a vital tool in population genomics and biomedical research. Recent advancements have substantially increased ARG reconstruction scalability, but they rely on approximations that can reduce accuracy, especially under model misspecification. Moreover, they reconstruct only a single ARG topology and cannot quantify the considerable uncertainty associated with ARG inferences.
View Article and Find Full Text PDF