Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Motivation: Gene set enrichment analysis has been shown to be effective in identifying relevant biological pathways underlying complex diseases. Existing approaches lack the ability to quantify the enrichment levels accurately, hence preventing the enrichment information to be further utilized in both upstream and downstream analyses. A modernized and rigorous approach for gene set enrichment analysis that emphasizes both hypothesis testing and enrichment estimation is much needed.

Results: We propose a novel computational method, Bayesian Analysis of Gene Set Enrichment (BAGSE), for gene set enrichment analysis. BAGSE is built on a Bayesian hierarchical model and fully accounts for the uncertainty embedded in the association evidence of individual genes. We adopt an empirical Bayes inference framework to fit the proposed hierarchical model by implementing an efficient EM algorithm. Through simulation studies, we illustrate that BAGSE yields accurate enrichment quantification while achieving similar power as the state-of-the-art methods. Further simulation studies show that BAGSE can effectively utilize the enrichment information to improve the power in gene discovery. Finally, we demonstrate the application of BAGSE in analyzing real data from a differential expression experiment and a transcriptome-wide association study. Our results indicate that the proposed statistical framework is effective in aiding the discovery of potentially causal pathways and gene networks.

Availability And Implementation: BAGSE is implemented using the C++ programing language and is freely available from https://github.com/xqwen/bagse/. Simulated and real data used in this paper are also available at the Github repository for reproducibility purposes.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7523653PMC
http://dx.doi.org/10.1093/bioinformatics/btz831DOI Listing

Publication Analysis

Top Keywords

gene set
20
set enrichment
20
enrichment analysis
16
hierarchical model
12
enrichment
10
bayesian hierarchical
8
approach gene
8
simulation studies
8
real data
8
bagse
7

Similar Publications

Whole blood (WB) transcriptomics offers a minimal-invasive method to assess patients' immune system. This study aimed to identify transcriptional patterns in WB associated with clinical outcomes in patients treated with immune checkpoint inhibitors (ICIs). We performed RNA-sequencing on pre-treatment WB samples from 145 patients with advanced cancer.

View Article and Find Full Text PDF

The size of microbial sequence databases continues to grow beyond the abilities of existing alignment tools. We introduce LexicMap, a nucleotide sequence alignment tool for efficiently querying moderate-length sequences (>250 bp) such as a gene, plasmid or long read against up to millions of prokaryotic genomes. We construct a small set of probe k-mers, which are selected to efficiently sample the entire database to be indexed such that every 250-bp window of each database genome contains multiple seed k-mers, each with a shared prefix with one of the probes.

View Article and Find Full Text PDF

Uncovering the evolving arms race between host immunity and HIV-1.

Trends Immunol

September 2025

Department of Life Science, University of Seoul, Seoul, Republic of Korea. Electronic address:

Despite an effective combination of antiretroviral therapy, HIV persists as a lifelong infection and global health threat. The human host equips restriction factors and interferon (IFN)-stimulated genes that target every step of the viral life cycle. However, HIV-1 has evolved a coordinated immune evasion strategy using a limited set of accessory proteins with distinct antagonistic functions.

View Article and Find Full Text PDF

EASY-edit: a toolbox for high-throughput single-step custom genetic editing in bacteria.

Nucleic Acids Res

September 2025

Expression génétique microbienne, UMR8261 CNRS, Université Paris Cité, Institut de Biologie Physico-Chimique, Paris 75005, France.

Targeted gene editing can be achieved using CRISPR-Cas9-assisted recombineering. However, high-efficiency editing requires careful optimization for each locus to be modified, which can be tedious and time-consuming. In this work, we developed a simple, fast and cheap method: Engineered Assembly of SYnthetic operons for targeted editing (EASY-edit) in Escherichia coli.

View Article and Find Full Text PDF

Prediction and Characterization of Genetically-Regulated Expression of Asthma Tissues from African-Ancestry Populations.

J Allergy Clin Immunol

September 2025

Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA; Department of Epidemiology, Colorado School of Public Health, Aurora, CO, USA. Electronic address:

Background: Genetic control of gene expression in asthma-related tissues is not well-characterized, particularly for African-ancestry populations, limiting advancement in our understanding of the increased prevalence and severity of asthma in those populations.

Objective: To create novel transcriptome prediction models for asthma tissues (nasal epithelium and CD4+ T cells) and apply them in transcriptome-wide association study to discover candidate asthma genes.

Methods: We developed and validated gene expression prediction databases for unstimulated CD4+ T cells and nasal epithelium using an elastic net framework.

View Article and Find Full Text PDF