98%
921
2 minutes
20
Motivation: Gene set enrichment analysis has been shown to be effective in identifying relevant biological pathways underlying complex diseases. Existing approaches lack the ability to quantify the enrichment levels accurately, hence preventing the enrichment information to be further utilized in both upstream and downstream analyses. A modernized and rigorous approach for gene set enrichment analysis that emphasizes both hypothesis testing and enrichment estimation is much needed.
Results: We propose a novel computational method, Bayesian Analysis of Gene Set Enrichment (BAGSE), for gene set enrichment analysis. BAGSE is built on a Bayesian hierarchical model and fully accounts for the uncertainty embedded in the association evidence of individual genes. We adopt an empirical Bayes inference framework to fit the proposed hierarchical model by implementing an efficient EM algorithm. Through simulation studies, we illustrate that BAGSE yields accurate enrichment quantification while achieving similar power as the state-of-the-art methods. Further simulation studies show that BAGSE can effectively utilize the enrichment information to improve the power in gene discovery. Finally, we demonstrate the application of BAGSE in analyzing real data from a differential expression experiment and a transcriptome-wide association study. Our results indicate that the proposed statistical framework is effective in aiding the discovery of potentially causal pathways and gene networks.
Availability And Implementation: BAGSE is implemented using the C++ programing language and is freely available from https://github.com/xqwen/bagse/. Simulated and real data used in this paper are also available at the Github repository for reproducibility purposes.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7523653 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btz831 | DOI Listing |
Cancer Immunol Immunother
September 2025
Department of Medical Oncology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.
Whole blood (WB) transcriptomics offers a minimal-invasive method to assess patients' immune system. This study aimed to identify transcriptional patterns in WB associated with clinical outcomes in patients treated with immune checkpoint inhibitors (ICIs). We performed RNA-sequencing on pre-treatment WB samples from 145 patients with advanced cancer.
View Article and Find Full Text PDFNat Biotechnol
September 2025
European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK.
The size of microbial sequence databases continues to grow beyond the abilities of existing alignment tools. We introduce LexicMap, a nucleotide sequence alignment tool for efficiently querying moderate-length sequences (>250 bp) such as a gene, plasmid or long read against up to millions of prokaryotic genomes. We construct a small set of probe k-mers, which are selected to efficiently sample the entire database to be indexed such that every 250-bp window of each database genome contains multiple seed k-mers, each with a shared prefix with one of the probes.
View Article and Find Full Text PDFTrends Immunol
September 2025
Department of Life Science, University of Seoul, Seoul, Republic of Korea. Electronic address:
Despite an effective combination of antiretroviral therapy, HIV persists as a lifelong infection and global health threat. The human host equips restriction factors and interferon (IFN)-stimulated genes that target every step of the viral life cycle. However, HIV-1 has evolved a coordinated immune evasion strategy using a limited set of accessory proteins with distinct antagonistic functions.
View Article and Find Full Text PDFNucleic Acids Res
September 2025
Expression génétique microbienne, UMR8261 CNRS, Université Paris Cité, Institut de Biologie Physico-Chimique, Paris 75005, France.
Targeted gene editing can be achieved using CRISPR-Cas9-assisted recombineering. However, high-efficiency editing requires careful optimization for each locus to be modified, which can be tedious and time-consuming. In this work, we developed a simple, fast and cheap method: Engineered Assembly of SYnthetic operons for targeted editing (EASY-edit) in Escherichia coli.
View Article and Find Full Text PDFJ Allergy Clin Immunol
September 2025
Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA; Department of Epidemiology, Colorado School of Public Health, Aurora, CO, USA. Electronic address:
Background: Genetic control of gene expression in asthma-related tissues is not well-characterized, particularly for African-ancestry populations, limiting advancement in our understanding of the increased prevalence and severity of asthma in those populations.
Objective: To create novel transcriptome prediction models for asthma tissues (nasal epithelium and CD4+ T cells) and apply them in transcriptome-wide association study to discover candidate asthma genes.
Methods: We developed and validated gene expression prediction databases for unstimulated CD4+ T cells and nasal epithelium using an elastic net framework.