98%
921
2 minutes
20
Summary: Gene set scoring (or enrichment) is a common dimension reduction task in bioinformatics that can be focused on the differences between groups or at the single sample level. Gene sets can represent biological functions, molecular pathways, cell identities, and more. Gene set scores are context dependent values that are useful for interpreting biological changes following experiments or perturbations. Single sample scoring produces a set of scores, one for each member of a group, which can be analyzed with statistical models that can include additional clinically important factors such as gender or age. However, the sparsity and technical noise of single-cell expression measures create difficulties for these methods, which were originally designed for bulk expression profiling (microarrays, RNAseq). This can be greatly remedied by first applying a smoothing transformation that shares gene measure information within transcriptomic neighborhoods. In this work, we use the nearest neighbor graph of cells for matrix smoothing to produce high quality gene set scores on a per-cell, per-group, level which is useful for visualization and statistical analysis.
Availability And Implementation: The gssnng software is available using the python package index (PyPI) and works with Scanpy AnnData objects. It can be installed using "pip install gssnng." More information and demo notebooks: see https://github.com/IlyaLab/gssnng.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10599965 | PMC |
http://dx.doi.org/10.1093/bioadv/vbad150 | DOI Listing |
Cancer Immunol Immunother
September 2025
Department of Medical Oncology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.
Whole blood (WB) transcriptomics offers a minimal-invasive method to assess patients' immune system. This study aimed to identify transcriptional patterns in WB associated with clinical outcomes in patients treated with immune checkpoint inhibitors (ICIs). We performed RNA-sequencing on pre-treatment WB samples from 145 patients with advanced cancer.
View Article and Find Full Text PDFNat Biotechnol
September 2025
European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK.
The size of microbial sequence databases continues to grow beyond the abilities of existing alignment tools. We introduce LexicMap, a nucleotide sequence alignment tool for efficiently querying moderate-length sequences (>250 bp) such as a gene, plasmid or long read against up to millions of prokaryotic genomes. We construct a small set of probe k-mers, which are selected to efficiently sample the entire database to be indexed such that every 250-bp window of each database genome contains multiple seed k-mers, each with a shared prefix with one of the probes.
View Article and Find Full Text PDFTrends Immunol
September 2025
Department of Life Science, University of Seoul, Seoul, Republic of Korea. Electronic address:
Despite an effective combination of antiretroviral therapy, HIV persists as a lifelong infection and global health threat. The human host equips restriction factors and interferon (IFN)-stimulated genes that target every step of the viral life cycle. However, HIV-1 has evolved a coordinated immune evasion strategy using a limited set of accessory proteins with distinct antagonistic functions.
View Article and Find Full Text PDFNucleic Acids Res
September 2025
Expression génétique microbienne, UMR8261 CNRS, Université Paris Cité, Institut de Biologie Physico-Chimique, Paris 75005, France.
Targeted gene editing can be achieved using CRISPR-Cas9-assisted recombineering. However, high-efficiency editing requires careful optimization for each locus to be modified, which can be tedious and time-consuming. In this work, we developed a simple, fast and cheap method: Engineered Assembly of SYnthetic operons for targeted editing (EASY-edit) in Escherichia coli.
View Article and Find Full Text PDFJ Allergy Clin Immunol
September 2025
Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA; Department of Epidemiology, Colorado School of Public Health, Aurora, CO, USA. Electronic address:
Background: Genetic control of gene expression in asthma-related tissues is not well-characterized, particularly for African-ancestry populations, limiting advancement in our understanding of the increased prevalence and severity of asthma in those populations.
Objective: To create novel transcriptome prediction models for asthma tissues (nasal epithelium and CD4+ T cells) and apply them in transcriptome-wide association study to discover candidate asthma genes.
Methods: We developed and validated gene expression prediction databases for unstimulated CD4+ T cells and nasal epithelium using an elastic net framework.