A fast and globally optimal solution for RNA-seq quantification.

Brief Bioinform

School of Life Sciences, Southern University of Science and Technology, 1088 Xueyuan Blvd, Shenzhen 518055, Guangdong, China.

Published: September 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Alignment-based RNA-seq quantification methods typically involve a time-consuming alignment process prior to estimating transcript abundances. In contrast, alignment-free RNA-seq quantification methods bypass this step, resulting in significant speed improvements. Existing alignment-free methods rely on the Expectation-Maximization (EM) algorithm for estimating transcript abundances. However, EM algorithms only guarantee locally optimal solutions, leaving room for further accuracy improvement by finding a globally optimal solution. In this study, we present TQSLE, the first alignment-free RNA-seq quantification method that provides a globally optimal solution for transcript abundances estimation. TQSLE adopts a two-step approach: first, it constructs a k-mer frequency matrix A for the reference transcriptome and a k-mer frequency vector b for the RNA-seq reads; then, it directly estimates transcript abundances by solving the linear equation ATAx = ATb. We evaluated the performance of TQSLE using simulated and real RNA-seq data sets and observed that, despite comparable speed to other alignment-free methods, TQSLE outperforms them in terms of accuracy. TQSLE is freely available at https://github.com/yhg926/TQSLE.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bbad298DOI Listing

Publication Analysis

Top Keywords

rna-seq quantification
16
transcript abundances
16
globally optimal
12
optimal solution
12
quantification methods
8
estimating transcript
8
alignment-free rna-seq
8
alignment-free methods
8
k-mer frequency
8
rna-seq
6

Similar Publications

Omics Insights Into the Effects of Highbush Blueberry and Cranberry Crop Agroecosystems on Honey Bee Health and Physiology.

Proteomics

September 2025

Department of Biochemistry and Molecular Biology, Michael Smith Laboratories, Life Sciences Institute, University of British Columbia, Vancouver, British Columbia, Canada.

Honey bees (Apis mellifera) are vital pollinators in fruit-producing agroecosystems like highbush blueberry (HBB) and cranberry (CRA). However, their health is threatened by multiple interacting stressors, including pesticides, pathogens, and nutritional changes. We tested the hypothesis that distinct agricultural ecosystems-with different combinations of agrochemical exposure, pathogen loads, and floral resources-elicit ecosystem-specific, tissue-level molecular responses in honey bees.

View Article and Find Full Text PDF

, a marine pathogen, employs biofilm formation to enhance environmental persistence and transmission. Biofilm development is intricately regulated by cyclic di-GMP (c-di-GMP), whose levels are controlled by diguanylate cyclases (DGCs) and phosphodiesterases (PDEs). This study elucidates the coordinated regulatory roles of the LysR-type transcriptional regulator AcsS and the PDE TpdA in biofilm formation.

View Article and Find Full Text PDF

Scnanoseq: an nf-core pipeline for oxford nanopore single-cell RNA-sequencing.

Bioinformatics

September 2025

Institutional Research Core Program-Biological Data Science Core, University of Alabama at Birmingham, Birmingham, AL United States.

Motivation: Recent advancements in long-read single-cell RNA sequencing (scRNA-seq) have facilitated the quantification of full-length transcripts and isoforms at the single-cell level. Historically, long-read data would need to be complemented with short-read single-cell data in order to overcome the higher sequencing errors to correctly identify cellular barcodes and unique molecular identifiers. Improvements in Oxford Nanopore sequencing, and development of novel computational methods have removed this requirement.

View Article and Find Full Text PDF

Objective: To identify genes related to eravacycline resistance in () and to provide a theoretical basis for the study of eravacycline resistance mechanisms in and the development of new antibiotics.

Methods: The study employed an integrated omics approach: (1) antimicrobial susceptibility profiling via broth microdilution to determine baseline MICs for eravacycline and comparator drugs; (2) Induction of resistance in clinical isolates (WJ_4, WJ_14, WJ_18) with low eravacycline MICs through serial passage in escalating drug concentrations; (3) Transcriptome sequencing (RNA-seq) and whole-genome sequencing (WGS) of -induced resistant strains (WJ_4a, WJ_14a, WJ_18a) and a clinical high-MIC isolate (WJ_97); (4) Bioinformatics analyses, including differential gene expression screening (with |log2(fold change)| > 2 and FDR-adjusted p < 0.05), SNP detection via GATK, and copy number variation (CNV) quantification using CCNE-acc to identify and compare resistance-related genetic alterations.

View Article and Find Full Text PDF

eQTL mapping and TWAS are widely used to contextualize GWAS, yet the impact of RNA-seq processing choices remains unexplored. We find that RNA-seq quantification method and transcriptomic reference substantially affect eQTL detection and gene expression prediction with significant downstream impact on colocalization and TWAS results. Our findings demonstrate that seemingly minor methodological decisions substantially affect these common analyses, highlighting the need for standardized practices to ensure reproducible genetic association studies.

View Article and Find Full Text PDF