Article Abstract

Read alignment is an important step in RNA-seq analysis because its result forms the basis for downstream analyses. However, recent studies have shown that published alignment tools vary in mapping sensitivity and do not necessarily align all the reads that should have been aligned, a problem we term the false-negative non-alignment problem. Here we present Scavenger, a Python-based bioinformatics pipeline for recovering unaligned reads using a novel mechanism in which a putative alignment location is discovered based on sequence similarity between aligned and unaligned reads. We show that Scavenger can recover unaligned reads in a range of simulated and real RNA-seq datasets, including single-cell RNA-seq data. We find that recovered reads tend to contain more genetic variants with respect to the reference genome than previously aligned reads, indicating that divergence between personal and reference genomes plays a role in the false-negative non-alignment problem. Even when the number of recovered reads is small relative to the total number of reads, adding them can affect downstream analyses, especially estimates of expression and differential expression for lowly expressed genes, such as pseudogenes.
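The core idea in the abstract, assigning an unaligned read a putative location borrowed from a similar aligned read, can be illustrated with a toy sketch. This is not Scavenger's implementation: shared k-mer voting is swapped in here as a simple stand-in for a sequence similarity search, and every name (`kmers`, `build_index`, `putative_locations`), the parameters `k` and `min_shared`, and the input layout are assumptions made for illustration only.

```python
from collections import Counter, defaultdict


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(aligned, k):
    """Map each k-mer to the ids of the aligned reads that contain it."""
    index = defaultdict(set)
    for read_id, (seq, _location) in aligned.items():
        for km in kmers(seq, k):
            index[km].add(read_id)
    return index


def putative_locations(aligned, unaligned, k=4, min_shared=0.5):
    """Suggest a location for each unaligned read (hypothetical helper).

    aligned:   {read_id: (sequence, (chrom, pos))}
    unaligned: {read_id: sequence}
    A read inherits the location of the aligned read sharing the most
    k-mers with it, provided at least `min_shared` of its own k-mers match.
    """
    index = build_index(aligned, k)
    result = {}
    for read_id, seq in unaligned.items():
        read_kmers = kmers(seq, k)
        if not read_kmers:
            continue  # read shorter than k: nothing to compare
        votes = Counter()
        for km in read_kmers:
            for aligned_id in index.get(km, ()):
                votes[aligned_id] += 1
        if not votes:
            continue  # no similar aligned read found
        best_id, shared = votes.most_common(1)[0]
        if shared / len(read_kmers) >= min_shared:
            result[read_id] = aligned[best_id][1]
    return result


# Toy data: u1 differs from a1 by a single base; u2 resembles nothing.
aligned = {
    "a1": ("ACGTACGTTAGC", ("chr1", 100)),
    "a2": ("TTTTGGGGCCCC", ("chr2", 500)),
}
unaligned = {"u1": "ACGTACCTTAGC", "u2": "GATTACAGATTA"}
print(putative_locations(aligned, unaligned))
```

In a real pipeline the putative location would only be a hint: the read would still need to be re-aligned against the reference around that location and checked before being counted as recovered.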

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459848
DOI: http://dx.doi.org/10.12688/f1000research.19426.2

Publication Analysis

Top Keywords

unaligned reads: 16
recovered reads: 12
reads: 11
similarity aligned: 8
aligned reads: 8
downstream analyses: 8
false-negative non-alignment: 8
non-alignment problem: 8
scavenger pipeline: 4
pipeline recovery: 4

Similar Publications

Background: The presence of microbes within healthy human internal organs remains in question. Our study seeks to discern microbial signatures within normal human internal tissues using data from the Genotype-Tissue Expression (GTEx) consortium. Machine learning (ML) models were developed to classify each tissue type based solely on microbial profiles, with the identification of tissue-specific microbial signatures suggesting the presence of distinct microbial communities inside tissues.

Three modes of viral adaption by the heart.

Sci Adv

November 2024

Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA.

Viruses elicit long-term adaptive responses in the tissues they infect. Understanding viral adaptions in humans is difficult in organs such as the heart, where primary infected material is not routinely collected. In search of asymptomatic infections with accompanying host adaptions, we mined for cardio-pathogenic viruses in the unaligned reads of nearly 1000 human hearts profiled by RNA sequencing.

While several well-established quality control (QC) tools are available for short-read sequencing data, there is a general paucity of computational tools that provide long-read metrics in a fast and comprehensive manner across all major sequencing platforms (such as PacBio, Oxford Nanopore, Illumina Complete Long Read) and data formats (such as ONT POD5, FAST5, basecall summary files and PacBio unaligned BAM). Additionally, none of the current tools support summarizing Oxford Nanopore basecall signal or comprehensive base modification (methylation) information from genomic data. Furthermore, a single PromethION flowcell on the Oxford Nanopore platform can now generate terabytes of signal data, which cannot be handled by existing tools designed for small-scale flowcells.

In the current study, high-throughput sequencing (HTS) was used to identify viruses associated with Kinnow mandarin plants exhibiting yellow vein clearing, mottling, and chlorosis symptoms at the experimental farm of the ICAR-Indian Agricultural Research Institute, New Delhi, India. During November 2022, leaf samples of symptomatic and asymptomatic Kinnow mandarin trees were collected and subjected to HTS, and one representative symptomatic sample was subjected to leaf-dip electron microscopy (EM). In the EM results, flexuous virus particles typical of mandarivirus were observed.
