98%
921
2 minutes
20
Read alignment is an important step in RNA-seq analysis as the result of alignment forms the basis for downstream analyses. However, recent studies have shown that published alignment tools have variable mapping sensitivity and do not necessarily align all the reads which should have been aligned, a problem we termed as the false-negative non-alignment problem. Here we present Scavenger, a python-based bioinformatics pipeline for recovering unaligned reads using a novel mechanism in which a putative alignment location is discovered based on sequence similarity between aligned and unaligned reads. We showed that Scavenger could recover unaligned reads in a range of simulated and real RNA-seq datasets, including single-cell RNA-seq data. We found that recovered reads tend to contain more genetic variants with respect to the reference genome compared to previously aligned reads, indicating that divergence between personal and reference genomes plays a role in the false-negative non-alignment problem. Even when the number of recovered reads is relatively small compared to the total number of reads, the addition of these recovered reads can impact downstream analyses, especially in terms of estimating the expression and differential expression of lowly expressed genes, such as pseudogenes.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7459848 | PMC |
http://dx.doi.org/10.12688/f1000research.19426.2 | DOI Listing |
Front Microbiol
January 2025
DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece.
Background: The presence of microbes within healthy human internal organs still remains under question. Our study endeavors to discern microbial signatures within normal human internal tissues using data from the Genotype-Tissue Expression (GTEx) consortium. Machine learning (ML) models were developed to classify each tissue type based solely on microbial profiles, with the identification of tissue-specific microbial signatures suggesting the presence of distinct microbial communities inside tissues.
View Article and Find Full Text PDFSci Adv
November 2024
Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA.
Viruses elicit long-term adaptive responses in the tissues they infect. Understanding viral adaptions in humans is difficult in organs such as the heart, where primary infected material is not routinely collected. In search of asymptomatic infections with accompanying host adaptions, we mined for cardio-pathogenic viruses in the unaligned reads of nearly 1000 human hearts profiled by RNA sequencing.
View Article and Find Full Text PDFWhile several well-established quality control (QC) tools are available for short reads sequencing data, there is a general paucity of computational tools that provide long read metrics in a fast and comprehensive manner across all major sequencing platforms (such as PacBio, Oxford Nanopore, Illumina Complete Long Read) and data formats (such as ONT POD5, FAST5, basecall summary files and PacBio unaligned BAM). Additionally, none of the current tools provide support for summarizing Oxford Nanopore basecall signal or comprehensive base modification (methylation) information from genomic data. Furthermore, nowadays a single PromethION flowcell on the Oxford Nanopore platform can generate terabytes of signal data, which cannot be handled by existing tools designed for small-scale flowcells.
View Article and Find Full Text PDF3 Biotech
June 2024
Advanced Centre for Plant Virology, Division of Plant Pathology, ICAR-Indian Agricultural Research Institute, New Delhi, 110012 India.
Unlabelled: In the current study, high-throughput sequencing (HTS) was used to identify viruses associated with the Kinnow mandarin () plants exhibiting yellow vein clearing, mottling, and chlorosis symptoms at experimental farm of ICAR-Indian Agricultural Research Institute, New Delhi, India. During November 2022, leaf samples of symptomatic and asymptomatic Kinnow mandarin trees were collected, subjected to HTS and one of the representative symptomatic samples was subjected to leaf-dip electron microscopy (EM). In the EM results, flexuous virus particles typical of mandarivirus were observed.
View Article and Find Full Text PDFbioRxiv
March 2024
Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA.
Viruses elicit long-term adaptive responses in the tissues they infect. Understanding viral adaptions in humans is difficult in organs such as the heart, where primary infected material is not routinely collected. In search of asymptomatic infections with accompanying host adaptions, we mined for cardio-pathogenic viruses in the unaligned reads of nearly one thousand human hearts profiled by RNA sequencing.
View Article and Find Full Text PDF