98%
921
2 minutes
20
We present ViPRA-Haplo, a de novo strain-specific assembly workflow for reconstructing viral haplotypes in a viral population from paired-end next generation sequencing (NGS) data. The proposed Viral Path Reconstruction Algorithm (ViPRA) generates a subset of paths from a De Bruijn graph of reads using the pairing information of reads. The paths generated by ViPRA are an over-estimation of the true contigs. We propose two refinement methods to obtain an optimal set of contigs representing viral haplotypes. The first method clusters paths reconstructed by ViPRA using VSEARCH Deorowicz et al. 2015 based on sequence similarity, while the second method, MLEHaplo, generates a maximum likelihood estimate of viral populations. We evaluated our pipeline on both simulated and real viral quasispecies data from HIV (and real data from SARS-COV-2). Experimental results show that ViPRA-Haplo, although still an overestimation in the number of true contigs, outperforms the existing tool, PEHaplo, providing up to 9% better genome coverage on HIV real data. In addition, ViPRA-Haplo also retains higher diversity of the viral population as demonstrated by the presence of a higher percentage of contigs less than 1000 base pairs (bps), which also contain k-mers with counts less than 100 (representing rarer sequences), which are absent in PEHaplo. For SARS-CoV-2 sequencing data, ViPRA-Haplo reconstructs contigs that cover more than 90% of the reference genome and were able to validate known SARS-CoV-2 strains in the sequencing data.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCBB.2024.3374595 | DOI Listing |
Head Neck Pathol
September 2025
Department of Laboratory Medicine and Pathology, Mayo Clinic, 4500 San Pablo Road, Jacksonville, FL, 32224, USA.
Myoepithelial carcinoma (MECA) is a malignant neoplasm composed exclusively of myoepithelial cells and accounts for less than 1% of all salivary gland tumors. Its diagnosis is often challenging due to histologic overlaps with benign lesions and its variable morphologic presentation. Although molecular profiling has emerged as a valuable tool in salivary gland tumor classification, the genetic landscape of MECA remains incompletely defined.
View Article and Find Full Text PDFFunct Integr Genomics
September 2025
Zhengzhou Research Base, State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Zhengzhou University/Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Zhengzhou, China.
In this study, a comprehensive genome-wide identification and analysis of the aldo-keto reductase (AKR) gene family was performed to explore the role of Gossypium hirsutumAKR40 under salt stress in cotton. A total of 249 AKR genes were identified with uneven distribution on the chromosomes in four cotton species. The diversity and evolutionary relationship of the cotton AKR gene family was identified using physio-chemical analysis, phylogenetic tree construction, conserved motif analysis, chromosomal localization, prediction of cis-acting elements, and calculation of evolutionary selection pressure under 300 mM NaCl stress.
View Article and Find Full Text PDFPhotosynth Res
September 2025
College of Life Sciences, Shanghai Normal University, Shanghai, 200235, China.
Euglena sanguinea (Ehrenberg 1831) is one of the earliest reported species within the genus Euglena. Its prolific proliferation leading to red algal bloom has garnered significant scientific attention due to its ecological and environmental impacts. Despite this, research on E.
View Article and Find Full Text PDFJ Ultrasound Med
September 2025
Department of Ultrasound, Donghai Hospital Affiliated to Kangda College of Nanjing Medical University, Lianyungang, China.
Objective: The aim of this study is to evaluate the prognostic performance of a nomogram integrating clinical parameters with deep learning radiomics (DLRN) features derived from ultrasound and multi-sequence magnetic resonance imaging (MRI) for predicting survival, recurrence, and metastasis in patients diagnosed with triple-negative breast cancer (TNBC) undergoing neoadjuvant chemotherapy (NAC).
Methods: This retrospective, multicenter study included 103 patients with histopathologically confirmed TNBC across four institutions. The training group comprised 72 cases from the First People's Hospital of Lianyungang, while the validation group included 31 cases from three external centers.
J Virol
September 2025
Division of Pediatric Infectious Disease, Department of Pediatrics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA.
Rift Valley fever virus (RVFV) causes mild to severe disease in livestock and humans. It was first identified in 1931 during an epizootic in Kenya and has spread across Africa and into the Middle East. Hematopoietic cells are one of the major targets of RVFV ; however, their contribution to RVFV pathogenesis remains poorly understood.
View Article and Find Full Text PDF