PGIP: a web server for the rapid taxonomic identification of parasite genomes.

Parasit Vectors

National Health Commission Key Laboratory of Parasitic Disease Control and Prevention, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Provincial Medical Key Laboratory, Jiangsu Institute of Parasitic Diseases, Wuxi, 214064, Jiangsu, China.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Parasitic diseases remain a global health challenge, and traditional methods in their diagnosis face limitations in sensitivity and scalability. Genome-based sequencing technologies have improved and are increasingly employed for the identification of parasites; however, their clinical adoption remains hindered by the complexity of bioinformatics analysis, reliance on incomplete reference databases, and accessibility barriers for nonspecialists. Overcoming these challenges necessitates the development of standardized analytical workflows and high-quality genomic resources specifically tailored for parasite identification.

Methods: We developed a user-friendly web server named the Parasite Genome Identification Platform (PGIP). The reference database was sourced from the National Center for Biotechnology Information (NCBI), WormBase, European Nucleotide Archive (ENA), and VEuPathDB, rigorously filtered for quality, and deduplicated using Cluster Database at High Identity with Tolerance (CD-HIT) to ensure accuracy and nonredundancy. To streamline analysis, we integrated a standardized identification pipeline built on Nextflow, which encompasses host DNA depletion, quality control, parasite species identification via both reads mapping and assembly-based approaches, and automated report generation for comprehensive diagnostic insights.

Results: PGIP integrates a curated database of 280 parasite genomes; which is rigorously filtered for quality and taxonomic accuracy. Validation across diverse datasets demonstrated the precise species-level resolution of PGIP, and its compatibility with clinical samples. The platform features an intuitive graphic interface; and one-click analysis significantly reduces reliance on bioinformatics expertise, thus enabling rapid diagnosis.

Conclusions: PGIP offers an accurate, efficient, and a user-friendly web server designed to simplify and accelerate the taxonomic identification of parasite genomes using data from metagenomic next-generation sequencing. Its automated framework reduces the need for specialized expertise, enabling rapid application in clinical and public health settings.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12392538PMC
http://dx.doi.org/10.1186/s13071-025-07007-3DOI Listing

Publication Analysis

Top Keywords

web server
12
parasite genomes
12
taxonomic identification
8
identification parasite
8
user-friendly web
8
rigorously filtered
8
filtered quality
8
expertise enabling
8
enabling rapid
8
identification
6

Similar Publications

Multimodal Deep Learning for Generating Potential Anti-Dengue Peptides.

ACS Omega

September 2025

Division of Pharmaceutical Chemistry, Faculty of Pharmaceutical Sciences, Khon Kaen University, Khon Kaen 40002, Thailand.

Dengue virus remains a significant global health threat, imposing a substantial disease burden on nearly half of the world's population. The urgent need for effective antiviral therapeutics, including therapeutic peptides targeting the Dengue virus, is critical in the current healthcare landscape. However, the availability of anti-Dengue peptides (ADPs) data remains limited in existing data sets, posing a challenge for computational modeling and discovery.

View Article and Find Full Text PDF

Background: Sézary syndrome (SS) is an aggressive and leukemic variant of Cutaneous T-cell Lymphoma (CTCL) with an incidence of 1 case per million people per year. It is characterized by a complex and heterogeneous profile of genetic alteration ns that has so far precluded the development of a specific and definitive therapeutic intervention.

Methods: Deep-RNA-sequencing (RNA-seq) data were used to analyze the single nucleotide variants (SNVs) carried by 128 putative CTCL-driver genes, previously identified as mutated in genomic studies, in longitudinal SS samples collected from 17 patients subjected to extracorporeal photopheresis (ECP) with Interferon-α.

View Article and Find Full Text PDF

PhyCysID: Plant Cystatin Protein Prediction by an Artificial Intelligence Approach.

J Chem Inf Model

September 2025

Programa de Pós-Graduação em Genética e Biologia Molecular, Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre CEP 91501-970, Brazil.

Phytocystatins are proteinaceous inhibitors found in plants that competitively target various classes of cysteine proteinases, including papain-like enzymes, cathepsins, and legumains. Based on structural characteristics and gene organization, phytocystatins can be classified into four subtypes: intronless (I1 and I2), intron-containing (IwI), and multidomain cystatins containing more than one inhibitory region (II). This work presents PhyCysID, a dedicated web server designed for the rapid classification of phytocystatin subtypes.

View Article and Find Full Text PDF

The worldwide occurrence of neurodegenerative diseases in Alzheimer's and Parkinson's patients is increasing owing to multiple disease mechanisms, including oxidative stress, neuroinflammation, mitochondrial dysfunction, and excitotoxicity. (clove) flavonoid metabolites show strong neuroprotective potential because they act as antioxidants, reduce inflammation and lipid peroxidation, and prevent apoptosis. The key flavonoid metabolites, quercetin, kaempferol, kumatakenin, myricetin, ombuin 3-O-β-d-glucopyranoside, and tamarixetin 3-O-β-d-glucopyranoside, bind to various brain receptors implicated in disease propagation pathways and induce changes that support neuronal survival and decrease cognitive impairment.

View Article and Find Full Text PDF

Background: To guarantee high-quality patient scans, thorough quality assurance (QA) of SPECT or gamma cameras, including performance, review, and documentation, is essential.

Purpose: We developed a novel Nuclear Medicine Quality Assurance server (NMQA) with an AI deep learning (AIDL) optical character recognition (OCR) system to automate QA data retrieval and review from SPECT and gamma cameras. The system extracts and compares daily and weekly QA data against specifications.

View Article and Find Full Text PDF