BPGA- an ultra-fast pan-genome analysis pipeline.

Sci Rep

Structural Biology &Bioinformatics Division, CSIR- Indian Institute of Chemical Biology, 4, Raja S. C. Mullick Road, Kolkata 700032, India.

Published: April 2016


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Recent advances in ultra-high-throughput sequencing technology and metagenomics have led to a paradigm shift in microbial genomics from few genome comparisons to large-scale pan-genome studies at different scales of phylogenetic resolution. Pan-genome studies provide a framework for estimating the genomic diversity of the dataset, determining core (conserved), accessory (dispensable) and unique (strain-specific) gene pool of a species, tracing horizontal gene-flux across strains and providing insight into species evolution. The existing pan genome software tools suffer from various limitations like limited datasets, difficult installation/requirements, inadequate functional features etc. Here we present an ultra-fast computational pipeline BPGA (Bacterial Pan Genome Analysis tool) with seven functional modules. In addition to the routine pan genome analyses, BPGA introduces a number of novel features for downstream analyses like core/pan/MLST (Multi Locus Sequence Typing) phylogeny, exclusive presence/absence of genes in specific strains, subset analysis, atypical G + C content analysis and KEGG &COG mapping of core, accessory and unique genes. Other notable features include minimum running prerequisites, freedom to select the gene clustering method, ultra-fast execution, user friendly command line interface and high-quality graphics outputs. The performance of BPGA has been evaluated using a dataset of complete genome sequences of 28 Streptococcus pyogenes strains.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4829868PMC
http://dx.doi.org/10.1038/srep24373DOI Listing

Publication Analysis

Top Keywords

pan genome
12
pan-genome studies
8
genome
5
bpga- ultra-fast
4
ultra-fast pan-genome
4
analysis
4
pan-genome analysis
4
analysis pipeline
4
pipeline advances
4
advances ultra-high-throughput
4

Similar Publications

Purpose: Hereditary Leiomyomatosis and Renal Cell Cancer (HLRCC) is a rare cancer susceptibility syndrome exclusively attributable to pathogenic variants in FH (HGNC:3700). This paper quantitatively weights the phenotypic context (PP4/PS4) of such very rare variants in FH.

Methods: We collated clinical diagnostic testing data on germline FH variants from 387 individuals with HLRCC and 1,780 individuals with renal cancer, and compared the frequency of 'very rare' variants in each phenotypic cohort against 562,295 population controls.

View Article and Find Full Text PDF

Limosilactobacillus fermentum CRL2085, isolated from feedlot cattle rations, displayed high efficiency as a probiotic when administered to animals. A comprehensive genomic analysis was performed to elucidate the genetic basis underlying its probiotic potential. Fifteen genomic islands and CRISPR-Cas elements were identified in its genome.

View Article and Find Full Text PDF

In-silico modeling of SHLP6: A novel mitochondrial peptide controlling neurodegeneration and cellular aging.

Comput Biol Med

September 2025

Institute of Biotechnology, Department of Medical Biotechnology, SIMATS Engineering, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, 602105, Tamil Nadu, India. Electronic address:

Small humanin-like peptide-6 (SHLP6), is derived from the mitochondrial genome. The 3D structure of SHLP6 was evaluated using PEPstr, with homology modeling predicting a Cyt-C structure with a DOPE score of -645.717 and a GA341 score of 0.

View Article and Find Full Text PDF

Background: Intracranial aneurysm (IA), known as pathological dilation of cerebral arteries,commonly occurring at bifurcating arteries,carries a high risk of severe morbidity and mortality if left untreated.Although the treatment and early diagnosis have significantly improved,the complex pathophysiological process of IA formation presents significant challenges in the development of targeted therapies.Efficient disease-modifying therapies for IA are not yet available.

View Article and Find Full Text PDF

Comparative molecular and physiological analyses of organisms from one taxonomic group grown under similar conditions offer a strategy to identify gene targets for trait improvement. While this strategy can also be performed in silico using genome-scale metabolic models for the compared organisms, we continue to lack solutions for the de novo generation of such models, particularly for eukaryotes. To facilitate model-driven identification of gene targets for growth improvement in green algae, here we present a semiautomated platform for de novo generation of genome-scale algal metabolic models.

View Article and Find Full Text PDF