CloudMap: a cloud-based pipeline for analysis of mutant genome sequences.

Genetics

Department of Biochemistry and Molecular Biophysics, Howard Hughes Medical Institute, Columbia University Medical Center, New York, NY 10032, USA.

Published: December 2012


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Whole genome sequencing (WGS) allows researchers to pinpoint genetic differences between individuals and significantly shortcuts the costly and time-consuming part of forward genetic analysis in model organism systems. Currently, the most effort-intensive part of WGS is the bioinformatic analysis of the relatively short reads generated by second generation sequencing platforms. We describe here a novel, easily accessible and cloud-based pipeline, called CloudMap, which greatly simplifies the analysis of mutant genome sequences. Available on the Galaxy web platform, CloudMap requires no software installation when run on the cloud, but it can also be run locally or via Amazon's Elastic Compute Cloud (EC2) service. CloudMap uses a series of predefined workflows to pinpoint sequence variations in animal genomes, such as those of premutagenized and mutagenized Caenorhabditis elegans strains. In combination with a variant-based mapping procedure, CloudMap allows users to sharply define genetic map intervals graphically and to retrieve very short lists of candidate variants with a few simple clicks. Automated workflows and extensive video user guides are available to detail the individual analysis steps performed (http://usegalaxy.org/cloudmap). We demonstrate the utility of CloudMap for WGS analysis of C. elegans and Arabidopsis genomes and describe how other organisms (e.g., Zebrafish and Drosophila) can easily be accommodated by this software platform. To accommodate rapid analysis of many mutants from large-scale genetic screens, CloudMap contains an in silico complementation testing tool that allows users to rapidly identify instances where multiple alleles of the same gene are present in the mutant collection. Lastly, we describe the application of a novel mapping/WGS method ("Variant Discovery Mapping") that does not rely on a defined polymorphic mapping strain, and we integrate the application of this method into CloudMap. CloudMap tools and documentation are continually updated at http://usegalaxy.org/cloudmap.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3512137PMC
http://dx.doi.org/10.1534/genetics.112.144204DOI Listing

Publication Analysis

Top Keywords

cloudmap
9
cloud-based pipeline
8
analysis mutant
8
mutant genome
8
genome sequences
8
allows users
8
analysis
7
cloudmap cloud-based
4
pipeline analysis
4
sequences genome
4

Similar Publications

PHYTOCHROME C is an essential light receptor for photoperiodic flowering in the temperate grass, Brachypodium distachyon.

Genetics

September 2014

Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706 United States Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin, Madison, Wisconsin 53706 Department of Biochemistry, University of Wisconsin, Madison, Wisconsin 53706

We show that in the temperate grass, Brachypodium distachyon, PHYTOCHROME C (PHYC), is necessary for photoperiodic flowering. In loss-of-function phyC mutants, flowering is extremely delayed in inductive photoperiods. PHYC was identified as the causative locus by utilizing a mapping by sequencing pipeline (Cloudmap) optimized for identification of induced mutations in Brachypodium.

View Article and Find Full Text PDF

CloudMap: a cloud-based pipeline for analysis of mutant genome sequences.

Genetics

December 2012

Department of Biochemistry and Molecular Biophysics, Howard Hughes Medical Institute, Columbia University Medical Center, New York, NY 10032, USA.

Whole genome sequencing (WGS) allows researchers to pinpoint genetic differences between individuals and significantly shortcuts the costly and time-consuming part of forward genetic analysis in model organism systems. Currently, the most effort-intensive part of WGS is the bioinformatic analysis of the relatively short reads generated by second generation sequencing platforms. We describe here a novel, easily accessible and cloud-based pipeline, called CloudMap, which greatly simplifies the analysis of mutant genome sequences.

View Article and Find Full Text PDF