This study compared computational approaches to parallelizing an SNP calling workflow. The data comprised DNA from five Holstein-Friesian cows sequenced with the Illumina platform. The pipeline consisted of quality control, alignment to the reference genome, post-alignment, and SNP calling. Three approaches to parallelization were compared: (i) a plain Bash script in which the pipeline for each cow ran as a separate process, with all processes invoked at the same time, (ii) a Bash script wrapped in a single Nextflow process, and (iii) a Nextflow script with each component of the pipeline defined as a separate process. On average, the multi-process Nextflow script ran 15-27% faster, depending on the number of assigned threads, with the biggest execution time advantage over the plain Bash approach observed with 10 threads. RAM usage varied most for the multi-process Nextflow script, for which it increased with the number of assigned threads, while RAM consumption of the other two setups depended little on the number of threads assigned for computations. Because of the intermediate and log files it generated, the multi-process Nextflow script used markedly more disk space than the plain Bash and single-process Nextflow setups.
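As a rough illustration of approach (i), the sketch below starts one operating-system process per animal at the same time and then waits for all of them. It is not the authors' script: the sample names and the `pipeline_command` helper are hypothetical, and the placeholder command only prints a message where a real run would call the per-sample pipeline (quality control, alignment, post-alignment, SNP calling).

```python
import subprocess
import sys
import time

# Hypothetical sample identifiers; the study used five cows, but these names are placeholders.
SAMPLES = ["cow1", "cow2", "cow3", "cow4", "cow5"]


def pipeline_command(sample: str) -> list[str]:
    """Return the command for one sample's pipeline.

    Placeholder (assumption): a real run would invoke the per-sample
    script here. This stand-in only prints the sample name so the
    sketch stays runnable on any machine with Python.
    """
    return [sys.executable, "-c", f"print('finished {sample}')"]


def run_all_at_once(samples: list[str]) -> dict[str, int]:
    """Start one process per sample simultaneously, then wait for all of them."""
    # Popen returns immediately, so every pipeline is launched before any is awaited.
    procs = {
        s: subprocess.Popen(pipeline_command(s), stdout=subprocess.DEVNULL)
        for s in samples
    }
    # Collect the exit code of each per-sample process.
    return {s: p.wait() for s, p in procs.items()}


if __name__ == "__main__":
    start = time.perf_counter()
    exit_codes = run_all_at_once(SAMPLES)
    elapsed = time.perf_counter() - start
    print(exit_codes, f"elapsed: {elapsed:.2f} s")
```

With this layout the degree of parallelism is fixed at one process per sample, whereas a workflow manager such as Nextflow schedules each pipeline step as its own task.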
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11057021 | PMC |
| http://dx.doi.org/10.1093/nargab/lqae040 | DOI Listing |
NAR Genom Bioinform
June 2024
Wroclaw University of Environmental and Life Sciences, Department of Genetics, the Biostatistics Group Kozuchowska 7, Wroclaw PL-51631, Poland.
BMJ Open
November 2023
Department of Population, Policy & Practice, UCL Great Ormond Street Institute of Child Health, London, UK.
Introduction: Feeding practices developed in early life can impact a child's nutrition, growth, dental health, cognitive development and lifetime risk of chronic diseases. Substantial evidence suggests ethnic health inequalities and non-recommended complementary infant feeding practices among the UK's South Asian (SA) population. Nurture Early for Optimal Nutrition aims to use women's group participatory learning and action (PLA) cycles to optimise infant feeding, care and dental hygiene practices in SA infants <2 years in East London.
J Geophys Res Atmos
April 2019
U.S. Environmental Protection Agency Research Triangle Park NC USA.
Air quality models provide spatial fields of wet deposition (WD) and dry deposition that explicitly account for the transport and transformation of emissions from thousands of sources. However, many sources of uncertainty in the air quality model, including errors in emissions and meteorological inputs (particularly precipitation) and incomplete descriptions of the chemical and physical processes governing deposition, can lead to bias and error in the simulation of WD. We present an approach to bias-correct Community Multiscale Air Quality model output over the contiguous United States using observation-based gridded precipitation data generated by the Parameter-elevation Regressions on Independent Slopes Model and WD observations at the National Atmospheric Deposition Program National Trends Network sites.
Atmos Chem Phys
June 2018
US Environmental Protection Agency, Research Triangle Park, NC 27711, USA.
Excess deposition (including both wet and dry deposition) of nitrogen and sulfur is detrimental to ecosystems. Recent studies have investigated the spatial patterns and temporal trends of nitrogen and sulfur wet deposition, but few studies have focused on dry deposition due to the scarcity of dry deposition measurements. Here, we use long-term model simulations from the coupled WRF-CMAQ model covering the period from 1990 to 2010 to study changes in spatial distribution as well as temporal trends in total (TDEP), wet (WDEP) and dry deposition (DDEP) of total inorganic nitrogen (TIN) and sulfur (TSO).