Publications by Allison A Regier

A draft human pangenome reference.

Wen-Wei Liao, Mobin Asri, Jana Ebler, Daniel Doerr, Marina Haukness, Allison A Regier

Nature

May 2023

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels.

High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.

Marta Byrska-Bishop, Uday S Evani, Xuefang Zhao, Anna O Basile, Haley J Abel, Allison A Regier

Cell

September 2022

The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina.

Association of structural variation with cardiometabolic traits in Finns.

Lei Chen, Haley J Abel, Indraniel Das, David E Larson, Liron Ganel, Allison A Regier

Am J Hum Genet

April 2021

The contribution of genome structural variation (SV) to quantitative traits associated with cardiometabolic diseases remains largely unknown. Here, we present the results of a study examining genetic association between SVs and cardiometabolic traits in the Finnish population. We used sensitive methods to identify and genotype 129,166 high-confidence SVs from deep whole-genome sequencing (WGS) data of 4,848 individuals.

Haplotype-resolved diverse human genomes and integrated analysis of structural variation.

Peter Ebert, Peter A Audano, Qihui Zhu, Bernardo Rodriguez-Martin, David Porubsky, Allison A Regier

Science

April 2021

Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci.

Mapping and characterization of structural variation in 17,795 human genomes.

Haley J Abel, David E Larson, Allison A Regier, Colby Chiang, Indraniel Das

Nature

July 2020

A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline to map and characterize structural variants in 17,795 deeply sequenced human genomes.

Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects.

Allison A Regier, Yossi Farjoun, David E Larson, Olga Krasheninina, Hyun Min Kang

Nat Commun

October 2018

Hundreds of thousands of human whole genome sequencing (WGS) datasets will be generated over the next few years. These data are more valuable in aggregate: joint analysis of genomes from many sources increases sample size and statistical power. A central challenge for joint analysis is that different WGS data processing pipelines cause substantial differences in variant calling in combined datasets, necessitating computationally expensive reprocessing.

Genome Modeling System: A Knowledge Management Platform for Genomics.

Malachi Griffith, Obi L Griffith, Scott M Smith, Avinash Ramu, Matthew B Callaway, Allison A Regier

PLoS Comput Biol

July 2015

In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system.

Breakpoint structure of the Anopheles gambiae 2Rb chromosomal inversion.

Neil F Lobo, Djibril M Sangaré, Allison A Regier, Kyanne R Reidenbach, David A Bretz

Malar J

October 2010

Background: Alternative arrangements of chromosome 2 inversions in Anopheles gambiae are important sources of population structure, and are associated with adaptation to environmental heterogeneity. The forces responsible for their origin and maintenance are incompletely understood. Molecular characterization of inversion breakpoints provides insight into how they arose, and provides the basis for development of molecular karyotyping methods useful in future studies.