Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Rare coding variants that substantially affect function provide insights into the biology of a gene. However, ascertaining the frequency of such variants requires large sample sizes. Here we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. In total, 23% of the Regeneron Genetics Center Million Exome (RGC-ME) data come from individuals of African, East Asian, Indigenous American, Middle Eastern and South Asian ancestry. The catalogue includes more than 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss of function (LOF), we identify 3,988 LOF-intolerant genes, including 86 that were previously assessed as tolerant and 1,153 that lack established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions that are depleted of missense variants despite being tolerant of pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this resource of coding variation from the RGC-ME dataset publicly accessible through a variant allele frequency browser.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11254753PMC
http://dx.doi.org/10.1038/s41586-024-07556-0DOI Listing

Publication Analysis

Top Keywords

plof variants
12
protein-coding variation
8
983578 individuals
8
individuals rare
8
variants
7
individuals
5
deep catalogue
4
catalogue protein-coding
4
variation 983578
4
rare coding
4

Similar Publications

Background: ANK3 encodes ankyrin-G, a key scaffolding protein essential for neuronal function. While both monoallelic and biallelic ANK3 variants have been linked to neurodevelopmental disorders (NDDs), existing evidence for their pathogenicity and clinical correlation remains limited and heterogeneous.

Objective: To delineate the clinical features associated with biallelic ANK3 predicted loss-of-function (pLOF) variants.

View Article and Find Full Text PDF

Purpose: is a proposed colorectal cancer (CRC) predisposition gene, with only four families with putative loss-of-function (pLoF) variants published. The prevalence, phenotypic spectrum, and clinical management recommendations for heterozygotes remain unknown.

Methods: Retrospective review of approximately 950,000 individuals undergoing multigene panel testing (MGPT) for cancer predisposition identified 36 individuals with pLoF variants in .

View Article and Find Full Text PDF

Background: Cancer predisposition syndromes (CPSs) with pediatric onset are the leading known cause of childhood malignancies and are increasingly guiding clinical strategies in pediatric oncology. CPSs are placed under evolutionary negative selective pressure, but pediatric pancancer studies have so far failed to investigate genomic evolutionary metrics as a guide to predict penetrance and reveal novel CPSs.

Method: Germline whole-genome sequencing (WGS) in a 5-year prospective, registry-validated, nationwide cohort of individuals diagnosed with cancer before 18 years of age.

View Article and Find Full Text PDF

Generation of four human pluripotent stem cell lines harboring OPA1-related optic atrophy variant.

Stem Cell Res

September 2025

Hawaii Permanente Medical Group, Kaiser Permanente, Mapunapuna Medical Office, USA. Electronic address:

We have generated four human iPSC lines from skin biopsy-derived fibroblast cells with pathogenic variants in OPA1. Donors have a clinical diagnosis of optic atrophy. Three harbor heterozygous presumed loss-of-function (pLOF) variants, c.

View Article and Find Full Text PDF

Functional analysis of human EED variants using Drosophila.

Genetics

September 2025

Department of Medical Genetics, University of British Columbia, Life Sciences Institute, The University of British Columbia, 2350 Health Sciences Mall, Vancouver, British Columbia, Canada V6T 1Z3.

The Polycomb Repressive Complex 2 is an epigenetic reader/writer that methylates histone H3K27. Rare germline partial loss-of-function (pLoF) variants in core members of the complex (EZH2, EED, and SUZ12) cause overgrowth and intellectual disability syndromes, whereas somatic variants are implicated in cancer. However, up to 1% of the general population will have a rare variant in one of these genes, most of which would be classified as variants of uncertain significance (VUS).

View Article and Find Full Text PDF