Article Abstract

The 100,000 Genomes Project (100KGP) diagnosed a quarter of recruited affected participants, but 26% of diagnoses were in genes not on the chosen gene panel(s), many of them involving high-impact variants. However, assessing biallelic variants without a gene panel is challenging because of the number of variants requiring scrutiny. We sought to identify potential missed biallelic diagnoses, independent of the gene panel applied, using GenePy, a whole-gene pathogenicity metric. GenePy scores all variants called in a given individual, incorporating allele frequency, zygosity, and a user-defined deleteriousness metric (CADD v1.6 applied herein). GenePy then combines all variant scores for individual genes, generating an aggregate score per gene, per participant. We calculated GenePy scores for 2862 recessive disease genes in 78,216 individuals in 100KGP. For each gene, we ranked participants by their GenePy score for that gene and scrutinised affected individuals without a diagnosis whose scores ranked amongst the top 5 for that gene. We assessed these participants' phenotypes for overlap with the phenotype associated with the disease gene for which they were highly ranked. Where phenotypes overlapped, we extracted rare variants in the gene of interest and applied phasing, ClinVar and ACMG classification to look for putative causal biallelic variants. 3184 affected individuals without a molecular diagnosis had a top-5 ranked GenePy gene score, and 682/3184 (21%) had phenotypes overlapping with one of the top-ranking genes. After removing 13 withdrawn participants, we identified a putative missed diagnosis in a top-ranked gene, supported by phasing, ClinVar and ACMG classification, in 122/669 (18%) of the phenotype-matched cases. A further 334/669 (50%) of cases have a possible missed diagnosis but require functional validation.
Applying GenePy at scale has identified potential diagnoses for 456/3183 (14%) of undiagnosed participants who had a top-5 ranked GenePy score in a recessive disease gene, whilst adding only 1.2 additional variants (per individual) for assessment.
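
The per-gene ranking step described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `top_ranked_undiagnosed`, the `(participant, gene, score)` input layout, the participant and gene identifiers, and the tie-breaking rule are all hypothetical, and it assumes that ranking is done across all scored participants before filtering to undiagnosed affected individuals.

```python
from collections import defaultdict


def top_ranked_undiagnosed(scores, undiagnosed_affected, top_n=5):
    """Return {participant: [genes]} for undiagnosed affected participants
    whose score is among the top_n highest for a gene.

    scores: iterable of (participant_id, gene, score) tuples (hypothetical layout).
    undiagnosed_affected: set of participant ids to keep after ranking.
    """
    # Group scores by gene so that each gene is ranked independently.
    by_gene = defaultdict(list)
    for participant, gene, score in scores:
        by_gene[gene].append((score, participant))

    flagged = defaultdict(list)
    for gene, entries in by_gene.items():
        # Highest score first; ties broken by participant id (an assumption,
        # chosen only to make the output deterministic).
        entries.sort(key=lambda e: (-e[0], e[1]))
        for _score, participant in entries[:top_n]:
            if participant in undiagnosed_affected:
                flagged[participant].append(gene)
    return dict(flagged)


if __name__ == "__main__":
    # Toy data with made-up identifiers and scores.
    toy_scores = [
        ("P1", "GENE_A", 4.2),
        ("P2", "GENE_A", 0.3),
        ("P3", "GENE_A", 2.9),
        ("P1", "GENE_B", 0.1),
        ("P3", "GENE_B", 3.3),
    ]
    print(top_ranked_undiagnosed(toy_scores, {"P3"}, top_n=1))
```

In the study, participants flagged this way were then checked for phenotype overlap with the gene-associated phenotype before any variant-level review, so a ranking like this only narrows the set of gene and participant pairs to inspect.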


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10081430
DOI: http://dx.doi.org/10.1101/2023.03.21.23287545
