A multi-dimensional integrative scoring framework for predicting functional variants in the human genome.

Am J Hum Genet

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Department of Statistics, Harvard University, Cambridge, MA, 02138, USA. Electronic address: xlin@h

Published: March 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Attempts to identify and prioritize functional DNA elements in coding and non-coding regions, particularly through use of in silico functional annotation data, continue to increase in popularity. However, specific functional roles can vary widely from one variant to another, making it challenging to summarize different aspects of variant function with a one-dimensional rating. Here we propose multi-dimensional annotation-class integrative estimation (MACIE), an unsupervised multivariate mixed-model framework capable of integrating annotations of diverse origin to assess multi-dimensional functional roles for both coding and non-coding variants. Unlike existing one-dimensional scoring methods, MACIE views variant functionality as a composite attribute encompassing multiple characteristics and estimates the joint posterior functional probabilities of each genomic position. This estimate offers more comprehensive and interpretable information in the presence of multiple aspects of functionality. Applied to a variety of independent coding and non-coding datasets, MACIE demonstrates powerful and robust performance in discriminating between functional and non-functional variants. We also show an application of MACIE to fine-mapping and heritability enrichment analysis by using the lipids GWAS summary statistics data from the European Network for Genetic and Genomic Epidemiology Consortium.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8948160PMC
http://dx.doi.org/10.1016/j.ajhg.2022.01.017DOI Listing

Publication Analysis

Top Keywords

coding non-coding
12
functional roles
8
functional
7
multi-dimensional integrative
4
integrative scoring
4
scoring framework
4
framework predicting
4
predicting functional
4
functional variants
4
variants human
4

Similar Publications

Whole genome sequence analysis of low-density lipoprotein cholesterol across 246 K individuals.

Genome Biol

September 2025

Center for Genomic Medicine, Cardiovascular Research Center, , Massachusetts General Hospital Simches Research Center, 185 Cambridge Street, CPZN 5.238,, Boston, MA, 02114, USA.

Background: Rare genetic variation provided by whole genome sequence datasets has been relatively less explored for its contributions to human traits. Meta-analysis of sequencing data offers advantages by integrating larger sample sizes from diverse cohorts, thereby increasing the likelihood of discovering novel insights into complex traits. Furthermore, emerging methods in genome-wide rare variant association testing further improve power and interpretability.

View Article and Find Full Text PDF

The global surge in the population of people 60 years and older, including that in China, challenges healthcare systems with rising age-related diseases. To address this demographic change, the Aging Biomarker Consortium (ABC) has launched the X-Age Project to develop a comprehensive aging evaluation system tailored to the Chinese population. Our goal is to identify robust biomarkers and construct composite aging clocks that capture biological age, defined as an individual's physiological and molecular state, across diverse Chinese cohorts.

View Article and Find Full Text PDF

Chemotherapeutic resistance is a significant issue in the treatment of breast cancer, which is related to pyroptosis inhibition. Increasing evidence suggests that long non-coding RNAs (lncRNAs) contribute to tumorigenesis and drug resistance. In this study we investigated the role of the lncRNA STMN1P2 in doxorubicin resistance in breast cancer, as well as its correlation with pyroptosis inhibition.

View Article and Find Full Text PDF

The flexibility of the spatio-temporal genome replication program during development and disease highlights the regulatory role of plastic epigenetic mechanisms over genetic determinants. Histone post-translational modifications are broadly implicated in replication timing control, yet the specific mechanisms through which individual histone marks influence replication dynamics, particularly in heterochromatin, remain unclear. Here, we demonstrate that H3K36me3 dynamically enriches at pericentromeric heterochromatin, composed of major satellite DNA repeats, prior to replication during mid S phase in mouse embryonic stem cells.

View Article and Find Full Text PDF

E3 ubiquitin ligases regulate the cellular proteome proteasome-dependent protein degradation; however, there exist limited studies outlining their non-canonical functions. RNA-binding ubiquitin ligases (RBULs) represent a subset of E3 ligases that harbour RNA-binding domains, making them uniquely positioned to function as both RNA-binding proteins and E3 ligases. Our initial microarray screen for E3 ligases from mouse cortical neural progenitor cells identified MEX3B, a known RNA-binding ubiquitin ligase, to be differentially expressed.

View Article and Find Full Text PDF