98%
921
2 minutes
20
Enhancer elements in the human genome control how genes are expressed in specific cell types and harbor thousands of genetic variants that influence risk for common diseases. Yet, we still do not know how enhancers regulate specific genes, and we lack general rules to predict enhancer-gene connections across cell types. We developed an experimental approach, CRISPRi-FlowFISH, to perturb enhancers in the genome, and we applied it to test >3,500 potential enhancer-gene connections for 30 genes. We found that a simple activity-by-contact model substantially outperformed previous methods at predicting the complex connections in our CRISPR dataset. This activity-by-contact model allows us to construct genome-wide maps of enhancer-gene connections in a given cell type, on the basis of chromatin state measurements. Together, CRISPRi-FlowFISH and the activity-by-contact model provide a systematic approach to map and predict which enhancers regulate which genes, and will help to interpret the functions of the thousands of disease risk variants in the noncoding genome.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886585 | PMC |
http://dx.doi.org/10.1038/s41588-019-0538-0 | DOI Listing |
Pancreatology
August 2025
Department of Internal Medicine I, Martin Luther University (MLU), Halle (Saale), Germany. Electronic address:
Background/objectives: Enhancers are key drivers of tissue-specific gene expression and can contain variants associated with pancreatic diseases. Enhancer-target gene assignment remains challenging, with the Activity-By-Contact (ABC) model, integrating open-chromatin, histone modification and chromatin interaction data, consistently outperforming other approaches. Recently an advanced version, the generalized ABC (gABC) model was published, yet lacking a clear and unique promoter definition impairing interpretability.
View Article and Find Full Text PDFNat Genet
July 2025
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Methods that analyze single-cell paired RNA sequencing (RNA-seq) and assay for transposase-accessible chromatin using sequencing (ATAC-seq) multiome data have shown promise in linking regulatory elements to genes. However, existing methods exhibit low concordance and do not capture the effects of genomic distance. We propose pgBoost, an integrative modeling framework that trains a non-linear combination of existing linking strategies (including genomic distance) on expression quantitative trait locus (eQTL) data to assign a probabilistic score to each candidate single-nucleotide polymorphism-gene link.
View Article and Find Full Text PDFmedRxiv
June 2025
Department of Human Genetics, University of Chicago, Chicago IL.
Asthma, allergic rhinitis, and atopic dermatitis are common, complex traits that are frequently co-morbid and have strong genetic correlation. However, the extent to which genome-wide genetic correlation between traits reflects shared causal variants or risk genes remains unclear. To address this question, we used functional fine-mapping.
View Article and Find Full Text PDFBMC Genomics
May 2025
Medical School of Chinese People's Liberation Army (PLA), 28 Fuxing Road, 100853, Beijing, China.
Introduction: Non-alcoholic fatty liver disease (NAFLD) represents the most widespread liver disease globally, ranging from non-alcoholic fatty liver (NAFL) and steatohepatitis (NASH) to fibrosis/cirrhosis, with potential progression to hepatocellular carcinoma (HCC). Genome-wide association studies (GWASs) have identified several single nucleotide polymorphisms (SNPs) associated with NAFLD. However, numerous GWAS signals associated with NAFLD locate in non-coding regions, posing a challenge for interpreting their functional annotation.
View Article and Find Full Text PDFbioRxiv
March 2025
Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA, USA.
Age-related macular degeneration (AMD) is a leading cause of vision loss worldwide. Genome-wide association studies (GWAS) of AMD have identified dozens of risk loci that may house disease targets. However, variants at these loci are largely noncoding, making it difficult to assess their function and whether they are causal.
View Article and Find Full Text PDF