Article Synopsis

  • Noncoding DNA's role in gene expression across cell types is still not fully understood, and solving this issue is crucial for advancements in human genetics.
  • Researchers developed a deep learning model called Enformer that significantly improves the accuracy of predicting gene expression from DNA sequences by considering long-range genomic interactions.
  • Enformer can predict how genetic variants affect gene expression and enhancer-promoter interactions more accurately than traditional methods, which could enhance the understanding of human diseases and regulatory evolution.

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

How noncoding DNA determines gene expression in different cell types is a major unsolved problem, and critical downstream applications in human genetics depend on improved solutions. Here, we report substantially improved gene expression prediction accuracy from DNA sequences through the use of a deep learning architecture, called Enformer, that is able to integrate information from long-range interactions (up to 100 kb away) in the genome. This improvement yielded more accurate variant effect predictions on gene expression for both natural genetic variants and saturation mutagenesis measured by massively parallel reporter assays. Furthermore, Enformer learned to predict enhancer-promoter interactions directly from the DNA sequence competitively with methods that take direct experimental data as input. We expect that these advances will enable more effective fine-mapping of human disease associations and provide a framework to interpret cis-regulatory evolution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8490152PMC
http://dx.doi.org/10.1038/s41592-021-01252-xDOI Listing

Publication Analysis

Top Keywords

gene expression
16
expression prediction
8
long-range interactions
8
effective gene
4
expression
4
prediction sequence
4
sequence integrating
4
integrating long-range
4
interactions noncoding
4
noncoding dna
4

Similar Publications

Analyzing the toxicological effects of PET-MPs on male infertility: Insights from network toxicology, mendelian randomization, and transcriptomics.

Reprod Biol

September 2025

Department of Obstetrics and Gynecology, The First Affiliated Hospital of Anhui Medical University, Hefei 230022, China; Engineering Research Center of Biopreservation and Artificial Organs, Ministry of Education, No 218 Jixi Road, Hefei Anhui230022, China; Key Laboratory of Population Health Across

Current research indicates that polyethylene terephthalate microplastics (PET-MPs) may significantly impair male reproductive function. This study aimed to investigate the potential molecular mechanisms underlying this impairment. Potential gene targets of PET-MPs were predicted via the SwissTargetPrediction database.

View Article and Find Full Text PDF

CRISPR/Cas9-mediated editing of COQ4 in induced pluripotent stem cells: A model for investigating COQ4-associated human coenzyme Q deficiency.

Stem Cell Res

September 2025

Department of General Pediatrics, Neonatology, and Pediatric Cardiology, Medical Faculty and University Hospital Düsseldorf, Heinrich-Heine-University, Düsseldorf 40225, Germany. Electronic address:

Pathogenic variants in the gene COQ4 cause primary coenzyme Q deficiency, which is associated with symptoms ranging from early epileptic encephalopathy up to adult-onset ataxia-spasticity spectrum disease. We genetically modified commercially available wild-type iPS cells by using a CRISPR/Cas9 approach to create heterozygous and homozygous isogenic cell lines carrying the disease-causing COQ4 variants c.458C > T, p.

View Article and Find Full Text PDF

Mechanistic roles of long non-coding RNAs in DNA damage response and genome stability.

Mutat Res Rev Mutat Res

September 2025

Institute of Environmental Medicine, Zhejiang University School of Medicine, Hangzhou 310058, China. Electronic address:

To maintain genomic stability, cells have evolved complex mechanisms collectively known as the DNA damage response (DDR), which includes DNA repair, cell cycle checkpoints, apoptosis, and gene expression regulation. Recent studies have revealed that long non-coding RNAs (lncRNAs) are pivotal regulators of the DDR. Beyond their established roles in recruiting repair proteins and modulating gene expression, emerging evidence highlights two particularly intriguing functions.

View Article and Find Full Text PDF

Clinicopathological features of dermal clear cell sarcoma: A series of 13 cases.

Pathol Res Pract

September 2025

Department of Pathology, Xijing Hospital and School of Basic Medicine, Fourth Military Medical University, Xi'an, China. Electronic address:

Background: Dermal clear cell sarcoma (DCCS) is a rare malignant mesenchymal neoplasm. Owing to the overlaps in its morphological and immunophenotypic profiles with a broad spectrum of tumors exhibiting melanocytic differentiation, it is frequently misdiagnosed as other tumor entities in clinical practice. By systematically analyzing the clinicopathological characteristics, immunophenotypic features, and molecular biological properties of DCCS, this study intends to further enhance pathologists' understanding of this disease and provide a valuable reference for its accurate diagnosis.

View Article and Find Full Text PDF

Background: Crohn's disease (CD) and rheumatoid arthritis (RA) are autoimmune diseases. CD is known to be closely associated with RA. However, the mechanisms underlying these relationships remain unclear.

View Article and Find Full Text PDF