Citations: 20

Article Abstract

Regulatory proteins have evolved diverse repressor domains (RDs) to enable precise context-specific repression of transcription. However, our understanding of how sequence variation impacts the functional activity of RDs is limited. To address this gap, we generated a high-throughput mutational scanning dataset measuring the repressor activity of 115,000 variant sequences spanning more than 50 RDs in human cells. We identified thousands of clinical variants with loss or gain of repressor function, including TWIST1 HLH variants associated with Saethre-Chotzen syndrome and MECP2 domain variants associated with Rett syndrome. We also leveraged these data to annotate short linear interacting motifs (SLiMs) that are critical for repression in disordered RDs. Then, we designed a deep learning model called TENet (Transcriptional Effector Network) that integrates sequence, structure and biochemical representations of sequence variants to accurately predict repressor activity. We systematically tested generalization within and across domains with varying homology using the mutational scanning dataset. Finally, we employed TENet within a directed evolution sequence editing framework to tune the activity of both structured and disordered RDs and experimentally test thousands of designs. Our work highlights critical considerations for future dataset design and model training strategies to improve functional variant prioritization and precision design of synthetic regulatory proteins.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11463546
DOI: http://dx.doi.org/10.1101/2024.09.21.614253

Publication Analysis

Top Keywords

repressor domains (8)
deep learning (8)
regulatory proteins (8)
mutational scanning (8)
scanning dataset (8)
repressor activity (8)
variants associated (8)
disordered rds (8)
repressor (5)
rds (5)

Similar Publications

Genome-wide association studies (GWAS) relevant to osteoporosis have identified hundreds of loci; however, understanding how these variants influence the phenotype is complicated because most reside in non-coding DNA sequence that serves as transcriptional enhancers and repressors. To advance knowledge on these regulatory elements in osteoclasts (OCs), we performed Micro-C analysis, which informs on the genome topology of these cells, and integrated the results with transcriptome and GWAS data to further define loci linked to BMD. Using blood cells isolated from 4 healthy participants aged 31-61 yr, we cultured OCs in vitro and generated a Micro-C chromatin conformation capture dataset.

Trimethylation of histone H3 at lys36 (H3K36me3) promotes gene transcription and governs plant development and plant responses to environmental cues. Yet, how H3K36me3 is translated into specific downstream events remains largely uninvestigated. Here, we report that the Arabidopsis PWWP-domain protein HUA2 binds methyl-H3K36 in a PWWP motif-dependent manner.

pVHL regulates protein stability of the TCF/LEF transcription factor family via ubiquitin-independent proteasomal degradation.

Cell Mol Life Sci

September 2025

Key Laboratory of Marine Drugs (Ocean University of China), Chinese Ministry of Education, and School of Medicine and Pharmacy, Ocean University of China, 5 Yushan Road, Qingdao, 266003, China.

The Wnt/β-catenin signaling pathway plays key roles in development and adult tissue homeostasis by controlling cell proliferation and cell fate decisions. TCF/LEF transcription factors play a pivotal role in this pathway, acting as repressors by recruiting co-repressors in the absence of Wnt signals, and as activators via β-catenin binding in the presence of Wnt signaling. While progress has been made in our understanding of Wnt signaling regulation, the underlying mechanism that regulates the protein stability of the TCF/LEF family is far less clear.

Duchenne muscular dystrophy (DMD) is a lethal pediatric striated muscle disease caused by loss of dystrophin for which there is no cure. Cardiomyopathy is the leading cause of death amongst individuals with DMD, and effective therapeutics to treat DMD cardiomyopathy are a major unmet clinical need. This work investigated adeno-associated viral (AAV) gene therapy approaches to treat DMD cardiomyopathy by overexpression of the calcium binding proteins S100A1 and apoptosis repressor with caspase recruitment domains (ARC).

All-in-one vectors for epigenetic CRISPR inhibition of in facioscapulohumeral muscular dystrophy.

Mol Ther Methods Clin Dev

September 2025

The Department of Pharmacology, University of Nevada, Reno School of Medicine, Reno, NV 89557, USA.

Facioscapulohumeral muscular dystrophy (FSHD) is caused by incomplete epigenetic silencing of the disease locus, leading to pathogenic misexpression of in skeletal muscle. Previously, we showed that CRISPR inhibition (CRISPRi) using several epigenetic regulators successfully targets and represses in FSHD myocytes with no adverse effects on the muscle transcriptome. However, for translatability, adeno-associated virus (AAV)-mediated gene therapies must include all components within a single vector that expresses at therapeutic levels in all target tissues.