98%
921
2 minutes
20
Motivation: In silico transcription factor and DNA (TF-DNA) binding affinity prediction plays a vital role in examining TF binding preferences and understanding gene regulation. The existing tools employ TF-DNA binding profiles from in vitro high-throughput technologies to predict TF-DNA binding affinity. However, TFs tend to bind to sequences in open chromatin regions in vivo, such TF binding preference is seldomly considered by these existing tools.
Results: In this study, we developed TRAFICA, an open chromatin language model to predict TF-DNA binding affinity by integrating sequence characteristics of open chromatin regions from ATAC-seq experiments and in vitro TF-DNA binding profiles from high-throughput technologies. We pre-trained TRAFICA on over 2.8 million nucleotide sequences in open chromatin regions derived from 197 ATAC-seq experiments (115 cell lines) to learn in vivo TF binding preferences. We further fine-tuned TRAFICA using low-rank adaptation (LoRA) on PBM and HT-SELEX TF-DNA binding profiles to learn intrinsic binding preferences for specific TFs. We systematically evaluated TRAFICA and compared its predictive performance with existing prediction tools and advanced DNA language models. The experimental results demonstrated that TRAFICA significantly outperformed the others in predicting in vitro and in vivo TF-DNA binding affinity, achieving state-of-the-art performance. These findings indicate that considering the sequence characteristics from open chromatin regions could significantly improve TF-DNA binding affinity prediction.
Availability And Implementation: The source code of TRAFICA and detailed tutorials are available at https://github.com/ericcombiolab/TRAFICA.
Supplementary Information: Supplementary files are available at Bioinformatics Journal online.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/bioinformatics/btaf469 | DOI Listing |
Cell Rep
September 2025
Department of Biochemistry, University of Colorado, Boulder, CO 80303, USA. Electronic address:
RNA polymerase II (RNAPII) is regulated by sequence-specific transcription factors (TFs) and the pre-initiation complex (PIC): TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIH, and Mediator. TFs, Mediator, and RNAPII contain intrinsically disordered regions (IDRs) and form phase-separated condensates, but how IDRs control RNAPII function remains poorly understood. Using purified PIC factors, we developed a real-time in vitro fluorescence transcription (RIFT) assay for second-by-second visualization of transcription at hundreds of promoters simultaneously.
View Article and Find Full Text PDFBioinformatics
August 2025
Department of Computer Science, Hong Kong Baptist University, Kowloon Town, Hong Kong, China.
Motivation: In silico transcription factor and DNA (TF-DNA) binding affinity prediction plays a vital role in examining TF binding preferences and understanding gene regulation. The existing tools employ TF-DNA binding profiles from in vitro high-throughput technologies to predict TF-DNA binding affinity. However, TFs tend to bind to sequences in open chromatin regions in vivo, such TF binding preference is seldomly considered by these existing tools.
View Article and Find Full Text PDFMol Plant
August 2025
, State Key Laboratory of Maize Bio-breeding, Frontiers Science Center for Molecular Design Breeding, Joint International Research Laboratory of Crop Molecular Breeding, National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100193, China; ,
Understanding gene regulatory networks (GRNs) is essential for improving maize yield and quality through molecular breeding approaches. The lack of comprehensive transcription factor (TF)-DNA interaction data has hindered accurate GRN predictions, limiting our insight into the regulatory mechanisms. Here, we performed large-scale profiling of maize TF binding sites.
View Article and Find Full Text PDFPlant J
August 2025
State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China.
Transcription factors (TFs) play a crucial role in gene regulation. They drive chromatin remodeling, transcription, mRNA splicing, and RNA processing via dynamic protein interactions. However, their low abundance and complex binding networks complicate the study of TF partners.
View Article and Find Full Text PDFCell Prolif
July 2025
Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.
Differences in gene expression, which arise from divergence in cis-regulatory elements or alterations in transcription factors (TFs) binding specificity, are one of the most important causes of phenotypic diversity during evolution. On one hand, changes in the cis-elements located in the vicinity of target genes affect TF binding and/or local chromatin environment, thereby modulating gene expression in cis. On the other hand, alterations in trans-factors influence the expression of their target genes in a more pleiotropic fashion.
View Article and Find Full Text PDF