Mathematical Modeling of Avidity Distribution and Estimating General Binding Properties of Transcription Factors from Genome-Wide Binding Profiles.

Methods Mol Biol

Bioinformatics Institute, Agency of Science, Technology and Research, 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore.

Published: May 2018


Citations

20

Article Abstract

The experimental frequency distributions (EFDs) of diverse molecular interaction events that quantify genome-wide binding are often strongly skewed, with abundant low-value events and rare high-value ones. Such distributions deviate systematically from the standard power-law functions proposed by scale-free network models, suggesting that more explanatory and predictive probabilistic models are needed. The goal of this analytical survey is to identify mechanism-based, data-driven statistical distributions that allow binding properties of transcription factors to be estimated and predicted from genome-wide binding profiles. Here, we review and develop an analytical framework for modeling, analysis, and prediction of transcription factor (TF) DNA binding properties detected at the genome scale. We introduce a mixture probabilistic model of the binding avidity function that includes nonspecific and specific binding events, and we propose a method for decomposing specific and nonspecific TF-DNA binding events. We show that the Kolmogorov-Waring (KW) probability function (PF), which models the steady-state TF binding-dissociation stochastic process, fits the EFD well for diverse TF-DNA binding datasets. Furthermore, this distribution predicts the total number of TF-DNA binding sites (BSs) and allows estimation of specificity, sensitivity, and other basic statistical features of DNA-TF binding when the experimental datasets are noise-rich and essentially incomplete. The KW distribution fits TF-DNA binding activity equally well for different TFs, including ERE, CREB, STAT1, Nanog, and Oct4. Our analysis reveals that the KW distribution and its generalized form provide a family of power-law-like distributions expressed in terms of hypergeometric series functions, including the standard and generalized Pareto and Waring distributions, which offer flexible and common skewed forms of the transcription factor binding site (TFBS) avidity distribution function.
We suggest that the skewed distribution of binding events may result from a wide range of evolutionary processes that create weak-avidity TFBSs through random mutations, whereas high-avidity binding sites (i.e., evolutionarily conserved canonical E-boxes) arise only rarely. These rare sites, however, may be positively selected in microevolution.
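
The steady-state binding-dissociation process mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the rate functions, so the code below assumes linear rates: binding rate lam*(a + k) and dissociation rate mu*(b + k) in state k, with theta = lam/mu. These rates, the parameter names, and the truncation at `k_max` are illustrative assumptions and may differ from the authors' parameterization. Under these assumptions, detailed balance gives a skewed, hypergeometric-type distribution, and at theta = 1 it coincides with the standard Waring distribution with alpha = a and rho = b - a.

```python
import math


def birth_death_steady_state(a, b, theta, k_max):
    """Steady-state probabilities p_0..p_k_max of a birth-death chain.

    Assumed (hypothetical) rates: binding lam*(a + k), dissociation mu*(b + k),
    with theta = lam/mu > 0. The support is truncated at k_max and renormalized.
    """
    # Detailed balance: p_{k+1} = p_k * theta * (a + k) / (b + k + 1)
    log_w = [0.0]
    for k in range(k_max):
        step = math.log(theta) + math.log(a + k) - math.log(b + k + 1)
        log_w.append(log_w[-1] + step)
    # Subtract the maximum before exponentiating to avoid overflow
    m = max(log_w)
    weights = [math.exp(v - m) for v in log_w]
    total = sum(weights)
    return [w / total for w in weights]


def waring_pmf(k, alpha, rho):
    """Standard Waring probability mass function for k = 0, 1, 2, ..."""
    log_p = (math.log(rho) + math.lgamma(alpha + rho) + math.lgamma(k + alpha)
             - math.lgamma(alpha) - math.lgamma(k + alpha + rho + 1))
    return math.exp(log_p)


if __name__ == "__main__":
    # theta = 1, a = 1, b = 3 corresponds to Waring(alpha=1, rho=2)
    p = birth_death_steady_state(a=1.0, b=3.0, theta=1.0, k_max=5000)
    for k in range(5):
        print(k, p[k], waring_pmf(k, alpha=1.0, rho=2.0))
```

Working in log space keeps the recurrence stable for large `k_max`. For theta close to 1 the tail decays like a power law, so the truncation point needs to be large for the normalization to be accurate.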

Source
http://dx.doi.org/10.1007/978-1-4939-7027-8_9

Similar Publications

In the presence of chromatin bridges in cytokinesis, human cells retain actin-rich structures (actin patches) at the base of the intercellular canal to prevent chromosome breakage. Here, we show that daughter nuclei connected by chromatin bridges are under mechanical tension that requires interaction of the nuclear membrane Sun1/2-Nesprin-2 Linker of Nucleoskeleton and Cytoskeleton (LINC) complex with the actin cytoskeleton, and an intact nuclear lamina. This nuclear tension promotes accumulation of Sun1/2-Nesprin-2 proteins at the base of chromatin bridges and local enrichment of the RhoA-activator PDZ RhoGEF through PDZ-binding to cytoplasmic Nesprin-2 spectrin repeats.

Comment on "spatially dependent tissue distribution of thyroid hormones by plasma thyroid hormone binding proteins".

Pflugers Arch

September 2025

Department of Research Analytics, Saveetha Dental College and Hospitals, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, India.

Manipulating Zika virus RNA tertiary structure for developing tissue-specific attenuated vaccines.

EMBO Mol Med

September 2025

State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, 100071, Beijing, China.

Traditional live attenuated vaccines (LAVs) are typically developed through serial passaging or genetic engineering to introduce specific mutations or deletions. While viral RNA secondary or tertiary structures have been well-documented for their multiple functions, including binding with specific host proteins, their potential for LAV design remains largely unexplored. Herein, using Zika virus (ZIKV) as a model, we demonstrate that targeted disruption of the primary sequence or tertiary structure of a specific viral RNA element responsible for Musashi-1 (MSI1) binding leads to a tissue-specific attenuation phenotype in multiple animal models.

Chemotherapeutic resistance is a significant issue in the treatment of breast cancer, which is related to pyroptosis inhibition. Increasing evidence suggests that long non-coding RNAs (lncRNAs) contribute to tumorigenesis and drug resistance. In this study we investigated the role of the lncRNA STMN1P2 in doxorubicin resistance in breast cancer, as well as its correlation with pyroptosis inhibition.

Construction of a bacterial surface display system using split green fluorescent protein (GFP) in Escherichia coli.

Biotechnol Lett

September 2025

Department of Chemical Engineering, Hongik University, Sangsu-dong, Mapo-gu, Seoul, 04066, Republic of Korea.

The cell surface display system employs carrier proteins to present target proteins on the outer membrane of cells. This system enables functional proteins to be exposed on the exterior of living cells without cell lysis, allowing direct interaction with the surrounding environment. A major limitation of conventional approaches is the difficulty in displaying large-sized enzymes or antibodies, despite their critical roles in applications requiring functional domains that must remain intact, such as catalytic or antigen-binding sites.
