98%
921
2 minutes
20
Position weight matrix (PWM) is the traditional motif model representing the transcription factor (TF) binding sites. It proposes that the positions contribute independently to TFs binding affinity, although this hypothesis does not fit the data perfectly. This explains why PWM hits are missing in a substantial fraction of ChIP-seq peaks. To study various modes of the direct binding of plant TFs, we compiled the benchmark collection of 111 ChIP-seq datasets for , and applied the traditional PWM, and two alternative motif models BaMM and SiteGA, proposing the dependencies of the positions. The variation in the stringency of the recognition thresholds for the models proposed that the hits of PWM, BaMM, and SiteGA models are associated with the sites of high/medium, any, and low affinity, respectively. At the medium recognition threshold, about 60% of ChIP-seq peaks contain PWM hits consisting of conserved core consensuses, while BaMM and SiteGA provide hits for an additional 15% of peaks in which a weaker core consensus is compensated through intra-motif dependencies. The presence/absence of these dependencies in the motifs of alternative/traditional models was confirmed by the dependency logo DepLogo visualizing the position-wise partitioning of the alignments of predicted sites. We exemplify the detailed analysis of ChIP-seq profiles for plant TFs CCA1, MYC2, and SEP3. Gene ontology (GO) enrichment analysis revealed that among the three motif models, the SiteGA had the highest portions of genes with the significantly enriched GO terms among all predicted genes. We showed that both alternative motif models provide for traditional PWM greater extensions in predicted sites for TFs MYC2/SEP3 with condition/tissue specific functions, compared to those for TF CCA1 with housekeeping functions. Overall, the combined application of standard and alternative motif models is beneficial to detect various modes of the direct TF-DNA interactions in the maximal portion of ChIP-seq loci.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9373801 | PMC |
http://dx.doi.org/10.3389/fpls.2022.938545 | DOI Listing |
J Ind Microbiol Biotechnol
September 2025
Department of Biochemistry University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
Glycocins are a growing family of ribosomally synthesized and posttranslationally modified peptides (RiPPs) that are O- and/or S-glycosylated. Using a sequence similarity network of putative glycosyltransferases, the thg biosynthetic gene cluster was identified in the genome of Thermoanaerobacterium thermosaccharolyticum. Heterologous expression in Escherichia coli showed that the glycosyltransferase (ThgS) encoded in the biosynthetic gene cluster (BGC) adds N-acetyl-glucosamine (GlcNAc) to Ser and Cys residues of ThgA.
View Article and Find Full Text PDFJ Phys Chem B
September 2025
Hefei National Research Center for Physical Sciences at the Microscale and Key Laboratory of Precision and Intelligent Chemistry, Department of Chemical Physics, University of Science and Technology of China, Hefei, Anhui 230026, China.
Multivalent protein-protein interactions play essential roles in mediating liquid-liquid phase separation (LLPS) that drives biomolecular condensate formation. Here, we systematically investigate how the spatial distribution and relative size of protein binding domains (PBDs) would influence LLPS in a mixture of spherical proteins and RNA single strands by using a patchy-particle polymer model, wherein each protein contains a fixed number of PBDs on the surface distributed closely or sparsely. Intriguingly, we find that LLPS behavior exhibits a nontrivial dependence on the cooperative interplay between PBD distribution and protein size: while sparsely distributed PBDs are more favorable to LLPS for small proteins, closely packed PBDs facilitate LLPS for larger counterparts.
View Article and Find Full Text PDFRecursive splice sites are rare motifs postulated to facilitate splicing across massive introns and shape isoform diversity, especially for long, brain-expressed genes. The necessity of this unique mechanism remains unsubstantiated, as does the role of recursive splicing (RS) in human disease. From analyses of rare copy number variants (CNVs) from almost one million individuals, we previously identified large, heterozygous deletions eliminating an RS site (RS1) in the first intron of that conferred substantial risk for attention deficit hyperactivity disorder (ADHD) and other neurobehavioral traits.
View Article and Find Full Text PDFNAR Cancer
September 2025
Department of Molecular Genetics and Microbiology, Duke University School of Medicine, 213 Research Drive, Durham, NC 27710, United States.
Treatment of patients with platinum-resistant ovarian cancer is a major clinical challenge. We found that high expression of a meiotic protein, Synaptonemal Complex Protein 2 (SYCP2), is associated with platinum resistance and tyrosine kinase ABL1 inhibitor sensitivity in ovarian cancer. We demonstrate that tyrosine kinase ABL1 inhibitors inhibit cancer cell proliferation more efficiently in ovarian cancer cell lines with SYCP2 overexpression.
View Article and Find Full Text PDFFront Immunol
September 2025
Northwell, New Hyde Park, NY, United States.
Immunoglobulins (IGs) made by chronic lymphocytic leukemia (CLL) B cells are unique in that they bind themselves (homo-dimerize). This interaction leads to signal transduction with functional consequences that depend on the affinity of homo-dimerization. We have studied the antigen-binding properties of the IGs from a subset of patients with CLL (Subset #4) that homo-dimerize at high affinity.
View Article and Find Full Text PDF