98%
921
2 minutes
20
Characterization and accurate prediction of recombination hotspots and coldspots have crucial implications for the mechanism of recombination. Several models have predicted recombination hot/cold spots successfully, but there is still much room for improvement. We present a novel classifier in which k-mer frequency, physical and thermodynamic properties of DNA sequences are incorporated in the form of weighted features. Applying the classifier to recombination hot/cold ORFs in Saccharomyces cerevisiae, we achieved an accuracy of 90%, which is ~5% higher than existing methods, such as iRSpot-PseDNC, IDQD and Random Forest. The model also predicted non-ORF recombination hot/cold spots sequences in S. cerevisiae with high accuracy. A broad applicability of the model in the field of classification is expected.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.jtbi.2015.06.030 | DOI Listing |
Front Genet
June 2021
School of Life Sciences and Technology, Inner Mongolia University of Science and Technology, Baotou, China.
Characterization and identification of recombination hotspots provide important insights into the mechanism of recombination and genome evolution. In contrast with existing sequence-based models for predicting recombination hotspots which were defined in a ORF-based manner, here, we first defined recombination hot/cold spots based on public high-resolution Spo11-oligo-seq data, then characterized them in terms of DNA sequence and epigenetic marks, and finally presented classifiers to identify hotspots. We found that, in addition to some previously discovered DNA-based features like GC-skew, recombination hotspots in yeast can also be characterized by some remarkable features associated with DNA physical properties and shape.
View Article and Find Full Text PDFGenomics
December 2019
School of Computer Science, Xidian University, Xi'an, 710071, PR China.
Meiotic recombination plays an important role in the process of genetic evolution. Previous researches have shown that the recombination rates provide important information about the mechanism of recombination study. However, at present, most methods ignore the hidden correlation and spatial autocorrelation of the DNA sequence.
View Article and Find Full Text PDFSci Rep
September 2016
School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China.
Meiotic recombination presents an uneven distribution across the genome. Genomic regions that exhibit at relatively high frequencies of recombination are called hotspots, whereas those with relatively low frequencies of recombination are called coldspots. Therefore, hotspots and coldspots would provide useful information for the study of the mechanism of recombination.
View Article and Find Full Text PDFMol Biosyst
August 2016
Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China. and Center of Information in Biomedicine, University of Electronic Science and Technology of China, Chengdu, China and Key Laboratory for Neur
Pseudo dinucleotide composition (PseDNC) and Z curve showed excellent performance in the classification issues of nucleotide sequences in bioinformatics. Inspired by the principle of Z curve theory, we improved PseDNC to give the phase-specific PseDNC (psPseDNC). In this study, we used the prediction of recombination spots as a case to illustrate the capability of psPseDNC and also PseDNC fused with Z curve theory based on a novel machine learning method named large margin distribution machine (LDM).
View Article and Find Full Text PDFJ Theor Biol
October 2015
School of Mathematics, Physics and Biological Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China; The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou 014010, China.
Characterization and accurate prediction of recombination hotspots and coldspots have crucial implications for the mechanism of recombination. Several models have predicted recombination hot/cold spots successfully, but there is still much room for improvement. We present a novel classifier in which k-mer frequency, physical and thermodynamic properties of DNA sequences are incorporated in the form of weighted features.
View Article and Find Full Text PDF