Using weighted features to predict recombination hotspots in Saccharomyces cerevisiae.

J Theor Biol

School of Mathematics, Physics and Biological Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China; The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou 014010, China.

Published: October 2015


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Characterization and accurate prediction of recombination hotspots and coldspots have crucial implications for the mechanism of recombination. Several models have predicted recombination hot/cold spots successfully, but there is still much room for improvement. We present a novel classifier in which k-mer frequency, physical and thermodynamic properties of DNA sequences are incorporated in the form of weighted features. Applying the classifier to recombination hot/cold ORFs in Saccharomyces cerevisiae, we achieved an accuracy of 90%, which is ~5% higher than existing methods, such as iRSpot-PseDNC, IDQD and Random Forest. The model also predicted non-ORF recombination hot/cold spots sequences in S. cerevisiae with high accuracy. A broad applicability of the model in the field of classification is expected.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jtbi.2015.06.030DOI Listing

Publication Analysis

Top Keywords

recombination hot/cold
12
weighted features
8
recombination hotspots
8
saccharomyces cerevisiae
8
hot/cold spots
8
recombination
6
features predict
4
predict recombination
4
hotspots saccharomyces
4
cerevisiae characterization
4

Similar Publications

Characterization and identification of recombination hotspots provide important insights into the mechanism of recombination and genome evolution. In contrast with existing sequence-based models for predicting recombination hotspots which were defined in a ORF-based manner, here, we first defined recombination hot/cold spots based on public high-resolution Spo11-oligo-seq data, then characterized them in terms of DNA sequence and epigenetic marks, and finally presented classifiers to identify hotspots. We found that, in addition to some previously discovered DNA-based features like GC-skew, recombination hotspots in yeast can also be characterized by some remarkable features associated with DNA physical properties and shape.

View Article and Find Full Text PDF

Meiotic recombination plays an important role in the process of genetic evolution. Previous researches have shown that the recombination rates provide important information about the mechanism of recombination study. However, at present, most methods ignore the hidden correlation and spatial autocorrelation of the DNA sequence.

View Article and Find Full Text PDF

Meiotic recombination presents an uneven distribution across the genome. Genomic regions that exhibit at relatively high frequencies of recombination are called hotspots, whereas those with relatively low frequencies of recombination are called coldspots. Therefore, hotspots and coldspots would provide useful information for the study of the mechanism of recombination.

View Article and Find Full Text PDF

Pseudo dinucleotide composition (PseDNC) and Z curve showed excellent performance in the classification issues of nucleotide sequences in bioinformatics. Inspired by the principle of Z curve theory, we improved PseDNC to give the phase-specific PseDNC (psPseDNC). In this study, we used the prediction of recombination spots as a case to illustrate the capability of psPseDNC and also PseDNC fused with Z curve theory based on a novel machine learning method named large margin distribution machine (LDM).

View Article and Find Full Text PDF

Using weighted features to predict recombination hotspots in Saccharomyces cerevisiae.

J Theor Biol

October 2015

School of Mathematics, Physics and Biological Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China; The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou 014010, China.

Characterization and accurate prediction of recombination hotspots and coldspots have crucial implications for the mechanism of recombination. Several models have predicted recombination hot/cold spots successfully, but there is still much room for improvement. We present a novel classifier in which k-mer frequency, physical and thermodynamic properties of DNA sequences are incorporated in the form of weighted features.

View Article and Find Full Text PDF