Abstract
A method for predicting hotspots and coldspots using support vector machine (SVM) based on statistical learning theory is developed. This method is applied to published 303 hot and 48 cold open reading frames (ORFs) in Saccharomyces cerevisiae. The sequence features of general dinucleotide abundance and dinucleotide abundance based on codon usage are extracted, and then the data sets are classified with different parameters and kernel functions combined with the method of two-fold cross validation. The result indicates that 87.47% accuracy can be reached when classifying hot and cold ORF sequences with the kernel of radial basis function combined with dinucleotide abundance based on codon usage.
Original language | English (US) |
---|---|
Pages (from-to) | 112-116 |
Number of pages | 5 |
Journal | Journal of Southeast University (English Edition) |
Volume | 22 |
Issue number | 1 |
State | Published - Mar 2006 |
Externally published | Yes |
Keywords
- Coldspot
- Dinucleotide abundance
- Hotspot
- Meiotic recombination
- Support vector machine
ASJC Scopus subject areas
- General Engineering