Predicting piRNAs by Na?ve Bayes classifier

ZHANG Cheng,WANG Jun

sciencepaper_online

引用 0|浏览0
暂无评分
摘要
Abstract: In this paper , we proposed a machine learning method based on the Na?ve Bayes classifier for predicting piRNA. First, piRNA and non-piRNA sequences of five model species: human, rat, mouse, fruit fly and nematode are served as training set. Then, sequence features, including k-mer frequencies, standardized word frequencies under a K-2 order Markov Model and different functions of four nucleotides are extracted from each sequence. Finally, the integrated features were fed into the Na?ve Bayes classifier to perform the prediction, where conditional probability of a word in each class was estimated by a histogram technique. Our machine learning approach achieved the overall accuracy of 82% by 5-fold cross validation. Due to the conciseness of the probability model, Na?ve Bayes classifier can be trained and predicted very fast and was efficient in large datasets.
更多
查看译文
关键词
Na?ve Bayes classifier,extracting feature,cross validation.
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要