ConCreT, a 2D convolutional neural network for taxonomic classification applied to viruses in the phylum Cressdnaviricota.

Journal of virological methods(2023)

引用 0|浏览9
暂无评分
摘要
Taxonomic assignments allow scientists to communicate better with each other. In virology, taxonomy is continually improving towards a more precise and comprehensive framework. With the huge numbers of new viruses being described in metagenomic studies, automated taxonomy tools are urgently needed. A number of such tools have been proposed, and those applying machine learning (ML), mainly in the deep learning branch, stand out with accurate results. Still, there is a demand for tools that are less computationally intensive and that can classify viruses down to the ranks of genus and species. Cressdnaviruses are good subjects for testing these such tools, due to their small, circular genomes and the existence of several families and genera with a highly imbalanced number of species. We developed a 2D convolutional neural network for virus taxonomy, and tested it for classification of viruses from the phylum Cressdnaviricota. We obtained >98% accuracy in the final pipeline tested, which we named ConCreT (Convolutional Neural Network for Cressdnavirus Taxonomy). The mixture of augmentation for more imbalanced groups with no augmentation for more balanced ones achieved the best score in the final test.
更多
查看译文
关键词
taxonomic classification,phylum<i>cressdnaviricota</i>,viruses,convolutional neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要