γBOriS: Identification of Origins of Replication in Gammaproteobacteria using Motif-based Machine Learning

bioRxiv (2019)

Abstract
The biology of bacterial cells is, in general, based on the information encoded on circular chromosomes. Regulation of chromosome replication is an essential process that mostly takes place at the origin of replication (oriC). Identifying large numbers of oriC sequences is a prerequisite for systematic studies that could yield insights into how oriC functions as well as novel drug targets for antibiotic development. Current methods for identifying oriC sequences rely on chromosome-wide nucleotide disparities and are therefore limited to fully sequenced genomes, leaving a superabundance of genomic fragments unstudied. Here, we present γBOriS (Gammaproteobacterial oriC Searcher), which accurately identifies oriC sequences on gammaproteobacterial chromosomal fragments by employing motif-based DNA classification. Using γBOriS, we created BOriS DB, which currently contains 25,827 oriC sequences from 1,217 species, making it the largest available database of oriC sequences to date.
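
To illustrate why disparity-based methods need a complete chromosome, the following is a minimal sketch of cumulative GC skew, one widely used nucleotide disparity. It is not the γBOriS method and is not taken from the paper; the function names, the window size, and the toy sequence are assumptions made for illustration. In many bacteria the cumulative skew reaches its minimum near oriC, and that global minimum can only be located when the whole chromosome is available.

```python
def cumulative_gc_skew(seq, window=1000):
    """Cumulative GC skew over non-overlapping windows.

    Per-window skew is (G - C) / (G + C); the running sum is returned,
    one value per full window. Trailing bases that do not fill a window
    are ignored. (Hypothetical helper, not part of gammaBOriS.)
    """
    totals = []
    running = 0.0
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window].upper()
        g = chunk.count("G")
        c = chunk.count("C")
        skew = (g - c) / (g + c) if (g + c) else 0.0
        running += skew
        totals.append(running)
    return totals


def predicted_origin_window(seq, window=1000):
    """Index of the window where the cumulative skew is lowest.

    Under the usual assumption that the skew changes sign at the origin,
    oriC is expected near the end of this window.
    """
    totals = cumulative_gc_skew(seq, window)
    return min(range(len(totals)), key=totals.__getitem__)


# Toy sequence (an assumption, not real data): C-rich half, then G-rich half.
toy = "C" * 3000 + "G" * 3000
origin_window = predicted_origin_window(toy, window=1000)
```

On a short fragment the same calculation only yields a local minimum, which need not coincide with the origin; this is the limitation that a classifier working on local sequence motifs is meant to avoid.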