GBRAP: a tool to retrieve, parse and analyze GenBank files of viral and bacterial species

Chiara Vischioni, Valerio Giaccone, Paolo Catellani, Leonardo Alberghini, Riccardo Miotti Scapin, Cristian Taccioli

bioRxiv (2021)

Abstract
Summary: GenBank files contain genomic data of sequenced living organisms. Here, we present GBRAP (GenBank Retrieving, Analyzing and Parsing software), a tool written in Python 3 that can be used to easily download, parse and analyze viral and bacterial GenBank files, even when they contain more than one genomic sequence per species. GBRAP can analyze multiple files simultaneously using single command-line parameters, and it returns a single table showing the genomic characteristics of each organism. It can also calculate Shannon, LZSS (Lempel–Ziv–Storer–Szymanski) and topological entropy for both the entire genome and its constituent elements, such as genes, rRNAs, tRNAs, tmRNAs and ncRNAs, together with Chargaff's second parity rule scores obtained using different mathematical methods. Moreover, GBRAP can calculate the number, length and nucleotide abundance of genomic components for each DNA strand and for the regions that overlap between the two complementary strands. To our knowledge, this is the only software that provides all of these genomic analyses in a single tool, and it can therefore be used by scientists interested in both genomics and evolutionary research.

Availability and implementation: The data underlying this article are available from the corresponding author on reasonable request.

Competing Interest Statement: The authors have declared no competing interest.
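
To make two of the metrics named in the Summary concrete, the following is a minimal Python sketch of a single-nucleotide Shannon entropy and a simple intra-strand parity score. It is not GBRAP's code: the function names `shannon_entropy` and `chargaff_pr2_scores` are hypothetical, and the skew formula |A-T|/(A+T) is only one plausible way to score Chargaff's second parity rule, since the abstract does not say which mathematical methods the tool uses.

```python
import math
from collections import Counter


def shannon_entropy(seq: str) -> float:
    """Shannon entropy, in bits per base, of the A/C/G/T composition.

    Assumption: ambiguous symbols such as N are ignored.
    """
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def chargaff_pr2_scores(seq: str) -> dict:
    """Illustrative second parity rule scores for one strand.

    Each score is a normalized absolute skew; 0.0 means the strand
    satisfies the rule exactly for that base pair (A = T or G = C).
    """
    counts = Counter(seq.upper())
    a, t, g, c = (counts[base] for base in "ATGC")
    at_skew = abs(a - t) / (a + t) if (a + t) else 0.0
    gc_skew = abs(g - c) / (g + c) if (g + c) else 0.0
    return {"AT_skew": at_skew, "GC_skew": gc_skew}


if __name__ == "__main__":
    # Toy sequence for demonstration only; not taken from any GenBank record.
    demo = "ATGCGCATTAGC"
    print(shannon_entropy(demo))
    print(chargaff_pr2_scores(demo))
```

The same functions could be applied to a whole genome or to each extracted element (gene, rRNA, tRNA and so on) once the sequences have been read from a GenBank file by whatever parser is in use.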
Keywords
GenBank files, parse