A catalogue of small proteins from the global microbiome

Yiqian Duan, Célio Dias Santos Júnior, Thomas S.B. Schmidt, Anthony Fullam, Breno L. S. de Almeida, Chengkai Zhu, Michael Kuhn, Xing-Ming Zhao, Peer Bork, Luis Pedro Coelho

bioRxiv (2023)

Abstract
Small open reading frames (smORFs) shorter than 100 codons are widespread and perform essential roles in microorganisms, where they encode proteins active in several cell functions, including signal pathways, stress response, and antibacterial activities. However, the ecology, distribution and role of small proteins in the global microbiome remain unknown. Here, we constructed a global microbial smORFs catalogue (GMSC) derived from 63,410 publicly available metagenomes across 75 distinct habitats and 87,920 high-quality isolate genomes. GMSC contains 965 million non-redundant smORFs with comprehensive annotations. We found that archaea harbor proportionally more small proteins than bacteria. We also provide a tool called GMSC-mapper to identify and annotate small proteins from microbial (meta)genomes. Overall, this publicly available resource demonstrates the immense and underexplored diversity of small proteins.

Competing Interest Statement: The authors have declared no competing interest.