uORF4u: a tool for annotation of conserved upstream open reading frames

Bioinformatics (Oxford, England)(2022)

引用 2|浏览8
暂无评分
摘要
Upstream open reading frames (uORFs, encoding so-called leader peptides) can regulate translation and transcription of downstream main ORFs (mORFs) in prokaryotes and eukaryotes. However, annotation of novel functional uORFs is challenging due their short size of usually less than 100 codons. While transcription- and translation-level next generation sequencing (NGS) methods can be used for genome-wide uORF identification, this data is not available for the vast majority of species with sequenced genomes. At the same time, the exponentially increasing amount of genome assemblies gives us the opportunity to take advantage of evolutionary conservation in our predictions of ORFs. Here we present a tool for conserved uORF annotation in 5ʹ upstream sequences of a user-defined protein of interest or a set of protein homologues. It can also be used to find small ORFs within a set of nucleotide sequences. The output includes publication-quality figures with multiple sequence alignments, sequence logos and locus annotation of the predicted uORFs in graphical vector format. uORF4u is written in Python3 and runs on Linux and MacOS. The command-line interface covers most practical use cases, while the provided Python API allows usage within a Python program and additional customisation. Source code is available from the GitHub page: https://github.com/art-egorov/uorf4u. Detailed documentation that includes an example-driven guide available at the software home page: https://art-egorov.github.io/uorf4u. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要