NeatSeq-Flow: A Lightweight High Throughput Sequencing Workflow Platform for Non-Programmers and Programmers alike

bioRxiv(2018)

引用 12|浏览20
暂无评分
摘要
Nowadays, it has become almost a necessity for many biologists to execute bioinformatics workflows (WFs) as part of their research. However, most WF-management software packages require for their operation at least some programming expertise. Here we describe NeatSeq-Flow, a platform that enables users with no programming knowledge to design and execute complex high throughput sequencing WFs. This is achieved by using a compendium of pre-built modules as well as a generic module, both do not require programming expertise. Nonetheless, NeatSeq-Flow retains the flexibility to generate sophisticated WF modules using templates and only basic Python programming abilities. NeatSeq-Flow is designed to enable easy sharing of WFs and modules by conceptually separating modules, WF design, sample information and execution. Moreover, NeatSeq-Flow works hand in hand with CONDA environments for easy installation of the WF9s analysis programs in one go. NeatSeq-Flow enables efficient WF execution on computer clusters by parallelizing on both samples and WF steps. NeatSeq-Flow operates by shell-script generation; thus it allows full transparency of the WF process. NeatSeq-Flow offers real-time WF execution monitoring, detailed documentation and self-sustaining WF backups for reproducibility. All of these features make NeatSeq-Flow an easy-to-use WF platform while not compromising for flexibility, reproducibility, transparency and efficiency.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要