THAPBI PICT - a fast, cautious, and accurate metabarcoding analysis pipeline

bioRxiv (Cold Spring Harbor Laboratory)(2023)

引用 2|浏览2
暂无评分
摘要
ABSTRACT THAPBI PICT is an open source software pipeline for metabarcoding analysis with multiplexed Illumina paired-end reads, including where different amplicons are sequenced together. We demonstrate using worked examples with our own and public data sets how, with appropriate primer settings and a custom database, THAPBI PICT can be applied to other amplicons and organisms, and used for reanalysis of existing datasets. The core dataflow of the implementation is (i) data reduction to unique marker sequences, often called amplicon sequence variants (ASVs), (ii) dynamic thresholds for discarding low abundance sequences to remove noise and artifacts (rather than error correction by default), before (iii) classification using a curated reference database. The default classifier assigns a label to each query sequence based on a database match that is either perfect, or a single base pair edit away (substitution, deletion or insertion). Abundance thresholds for inclusion can be set by the user or automatically using per-batch negative or synthetic control samples. Output is designed for practical interpretation by nonspecialists and includes a read report (ASVs with classification and counts per sample), sample report (samples with counts per species classification), and a topological graph of ASVs as nodes with short edit distances as edges. Source code available from https://github.com/peterjc/thapbi-pict/with documentation including installation instructions.
更多
查看译文
关键词
accurate metabarcoding analysis pipeline
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要