Parallel Computing with DNA Forensics Data

2022 IEEE High Performance Extreme Computing Conference (HPEC)(2022)

引用 0|浏览4
暂无评分
摘要
High-throughput sequencing (HTS) of single nu-cleotide polymorphisms (SNPs) provides advanced DNA forensics capabilities including complex mixture analysis. This paper describes a scalable pipeline for large DNA forensics data which can either be utilized on a standalone system or can also be used on high performance computing systems. This pipeline enables parallelization of processing of multiple samples. Surveillance modules detect completed sequencing datasets on both Illumina and Ion Torrent platforms. GrigoraSNPs is used for automated SNP allele calling from FASTQ files. These results are automatically loaded into the IdPrism DNA mixture analysis system. HTS SNP data analysis typically completes in roughly 7 minutes for 100M sequences, including SNP allele calling, enabling rapid access to the results within the IdPrism system for identification and complex mixture analysis of multiplexed samples.
更多
查看译文
关键词
IdPrism,FastID,Mixture deconvolution,High Performance Computing,GrigoraSNPs,IlluminaSNPs,DNA Forensics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要