ArCH: improving the performance of clonal hematopoiesis variant calling and interpretation

Irenaeus C. C. Chan, Alex Panchot, Evelyn Schmidt,Samantha McNulty,Brian J. Wiley,Jie Liu, Kimberly Turner,Lea Moukarzel,Wendy S. W. Wong,Duc Tran, J. Scott Beeler, Armel Landry Batchi-Bouyou,Mitchell J. Machiela,Danielle M. Karyadi, Benjamin J. Krajacich,Junhua Zhao,Semyon Kruglyak, Bryan Lajoie, Shawn Levy,Minal Patel,Philip W. Kantoff,Christopher E. Mason,Daniel C. Link,Todd E. Druley,Konrad H. Stopsack,Kelly L. Bolton

BIOINFORMATICS(2024)

引用 0|浏览1
暂无评分
摘要
Motivation The acquisition of somatic mutations in hematopoietic stem and progenitor stem cells with resultant clonal expansion, termed clonal hematopoiesis (CH), is associated with increased risk of hematologic malignancies and other adverse outcomes. CH is generally present at low allelic fractions, but clonal expansion and acquisition of additional mutations leads to hematologic cancers in a small proportion of individuals. With high depth and high sensitivity sequencing, CH can be detected in most adults and its clonal trajectory mapped over time. However, accurate CH variant calling is challenging due to the difficulty in distinguishing low frequency CH mutations from sequencing artifacts. The lack of well-validated bioinformatic pipelines for CH calling may contribute to lack of reproducibility in studies of CH.Results Here, we developed ArCH, an Artifact filtering Clonal Hematopoiesis variant calling pipeline for detecting single nucleotide variants and short insertions/deletions by combining the output of four variant calling tools and filtering based on variant characteristics and sequencing error rate estimation. ArCH is an end-to-end cloud-based pipeline optimized to accept a variety of inputs with customizable parameters adaptable to multiple sequencing technologies, research questions, and datasets. Using deep targeted sequencing data generated from six acute myeloid leukemia patient tumor: normal dilutions, 31 blood samples with orthogonal validation, and 26 blood samples with technical replicates, we show that ArCH improves the sensitivity and positive predictive value of CH variant detection at low allele frequencies compared to standard application of commonly used variant calling approaches.Availability and implementation The code for this workflow is available at: https://github.com/kbolton-lab/ArCH.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要