Chrome Extension
WeChat Mini Program
Use on ChatGLM

VariCarta: a comprehensive database of harmonized genomic variants found in ASD sequencing studies

Autism Research(2019)

Cited 3|Views10
No score
Abstract
Background Recent years has seen a boom in the application of the next-generation sequencing technology to the study of human diseases, including Autism Spectrum Disorder (ASD), where the focus has been on identifying rare, possibly causative genomic variants in ASD individuals. Because of the high genetic heterogeneity of ASD, a large number of subjects is needed to establish evidence for a variant or gene ASD-association, thus aggregating data across cohorts and studies is necessary. However, methodological inconsistencies and subject overlap across studies complicate data aggregation. Description Here we present VariCarta, a web-based database developed to address these challenges by collecting, reconciling and consistently cataloguing literature-derived genomic variants found in ASD subjects using ongoing semi-manual curation. The careful manual curation combined with a robust data import pipeline rectifies errors, converts variants into a standardized format, identifies and harmonizes cohort overlaps and documents data provenance. The harmonization aspect is especially important since it prevents the potential double-counting of variants which can lead to inflation of gene-based evidence for ASD-association. Conclusion VariCarta is the largest collection of systematically curated, harmonized and comprehensively annotated literature-derived ASD-associated variants. The database currently contains 35,615 variant events from 8,044 subjects, collected across 50 publications, and reconciles 6,057 variants that have been reported in literature multiple times. VariCarta is freely accessible at .
More
Translated text
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined