A pipeline for RNA-seq data processing and quality assessment.

BIOINFORMATICS(2011)

引用 70|浏览0
暂无评分
摘要
We present an R based pipeline, ArrayExpressHTS, for pre-processing, expression estimation and data quality assessment of high-throughput sequencing transcriptional profiling (RNA-seq) datasets. The pipeline starts from raw sequence files and produces standard Bioconductor R objects containing gene or transcript measurements for downstream analysis along with web reports for data quality assessment. It may be run locally on a user's own computer or remotely on a distributed R-cloud farm at the European Bioinformatics Institute. It can be used to analyse user's own datasets or public RNA-seq datasets from the ArrayExpress Archive.The R package is available at www.ebi.ac.uk/tools/rcloud with online documentation at www.ebi.ac.uk/Tools/rwiki/, also available as supplementary material.
更多
查看译文
关键词
supplementary data,own datasets,uk supplementary information,bioinformatics online,standard bioconductor r object,online documentation,rna-seq data processing,r package,public rna-seq datasets,european bioinformatics institute,data quality assessment,data processing,sequence alignment,internet,rna,computational biology,gene expression profiling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要