Title: NGSPERL: a semi-automated framework for large scale next generation sequencing data analysis

Authors: Quanhu Sheng; Shilin Zhao; Mingsheng Guo; Yu Shyr

Addresses: Center for Quantitative Sciences, Department of Cancer Biology, Vanderbilt University, Nashville TN 37232, USA ' Center for Quantitative Sciences, Department of Cancer Biology, Vanderbilt University, Nashville, TN 37232, USA ' Center for Quantitative Sciences, Department of Cancer Biology, Vanderbilt University, Nashville, TN 37232, USA ' Center for Quantitative Sciences, Department of Cancer Biology, Vanderbilt University, Nashville, TN 37232, USA

Abstract: High-throughput sequencing technologies have been widely used in medical and biological research, especially in cancer biology. With the huge amounts of sequencing data being generated, data analysis has become the bottle-neck of the research procedure. We have designed and implemented NGSPERL, a semi-automated module-based framework, for high-throughput sequencing data analysis. Three major analysis pipelines with multiple tasks have been developed for RNA sequencing, exome sequencing, and small RNA sequencing data. Each task was developed as module. The module uses the output from the previous task as the input parameter to generate the corresponding portable batch system (PBS) script. The PBS scripts can be either submitted to cluster or run directly based on user choice. Multiple tasks can also be combined together as a single task to simplify the data analysis. Such a flexible framework will significantly automate and simplify the process of large scale sequencing data analysis.

Keywords: next generation sequencing; high throughput; module-based pipelines; genomics; computational biology; data analysis; RNA sequencing; exome sequencing; PBS scripts; bioinformatics.

DOI: 10.1504/IJCBDD.2015.072082

International Journal of Computational Biology and Drug Design, 2015 Vol.8 No.3, pp.203 - 211

Received: 11 Aug 2014
Accepted: 18 Nov 2014

Published online: 30 Sep 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article