Title: Assessing gene length biases in gene set analysis of Genome-Wide Association Studies

Authors: Peilin Jia, Jian Tian, Zhongming Zhao

Addresses: Departments of Biomedical Informatics and Psychiatry, Vanderbilt University Medical Centre, Nashville, Tennessee 37232, USA. ' Department of Computer Science, Vanderbilt University, Nashville, Tennessee 37232, USA. ' Departments of Biomedical Informatics, Psychiatry, and Cancer Biology, Vanderbilt University Medical Centre, Nashville, Tennessee 37232, USA

Abstract: Genome-Wide Association Studies (GWAS) have rapidly become a major genetics approach to studying complex diseases. Although many susceptibility variants and genes have been uncovered by single marker analysis, gene set based analysis is emerging as a very promising approach aiming to detect joint association of a set of genes with disease. In the available gene set based methods, it is often the smallest P value of the Single Nucleotide Polymorphisms (SNPs) in a gene region is used to represent the gene-level association signal. This approach may introduce strong bias of association signal towards long genes. In this study, we propose a resampling strategy by randomly generating genomic intervals across the accessible genomic region to estimate the background distribution of P values at the gene level. Comparing with the gene-wise P value in real data, the proportion of random intervals could be used to assess the bias that might be introduced by gene length and in turn to help the investigators choose the appropriate gene set analysis algorithms in their GWAS datasets. Our method uses only summarised GWAS data with no need of permutation, thus, it is computationally efficient. A computer program is freely available for the users.

Keywords: GWAS; genome-wide association studies; pathway enrichment analysis; gene sets; gene length bias; genetics; single nucleotide polymorphisms.

DOI: 10.1504/IJCBDD.2010.038394

International Journal of Computational Biology and Drug Design, 2010 Vol.3 No.4, pp.297 - 310

Published online: 04 Feb 2011 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article