An integrated multivariate group sparse approach to identify differentially expressed genes of breast cancer data
Online publication date: Sat, 18-May-2019
by N.A.D.N. Napagoda
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 22, No. 2, 2019
Abstract: Identifying differentially expressed genes plays an important role in disease diagnosis and prognosis. In this study, we use Student's t-statistic to analyse genes in publicly available breast cancer data. Because the t values for the same gene differ across data sets, they cannot be used separately to identify cancer-related genes. The presence of noise in gene expression data may also affect the performance of the study. Therefore, we develop an Integrated Multivariate Group Sparse (IMGS) model based on the combined Student's t-statistics of multiple independent data sets. Stability selection is used to identify the optimal values of the tuning parameter in the IMGS method. We illustrate the performance of Student's t-statistic, GeneMeta, metaMa and the IMGS model on breast cancer genes, using reference genes from GWAS. According to the results, the IMGS model is a more appropriate statistical approach than the other three methods for identifying the most significant genes in multiple gene expression data sets.