Authors: Liangjiang Wang, Aidong Zhang, Murali Ramanathan
Addresses: Department of Computer Science and Engineering, State University of New York at Buffalo, 201 Bell Hall, Buffalo, NY 14260, USA. ' Department of Computer Science and Engineering, State University of New York at Buffalo, 201 Bell Hall, Buffalo, NY 14260, USA. ' Department of Pharmaceutical Sciences, State University of New York at Buffalo, Buffalo, NY 14260, USA
Abstract: Biomedical research is now generating large amounts of data, ranging from clinical test results to microarray gene expression profiles. The scale and complexity of these datasets give rise to substantial challenges in data management and analysis. It is highly desirable that data warehousing and online analytical processing technologies can be applied to biomedical data integration and mining. The major difficulty probably lies in the task of capturing and modelling diverse biological objects and their complex relationships. This paper describes multidimensional data modelling for biomedical data warehouse design. Since the conventional models such as star schema appear to be insufficient for modelling clinical and genomic data, we develop a new model called BioStar schema. The new model can capture the rich semantics of biomedical data and provide greater extensibility for the fast evolution of biological research methodologies.
Keywords: clinical data; genomic data; data integration; multidimensional data modelling; biomedical data; data warehouse design; bioinformatics; data warehousing; online analytical processing; data mining; computational biology.
International Journal of Bioinformatics Research and Applications, 2005 Vol.1 No.1, pp.63 - 80
Published online: 21 Apr 2005 *Full-text access for editors Access for subscribers Purchase this article Comment on this article