Problems and systematic solutions in data quality
by Xingsen Li, Lingling Zhang, Peng Zhang, Yong Shi
International Journal of Services Sciences (IJSSCI), Vol. 2, No. 1, 2009

Abstract: Data are important for making decisions. However, the quality of the data affects the quality of decisions. Data mining as one of the most important sources of knowledge needs high quality data to mine, but data of sufficient quality is often lacking. By systematically analysing the reasons causing low data quality in data mining, we found that general methods on improving data quality by data cleaning are not enough. A new method for improving data quality called data mining consulting has been established. It defines data quality in a wider range from the customer view of data mining, finds potential data quality problems at an earlier stage and solves the data quality problem by a series methods including software techniques, data mining principles and management rules such as ISO 9000. Its application in a web company shows that it is practical and can increase data quality from the very beginning.

Online publication date: Thu, 11-Dec-2008

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Services Sciences (IJSSCI):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com