The implementation of an automatic web-driven data analysis framework Online publication date: Mon, 19-Jun-2017
by Chun-Hsiung Tseng; Yung-Hui Chen; Yan-Ru Jiang
International Journal of Social and Humanistic Computing (IJSHC), Vol. 2, No. 3/4, 2017
Abstract: Containing a huge amount of data, the web is undoubtedly a very good source of information. However, performing analysis against data fetched from the web is not an easy task. First, the web is designed to be document-centric rather than data-centric. The former refers to websites that are designed for presenting documents only while the latter refers to websites that are designed for rendering datasets. As a result, reading data shown on web pages is comfortable but collecting data is difficult. Imagine repeating the copy-paste procedure for thousands of web pages. Second, the diversity of the presentation style of web pages makes data normalisation essential but difficult. Last but not the least, data analysis itself demands high statistics skill and sometimes may even require domain expertise. In this research, the researchers would like to address these issues by designing a data analysis tool for the web.
Online publication date: Mon, 19-Jun-2017
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Social and Humanistic Computing (IJSHC):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email email@example.com