Title: In-depth querying of web-based medical documents: beyond single page results

Authors: Aastha Madaan; Wanming Chu

Addresses: Graduate Department of Computer Science and Engineering, University of Aizu, Aizu-Wakamatsu, Fukushima, 965-8580, Japan ' Graduate Department of Computer Science and Engineering, University of Aizu, Aizu-Wakamatsu, Fukushima, 965-8580, Japan

Abstract: The World Wide Web has become a large source of health information. The paper-based medical resources are becoming available on the web. Hence, web-based information retrieval, automatic page-adaptation and in-depth querying are gaining importance especially in the healthcare domain. To address the problems of these ever-expanding information systems over the internet, traditional information retrieval techniques are applied. This study is an attempt to highlight the challenges faced by the users in the healthcare domain for in-depth querying of web-based healthcare information resources. It compares the existing approaches for in-depth querying for segment-level searches rather than page-level searches. It proposes a web document segmentation-based 'query-by-segment tag (QBT)' query-interface. It utilises the semantic and structural relationships among the various content groups of a web document. Such a query-interface enables the user to perform in-depth querying.

Keywords: online medical documents; DOM tree; in-depth querying; query interface; webpage segmentation; page-level search; health information; information retrieval; internet; web documents; healthcare information.

DOI: 10.1504/IJCSE.2015.072650

International Journal of Computational Science and Engineering, 2015 Vol.11 No.3, pp.284 - 296

Received: 01 Jun 2013
Accepted: 18 Jun 2013

Published online: 23 Oct 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article