Inferring relevant blocks on hyperlinked web page based on block-to-block similarity
by Keiichiro Tsukamoto; Yuki Koizumi; Hiroyuki Ohsaki; Kunio Hato; Junichi Murayama
International Journal of Knowledge and Web Intelligence (IJKWI), Vol. 4, No. 4, 2013

Abstract: Internet users devote considerable time and effort to collecting information from the web. To do so efficiently, a user who follows a hyperlink must be able to determine rapidly whether the destination web page contains the desired information. We therefore propose a method called hyperlink referring block estimation (HERB), which infers the existence and location of relevant content on destination web pages. HERB utilises the user's browsing context, in particular the selected hyperlink and the text around it. Through experiments simulating ordinary web browsing, we quantitatively investigate the effectiveness of HERB. Our experiments show that HERB can infer blocks relevant to a hyperlink with approximately 65% precision and 70% recall. Furthermore, we design two HERB implementations, namely a web proxy and a web browser, and we present an overview of a web proxy prototype and an example use case.
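
The block-to-block similarity idea in the abstract can be illustrated with a minimal sketch. The abstract does not state which similarity measure or block segmentation HERB uses, so the bag-of-words cosine similarity, the threshold value, and the names tokenize, cosine and rank_blocks below are all assumptions made for illustration, not the authors' implementation. The sketch compares the text around the selected hyperlink (the source block) with each block on the destination page and returns the blocks that score above a threshold.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_blocks(source_block, destination_blocks, threshold=0.2):
    """Return (score, index) pairs for destination blocks whose similarity
    to the source block reaches the threshold, highest score first.

    The threshold of 0.2 is an arbitrary illustrative value (assumption).
    """
    source_terms = Counter(tokenize(source_block))
    scored = []
    for index, block in enumerate(destination_blocks):
        score = cosine(source_terms, Counter(tokenize(block)))
        if score >= threshold:
            scored.append((score, index))
    return sorted(scored, reverse=True)


# Hypothetical example data: text around a clicked link, and three blocks
# taken from the destination page.
source = "Read our review of the new solar panel installation guide"
destination = [
    "Site navigation home about contact",
    "This solar panel installation guide covers mounting and wiring",
    "Copyright notice all rights reserved",
]
result = rank_blocks(source, destination)
print(result)
```

In this example only the second destination block shares terms with the source block, so it is the only one returned. A real system would also need to segment pages into blocks and would likely weight terms (for example with TF-IDF), which this sketch leaves out.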

Online publication date: Sat, 26-Jul-2014
