<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns="http://purl.org/rss/1.0/">
<channel rdf:about="http://www.inderscience.com/current_issue_rss/index.php?journal=ijkwi">
<title>Most recent issue published online for the International Journal of Knowledge and Web Intelligence.</title>
<description>International Journal of Knowledge and Web Intelligence</description>
<link>http://www.inderscience.com/browse/index.php?journalID=284&amp;year=2011&amp;vol=2&amp;issue=4</link>
<dc:publisher>Inderscience Publishers Ltd</dc:publisher>
<dc:language>en-uk</dc:language>
<prism:publicationName>International Journal of Knowledge and Web Intelligence</prism:publicationName>
<prism:issn>1755-8255</prism:issn>
<prism:eIssn>1755-8263</prism:eIssn>
<prism:copyright>&#169; 2011 Inderscience Publishers Ltd</prism:copyright>
<prism:rightsAgent>editor@inderscience.com</prism:rightsAgent>
<image rdf:resource="https://www.inderscience.com/images/files/coverImgs/ijkwi_scoverijkwi.jpg" />
<items>
<rdf:Seq>
<rdf:li rdf:resource="http://dx.doi.org/10.1504/IJKWI.2011.045162" />
<rdf:li rdf:resource="http://dx.doi.org/10.1504/IJKWI.2011.045163" />
<rdf:li rdf:resource="http://dx.doi.org/10.1504/IJKWI.2011.045164" />
<rdf:li rdf:resource="http://dx.doi.org/10.1504/IJKWI.2011.045165" />
</rdf:Seq>
</items>
</channel>
<image rdf:about="https://www.inderscience.com/images/files/coverImgs/ijkwi_scoverijkwi.jpg">
<title>International Journal of Knowledge and Web Intelligence</title>
<url>https://www.inderscience.com/images/files/coverImgs/ijkwi_scoverijkwi.jpg</url>
<link>http://www.inderscience.com/browse/index.php?journalID=284&amp;year=2011&amp;vol=2&amp;issue=4</link>
</image>
<item rdf:about="http://dx.doi.org/10.1504/IJKWI.2011.045162">
<title>Automatic web pages hierarchical classification using dynamic domain ontologies</title>
<link>http://www.inderscience.com/link.php?id=45162</link>
<description>The use of ontologies for knowledge representation has had a fast increase in the last years and they are used in several application context. One of these challenging applications is the web. Managing large amount of information on internet needs more efficient and effective methods and techniques for mining and representing information. In this article, we present a methodology for automatic topic annotation of web pages. We describe an algorithm for words disambiguation using an apposite metric for measuring the semantic relatedness and we show a technique which allows to detect the topic of the analysed document using ontologies extracted from a knowledge base. The strategy is implemented in a system where these information are used to build a topic hierarchy automatically created and not a priori defined for classifying web pages. Experimental results are presented and discussed in order to measure the effectiveness of our approach.</description>
<content:encoded><![CDATA[<p><a href="http://www.inderscience.com/link.php?id=45162"><b>Automatic web pages hierarchical classification using dynamic domain ontologies</b></A><br />Antonio M. Rinaldi<br /><i>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 231 - 256</i><br />The use of ontologies for knowledge representation has had a fast increase in the last years and they are used in several application context. One of these challenging applications is the web. Managing large amount of information on internet needs more efficient and effective methods and techniques for mining and representing information. In this article, we present a methodology for automatic topic annotation of web pages. We describe an algorithm for words disambiguation using an apposite metric for measuring the semantic relatedness and we show a technique which allows to detect the topic of the analysed document using ontologies extracted from a knowledge base. The strategy is implemented in a system where these information are used to build a topic hierarchy automatically created and not a priori defined for classifying web pages. Experimental results are presented and discussed in order to measure the effectiveness of our approach.</p>]]></content:encoded>
<dc:identifier>10.1504/IJKWI.2011.045162</dc:identifier>
<dc:source>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 231 - 256</dc:source>
<dc:creator>Antonio M. Rinaldi</dc:creator>
<dc:contributor>Dipartimento di Informatica e Sistemistica, Universit&#224; di Napoli Federico II, 80125 Via Claudio, 21, Napoli, Italy</dc:contributor>
<dc:subject>knowledge engineering</dc:subject>
<dc:subject>document analysis</dc:subject>
<dc:subject>text processing</dc:subject>
<dc:subject>semantic networks</dc:subject>
<dc:subject>ontologies</dc:subject>
<dc:subject>web page classification</dc:subject>
<dc:subject>natural language processing</dc:subject>
<dc:subject>NLP</dc:subject>
<dc:subject>hierarchical classification</dc:subject>
<dc:subject>topic annotation</dc:subject>
<dc:subject>subject annotation</dc:subject>
<dc:subject>websites.</dc:subject>
<dc:date>2012-01-29T23:20:50-05:00</dc:date>
<prism:volume>2</prism:volume>
<prism:number>4</prism:number>
<prism:startingPage>231</prism:startingPage>
<prism:endingPage>256</prism:endingPage>
<prism:publicationDate>2012-01-29T23:20:50-05:00</prism:publicationDate>
</item>
<item rdf:about="http://dx.doi.org/10.1504/IJKWI.2011.045163">
<title>Clustering&#45;based web page prediction</title>
<link>http://www.inderscience.com/link.php?id=45163</link>
<description>Web page prediction plays an important role by predicting and fetching probable web page of next request in advance, resulting in reducing the user latency. The users surf the internet either by entering URL or search for some topic or through link of same topic. For searching and for link prediction, clustering plays an important role. Besides the topic, navigational behaviour is not ignored. This paper proposes a web page prediction model giving significant importance to the user&#39;s interest using the clustering technique and the navigational behaviour of the user through Markov model. The clustering technique is used for the accumulation of the similar web pages. Similar web pages of same type reside in the same cluster, the cluster containing web pages have the similarity with respect to topic of the session. The clustering algorithms considered are K&#45;means and K&#45;mediods, where K is determined by HITS algorithm. Finally, the predicted web pages are stored in form of cellular automata to make the system more memory efficient.</description>
<content:encoded><![CDATA[<p><a href="http://www.inderscience.com/link.php?id=45163"><b>Clustering&#45;based web page prediction</b></A><br />Ruma Dutta; Anirban Kundu; Debajyoti Mukhopadhyay<br /><i>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 257 - 271</i><br />Web page prediction plays an important role by predicting and fetching probable web page of next request in advance, resulting in reducing the user latency. The users surf the internet either by entering URL or search for some topic or through link of same topic. For searching and for link prediction, clustering plays an important role. Besides the topic, navigational behaviour is not ignored. This paper proposes a web page prediction model giving significant importance to the user&#39;s interest using the clustering technique and the navigational behaviour of the user through Markov model. The clustering technique is used for the accumulation of the similar web pages. Similar web pages of same type reside in the same cluster, the cluster containing web pages have the similarity with respect to topic of the session. The clustering algorithms considered are K&#45;means and K&#45;mediods, where K is determined by HITS algorithm. Finally, the predicted web pages are stored in form of cellular automata to make the system more memory efficient.</p>]]></content:encoded>
<dc:identifier>10.1504/IJKWI.2011.045163</dc:identifier>
<dc:source>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 257 - 271</dc:source>
<dc:creator>Ruma Dutta; Anirban Kundu; Debajyoti Mukhopadhyay</dc:creator>
<dc:contributor>Netaji Subhash Engineering College, West Bengal University of Technology, Garia, Kolkata, 700152, West Benagal, India. &#39; Kuang&#45;Chi Institute of Advanced Technology, Software Building, No. 9 Gaoxin Zhong, 1st Road, High&#45;Tech Industrial Estate, Nanshan District, Shenzhen, Guangdong, 518057, China. &#39; Maharashtra Institute of Technology, S. No. 124, Paud Road, Kothrud, Pune 411038, Maharashtra, India; Web Intelligence and Distributed Computing Research Lab &#40;WIDiCoReL&#41;, GreenTower, C&#45;9&#47;1, Golf Green, Kolkata, 700095, India</dc:contributor>
<dc:subject>web page prediction</dc:subject>
<dc:subject>HITS algorithm</dc:subject>
<dc:subject>clustering</dc:subject>
<dc:subject>cellular automata</dc:subject>
<dc:subject>user latency</dc:subject>
<dc:subject>similar web pages</dc:subject>
<dc:subject>web page similarity</dc:subject>
<dc:subject>web search.</dc:subject>
<dc:date>2012-01-29T23:20:50-05:00</dc:date>
<prism:volume>2</prism:volume>
<prism:number>4</prism:number>
<prism:startingPage>257</prism:startingPage>
<prism:endingPage>271</prism:endingPage>
<prism:publicationDate>2012-01-29T23:20:50-05:00</prism:publicationDate>
</item>
<item rdf:about="http://dx.doi.org/10.1504/IJKWI.2011.045164">
<title>A framework for utilising usage trends in the crawling and indexing process of search engines</title>
<link>http://www.inderscience.com/link.php?id=45164</link>
<description>Making search engines responsive to human needs requires understanding of user navigations through the search results in response to the submitted queries. The user behaviour characterisation provides an interesting perspective towards understanding the workload imposed on the search engine and can be used to address crucial points such as load balancing, content caching, data distribution and result optimisation. The user browsing behaviour is recorded in the query logs of search engines and usually referred to as web usage data. In this paper, a technique to utilise the users&#39; browsing behaviour at the crawling and indexing process is being proposed so as to direct the crawler to download the important pages, which were not previously crawled. As the work attempts to index most of important pages based on user feedback, it would benefit the search engine to enhance its efficiency. To add further to the proposed work, the existing data structures maintained by the search engines has been refined so as to support the proposed user feedback mechanism and open more research directions.</description>
<content:encoded><![CDATA[<p><a href="http://www.inderscience.com/link.php?id=45164"><b>A framework for utilising usage trends in the crawling and indexing process of search engines</b></A><br />Neelam Duhan; A.K. Sharma<br /><i>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 272 - 291</i><br />Making search engines responsive to human needs requires understanding of user navigations through the search results in response to the submitted queries. The user behaviour characterisation provides an interesting perspective towards understanding the workload imposed on the search engine and can be used to address crucial points such as load balancing, content caching, data distribution and result optimisation. The user browsing behaviour is recorded in the query logs of search engines and usually referred to as web usage data. In this paper, a technique to utilise the users&#39; browsing behaviour at the crawling and indexing process is being proposed so as to direct the crawler to download the important pages, which were not previously crawled. As the work attempts to index most of important pages based on user feedback, it would benefit the search engine to enhance its efficiency. To add further to the proposed work, the existing data structures maintained by the search engines has been refined so as to support the proposed user feedback mechanism and open more research directions.</p>]]></content:encoded>
<dc:identifier>10.1504/IJKWI.2011.045164</dc:identifier>
<dc:source>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 272 - 291</dc:source>
<dc:creator>Neelam Duhan; A.K. Sharma</dc:creator>
<dc:contributor>Department of Computer Engineering, YMCA University of Science and Technology, Zakir Nagar, Sector&#45;6, Faridabad, India. &#39; Department of Computer Engineering, YMCA University of Science and Technology, Zakir Nagar, Sector&#45;6, Faridabad, India</dc:contributor>
<dc:subject>World Wide Web</dc:subject>
<dc:subject>internet</dc:subject>
<dc:subject>search engines</dc:subject>
<dc:subject>crawlers</dc:subject>
<dc:subject>indexing</dc:subject>
<dc:subject>query logs</dc:subject>
<dc:subject>usage trends</dc:subject>
<dc:subject>user behaviour</dc:subject>
<dc:subject>crawling</dc:subject>
<dc:subject>load balancing</dc:subject>
<dc:subject>content caching</dc:subject>
<dc:subject>data distribution</dc:subject>
<dc:subject>results optimisation</dc:subject>
<dc:subject>browsing behaviour</dc:subject>
<dc:subject>search engine efficiency.</dc:subject>
<dc:date>2012-01-29T23:20:50-05:00</dc:date>
<prism:volume>2</prism:volume>
<prism:number>4</prism:number>
<prism:startingPage>272</prism:startingPage>
<prism:endingPage>291</prism:endingPage>
<prism:publicationDate>2012-01-29T23:20:50-05:00</prism:publicationDate>
</item>
<item rdf:about="http://dx.doi.org/10.1504/IJKWI.2011.045165">
<title>Reengineering PDF&#45;based documents targeting complex software specifications</title>
<link>http://www.inderscience.com/link.php?id=45165</link>
<description>We discuss how to reengineer complex PDF&#45;based documents, such as specifications and technical books, so that end users have a better experience with them. Specifications of the object management group &#40;OMG&#41; are our initial targets. Such specifications are dense and intricate to use, and tend to have complicated structures. Our approach includes format conversion, logical structure extraction, text extraction and multi&#45;layer hypertext generation. Logical structure extraction is central, and results in an XML document with a schema tailored to the type of document. Many key concepts of a document are expressed in this schema, including concepts extracted from the patterns of words used in headings. For example in OMG specifications, package relationships and class associations can often be extracted from the wording of headings. When we produce, in the final step, a multilayer hypertext version of the document, these extracted concepts allow a richer user experience.</description>
<content:encoded><![CDATA[<p><a href="http://www.inderscience.com/link.php?id=45165"><b>Reengineering PDF&#45;based documents targeting complex software specifications</b></A><br />Mehrdad Nojoumian; Timothy C. Lethbridge<br /><i>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 292 - 319</i><br />We discuss how to reengineer complex PDF&#45;based documents, such as specifications and technical books, so that end users have a better experience with them. Specifications of the object management group &#40;OMG&#41; are our initial targets. Such specifications are dense and intricate to use, and tend to have complicated structures. Our approach includes format conversion, logical structure extraction, text extraction and multi&#45;layer hypertext generation. Logical structure extraction is central, and results in an XML document with a schema tailored to the type of document. Many key concepts of a document are expressed in this schema, including concepts extracted from the patterns of words used in headings. For example in OMG specifications, package relationships and class associations can often be extracted from the wording of headings. When we produce, in the final step, a multilayer hypertext version of the document, these extracted concepts allow a richer user experience.</p>]]></content:encoded>
<dc:identifier>10.1504/IJKWI.2011.045165</dc:identifier>
<dc:source>International Journal of Knowledge and Web Intelligence, Vol. 2, No. 4 (2011) pp. 292 - 319</dc:source>
<dc:creator>Mehrdad Nojoumian; Timothy C. Lethbridge</dc:creator>
<dc:contributor>David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada. &#39; School of Electrical Engineering and Computer Science, 800 King Edward Ave., Ottawa, ON K1N 6N5, Canada</dc:contributor>
<dc:subject>digital libraries</dc:subject>
<dc:subject>electronic publishing</dc:subject>
<dc:subject>improving user experiences</dc:subject>
<dc:subject>browsing interfaces</dc:subject>
<dc:subject>e&#45;publishing</dc:subject>
<dc:subject>document reengineering</dc:subject>
<dc:subject>PDF based documents</dc:subject>
<dc:subject>software specifications</dc:subject>
<dc:subject>object management group</dc:subject>
<dc:subject>OMG specifications</dc:subject>
<dc:subject>XML documents</dc:subject>
<dc:subject>format conversion</dc:subject>
<dc:subject>logical structure extraction</dc:subject>
<dc:subject>text extraction</dc:subject>
<dc:subject>multi&#45;layer hypertext generation</dc:subject>
<dc:subject>package relationships</dc:subject>
<dc:subject>class associations</dc:subject>
<dc:subject>document headings</dc:subject>
<dc:subject>word patterns.</dc:subject>
<dc:date>2012-01-29T23:20:50-05:00</dc:date>
<prism:volume>2</prism:volume>
<prism:number>4</prism:number>
<prism:startingPage>292</prism:startingPage>
<prism:endingPage>319</prism:endingPage>
<prism:publicationDate>2012-01-29T23:20:50-05:00</prism:publicationDate>
</item>
</rdf:RDF>

