Some linguistic methods of improving the quality of document retrieval on the internet
by Alexander Gelbukh, Grigori Sidorov, Yoel Ledo-Mezquita
International Journal of Electronic Business (IJEB), Vol. 3, No. 3/4, 2005

Abstract: One of the problems of e-business is finding relevant documents on which to base correct decisions. The main difficulty on the Internet is the huge number of documents, which makes the relevant ones hard to find; hence the importance of methods that improve the quality of document retrieval. We discuss some linguistic problems of document retrieval on the Internet related to the following natural language phenomena: (1) morphological processes: e.g., takes, took, and taken are grammatical forms of take; (2) polysemy and homonymy: most words have several senses, e.g., bank can mean a financial institution, a shore, a bench, etc.; (3) non-linearity of syntactic relations: when a query contains a word combination, the words forming that combination can be separated by other words in the documents. We propose some linguistically based methods and strategies that address these problems and improve the quality of document retrieval, or that show the need for applying linguistic methods.
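
To make phenomena (1) and (3) concrete, the following is a minimal Python sketch, not the authors' method: it maps word forms to a base form through a tiny hand-written table and then checks whether all query words occur close together in a document, even when other words intervene. The lemma table, the function names `lemmatize` and `matches`, and the `window` parameter are all assumptions introduced for illustration; polysemy (2) is not handled here.

```python
import re

# Toy lemma table (an assumption for illustration only; a real system would
# rely on a morphological analyzer or a full dictionary).
LEMMAS = {
    "takes": "take",
    "took": "take",
    "taken": "take",
    "taking": "take",
    "decisions": "decision",
}


def lemmatize(text):
    """Lowercase the text, split it into word tokens, and map each to its lemma."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [LEMMAS.get(token, token) for token in tokens]


def matches(query, document, window=5):
    """Return True if every query lemma occurs within some run of `window`
    consecutive document tokens, in any order."""
    query_lemmas = set(lemmatize(query))
    doc_lemmas = lemmatize(document)
    for start in range(len(doc_lemmas)):
        if query_lemmas <= set(doc_lemmas[start:start + window]):
            return True
    return False
```

Under these assumptions, the query "take decision" matches "The board took a difficult decision yesterday." although the document contains neither the form "take" nor the two words side by side. A narrower window makes the match stricter, a wider one more permissive.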

Online publication date: Thu, 30-Jun-2005
