Inderscience PublishersInderscience PublishersInderscience Publishers
  PUBLISHERS OF DISTINGUISHED ACADEMIC, SCIENTIFIC AND PROFESSIONAL JOURNALS

Article Abstract

Title: Persistent queries over dynamic text streams
  Author: Javed Aslam, Ekaterina Pelekhov, Daniela Rus   Email author(s)
  Address: College of Computer Science, Northeastern University, Boston, MA, USA. ' Department of Computer Science, Dartmouth College, Hanover, NH, USA. ' Department of EECS, MIT and Dartmouth, Boston, MA, USA
  Journal: International Journal of Electronic Business 2005 - Vol. 3, No.3/4  pp. 288 - 299
  Abstract: We wish to develop automated tools for information organisation that support information processing in the age of information overload. We present a filtering-based approach to persistent queries that uses clustering. We use the online version of the star algorithm developed in our previous work as our clustering tool because this algorithm computes, with high precision, naturally occurring topics in a collection and it admits an efficient online solution for dynamic streams of text. We describe the principle behind the filtering algorithms and show experimental data. We then discuss a system that uses these algorithms in support of information push by allowing users to submit persistent queries. Finally, we evaluate the persistent query system using TREC data.
  Keywords: information retrieval; filtering; persistent queries; information processing; clustering; e-business; electronic business; dynamic text streams; digital collections; browsable hierarchies; star algorithm.
  DOI: 10.1504/IJEB.2005.007273
  Access for editors and complimentary subscribers       Access for Subscribers   Purchase this Paper        We welcome your comments about this paper Comment on the Paper