Title: Adaptive information extraction from unstructured documents

Authors: Csaba Dezsenyi, Tadeusz P. Dobrowiecki, Tamas Meszaros

Addresses: Department of Measurement and Information Systems, Budapest University of Technology and Economics, Magyar Tudosok korutja 2., Budapest, H-1117, Hungary (all three authors)

Abstract: The authors present a novel adaptive framework that enables efficient development of applications demanding complex document analysis. When processing natural language documents, the task is to transform them into an application-specific structured form. Such a transformation must be designed to take into account the various abstraction levels and granularities of the processing, as well as the multitude of possibly related requests driving the application. The proposed solution is based on an adaptively planned and executed network of information processing modules. The paper presents an overview of the framework, with a focus on the adaptive mechanism. An illustrative pilot application is also provided.

Keywords: information extraction; IE; adaptive systems; language engineering platform; document analysis; unstructured documents; natural language documents; information processing.

DOI: 10.1504/IJIIDS.2007.014948

International Journal of Intelligent Information and Database Systems, 2007 Vol.1 No.2, pp.156 - 180

Published online: 20 Aug 2007
