Title: An automatic extraction method of word tendency judgement for specific subjects

Authors: Kazuhiro Morita, El-Sayed Atlam, Masao Fuketa, Yuya Iwabu, Jun-ichi Aoe

Addresses: Information Solution, Institute of Technology and Science, University of Tokushima, 2-1 Minami-Josanjima-Cho, Tokushima-Shi, Tokushima 770-8506, Japan. ' Information Solution, Institute of Technology and Science, University of Tokushima, 2-1 Minami-Josanjima-Cho, Tokushima-Shi, Tokushima 770-8506, Japan. ' Information Solution, Institute of Technology and Science, University of Tokushima, 2-1 Minami-Josanjima-Cho, Tokushima-Shi, Tokushima 770-8506, Japan. ' Information Solution, Institute of Technology and Science, University of Tokushima, 2-1 Minami-Josanjima-Cho, Tokushima-Shi, Tokushima 770-8506, Japan. ' Information Solution, Institute of Technology and Science, University of Tokushima, 2-1 Minami-Josanjima-Cho, Tokushima-Shi, Tokushima 770-8506, Japan

Abstract: In recent years, there has been a tremendous growth of online text information related to digital libraries, medical diagnostic systems, remote education, news sources and electronic commerce. There is a great need to search and organise huge amount of information in text documents. This paper focuses on word tendencies in documents and presents an automatic extraction method for specific subject. Field judgment is conducted by using field association words and similarity among word tendencies, and other word tendencies are computed with information. Then word tendencies which have the same subject are grouped as one group and the important word tendencies are chosen from that group. Finally, a system suggests word tendencies from specific subjects and fields are implemented. From the experimental result, about 67% of suggested word tendencies have been associated with popular subjects.

Keywords: word tendencies; field association words; similarity; automatic extraction; specific subjects; subject recognition; information retrieval.

DOI: 10.1504/IJCAT.2009.026604

International Journal of Computer Applications in Technology, 2009 Vol.35 No.2/3/4, pp.281 - 295

Published online: 20 Jun 2009 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article