Title: Study of repeated e-government project audit based on text mining

Authors: Yan Hong Chen; Hui Hui Li; Zhi Nan Yu

Addresses: School of Information, Zhejiang University of Finance and Economics, Hangzhou, China; School of Information, Zhejiang University of Finance and Economics, Hangzhou, China; School of Information, Zhejiang University of Finance and Economics, Hangzhou, China

Abstract: In recent years, a large amount of unstructured text data has been produced in the auditing field. To extract the potential knowledge and audit trails contained in this data, researchers are paying increasing attention to text mining technology. In this paper, we first introduce the basic concepts and applications of text mining. We then use the TF-IDF method to model text documents as term frequency vectors, and compute the similarity between documents using cosine similarity. Experimental results on a repeated e-government project audit show that this text analysis method achieves relatively good accuracy.
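
The approach named in the abstract (TF-IDF weighting followed by cosine similarity) can be illustrated with a minimal sketch. This is not the authors' implementation: the smoothed IDF formula, the whitespace tokenization, the function names, and the sample project descriptions are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumption: a smoothed IDF, log((1 + N) / (1 + df)) + 1, is used so that
    terms appearing in every document still get a non-zero weight.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty documents
        vectors.append({
            term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical project descriptions, not data from the paper.
projects = [
    "online tax filing system for citizens".split(),
    "online tax filing platform for citizens".split(),
    "hospital appointment booking portal".split(),
]
vecs = tfidf_vectors(projects)
sim_repeated = cosine_similarity(vecs[0], vecs[1])   # near-duplicate pair
sim_unrelated = cosine_similarity(vecs[0], vecs[2])  # no shared terms
```

In a sketch like this, a pair of project descriptions whose similarity exceeds some chosen threshold would be flagged as a candidate repeated project for an auditor to review; the threshold itself is a tuning choice the abstract does not specify.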

Keywords: text mining; project audit; TF-IDF; repeated project; e-government.

DOI: 10.1504/IJITM.2017.086871

International Journal of Information Technology and Management, 2017 Vol.16 No.4, pp.391 - 404

Received: 24 Jan 2016
Accepted: 04 May 2016

Published online: 02 Oct 2017
