Title: Feature extraction of public English vocabulary text based on wavelet transform
Authors: Shengyan Qiu
Addresses: Qingdao Vocational and Technical College of Hotel Management, Qingdao 266100, China
Abstract: Aiming at the problems of low extraction accuracy and long time-consuming in the existing public English vocabulary text feature extraction methods, a public English vocabulary text feature extraction method based on wavelet transform is proposed. Set the existence space of public English vocabulary text, convert the text into vector mode, and cluster the text vector by clustering and fusion method. The distance between lexical text vectors in space is calculated, the similarity between vectors is determined, and the noise in text data is removed by information gain method; divide several small bands of different sizes, determine the change frequency of small band vocabulary text data, integrate the eigenvalues of different bands of text, and carry out wavelet transform on the integrated data features to obtain the transformation result, which is the text feature. The results show that the accuracy of the proposed method is 96%.
Keywords: wavelet transform; public English vocabulary text; feature extraction; clustering and fusion; inner product calculation; small band.
DOI: 10.1504/IJRIS.2022.125453
International Journal of Reasoning-based Intelligent Systems, 2022 Vol.14 No.2/3, pp.97 - 103
Received: 13 Oct 2021
Accepted: 16 Mar 2022
Published online: 09 Sep 2022 *