Title: Measurement of sentence similarity based on constituency parsing and dilated convolution

Authors: MingYu Ji; ChenLong Wang; Gang Liu

Addresses: College of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, Heilongjiang, China ' College of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, Heilongjiang, China ' College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, Heilongjiang, China

Abstract: Measurement of sentence similarity is widely used in the field of natural language processing, the current mainstream method is based on neural network similarity model. In actual application, the method of neural network has some disadvantages. On the one hand, when sentences are input to the neural network, the problem of semantic loss is caused by the interception and zero-filling operation of the sentence that is too long or too short. On the other hand, ignore the semantic relation between interval words. Thus, this paper proposes a method of measuring sentence similarity based on constituency parsing and dilated convolution. It uses constituency parsing to design rules to reduce unimportant semantic component of long sentences and supplement important semantic component of short sentences. In addition, the receptive fields in sentence dimension and word vector dimension are dilated to capture the semantic association of the two-dimensional interval words. Finally, the method is verified on two data sets.

Keywords: sentence similarity; neural network; constituency parsing; semantic component; dilated convolution.

DOI: 10.1504/IJCAT.2020.111842

International Journal of Computer Applications in Technology, 2020 Vol.64 No.3, pp.252 - 259

Received: 23 May 2020
Accepted: 23 Jun 2020

Published online: 16 Dec 2020 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article