Title: Improved class-specific vector for biomedical question type classification

Authors: Tanu Gupta; Ela Kumar

Addresses: Department of Computer Science and Engineering, Indira Gandhi Delhi Technical University for Women, New Delhi, India ' Department of Computer Science and Engineering, Indira Gandhi Delhi Technical University for Women, New Delhi, India

Abstract: Polysemy is a constant issue in biomedical terms which also affects its QA system. In our work, we consider polysemous words as weak aspect in biomedical question classification and propose two vector model-based solutions that determine the class-specific features of biomedical terms. In first approach, label independent class vector and general word vector are combined using linear compositionality property of vector to generate multiple class-specific embeddings of words. Second is the feature fusion approach, which combines the class-specific sense vector of a word with vectors generated in the first approach. Besides this, a survey dataset COVID-S is introduced in this paper, which is a collection of public queries, myths, and doubts about novel COVID-19 diseases. The series of experiments are performed on two biomedical questions datasets, BioASQ8b and COVID-S, and the results of comparisons with other state-of-arts prove its integrity using SVM and naïve Bayes.

Keywords: class vector; polysemy; biomedical question classification; sense embedding; COVID-S.

DOI: 10.1504/IJCSE.2023.129737

International Journal of Computational Science and Engineering, 2023 Vol.26 No.2, pp.182 - 191

Received: 11 Mar 2021
Accepted: 01 Nov 2021

Published online: 22 Mar 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article