Title: Morphosemantic strategies for the automatic enrichment of Italian lexical databases in the medical domain
Authors: Flora Amato; Antonino Mazzeo; Annibale Elia; Alessandro Maisto; Serena Pelosi
Addresses: Department of Electrical Engineering and Information Technology, University of Napoli Federico II, 80125 Napoli, Italy ' Department of Electrical Engineering and Information Technology, University of Napoli Federico II, 80125 Napoli, Italy ' Department of Political, Social and Communication Sciences, University of Salerno, 84084 Fisciano, Italy ' Department of Political, Social and Communication Sciences, University of Salerno, 84084 Fisciano, Italy ' Department of Political, Social and Communication Sciences, University of Salerno, 84084 Fisciano, Italy
Abstract: Because of the importance of the information conveyed by the clinical documents and owing to the large quantity of raw texts produced in the healthcare system, it became a determinant challenge, in the NLP research field, to arrange the extraction and the management of meaningful data, starting from real text occurrences. In this paper we approach a corpus of 5000 medical diagnoses with sophisticated linguistic and computational devices, which are able to access the semantic dimension of words and sentences contained in it. Our morphosemantic method is grounded on a list of neoclassical formative elements pertaining to the medical domain which has been used for the automatic creation and population of medical lexical resources. The outcomes of this work are automatically built electronic dictionaries and thesauri and an annotated corpus for the NLP in the medical domain.
Keywords: medical thesauri population; neoclassical formative elements; natural language processing; medical lexicon.
DOI: 10.1504/IJGUC.2017.088262
International Journal of Grid and Utility Computing, 2017 Vol.8 No.4, pp.312 - 320
Received: 17 Feb 2015
Accepted: 26 Nov 2015
Published online: 01 Dec 2017 *