Named entity recognition for weather domain text in Hindi
Online publication date: Fri, 29-Oct-2021
by Gaurav Yadav; Santosh Singh Rathore; Debanjan Sadhya
International Journal of Swarm Intelligence (IJSI), Vol. 6, No. 2, 2021
Abstract: Named entity recognition (NER) is the task of assigning each entity mentioned in a text to a pre-defined category, such as PE for the name of a person, LOC for the name of a location, or ORG for the name of an organisation. NER is an important step in text mining whenever textual information must be searched. Each information domain has its own set of entities, which calls for a domain-dependent NER system. This paper presents an NER approach for identifying entities in weather-domain texts in the Hindi language. The approach has two stages. In the first, we collect weather data by crawling Hindi weather forecasting websites. In the second, we train a model by applying a machine learning algorithm, together with a vector representation of the Hindi language, to the collected data. The trained model is then used to classify entities in unseen weather text. Experimental results show that the presented approach improves NER results on the weather dataset used.
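The abstract does not name the learning algorithm, the vector representation, or the entity tag set, so the following Python sketch is only an illustration of the general idea of training a classifier on vectorised tokens. Everything in it is an assumption made for illustration: the hashed character n-gram vectors, the nearest-centroid classifier, the tags LOC, WEATHER, and O, and the tiny hand-written training list stand in for whatever the paper actually uses.

```python
import math
import zlib
from collections import defaultdict

DIM = 64  # size of the hashed feature vector (arbitrary, illustrative choice)


def embed(token):
    """Map a token to a unit vector by hashing its character n-grams (n = 1..3).

    This is a stand-in for a real Hindi word-vector model, not the paper's method.
    """
    vec = [0.0] * DIM
    padded = f"<{token}>"  # boundary markers so prefixes and suffixes differ
    for n in (1, 2, 3):
        for i in range(len(padded) - n + 1):
            gram = padded[i:i + n]
            # crc32 is deterministic across runs, unlike the built-in hash()
            vec[zlib.crc32(gram.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def train(tagged_tokens):
    """Average the vectors of each label to obtain one centroid per label."""
    sums = defaultdict(lambda: [0.0] * DIM)
    counts = defaultdict(int)
    for token, label in tagged_tokens:
        for i, v in enumerate(embed(token)):
            sums[label][i] += v
        counts[label] += 1
    return {label: [v / counts[label] for v in s] for label, s in sums.items()}


def predict(centroids, token):
    """Return the label whose centroid has the largest dot product with the token vector."""
    vec = embed(token)
    return max(
        centroids,
        key=lambda label: sum(a * b for a, b in zip(vec, centroids[label])),
    )


# Hypothetical hand-labelled tokens; a real system would use the crawled corpus.
training_data = [
    ("दिल्ली", "LOC"),      # Delhi
    ("मुंबई", "LOC"),       # Mumbai
    ("बारिश", "WEATHER"),   # rain
    ("तापमान", "WEATHER"),  # temperature
    ("में", "O"),           # in
    ("है", "O"),            # is
]

centroids = train(training_data)
for word in ["दिल्ली", "बारिश", "कोहरा"]:  # the last word (fog) is unseen
    print(word, predict(centroids, word))
```

A nearest-centroid classifier is used here only because it fits in a few lines; with so little training data its predictions on unseen words are not meaningful, and a practical system would train a sequence model on a much larger annotated corpus.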