Statistical pair pruning towards target class in learning-based anaphora resolution for Tamil Online publication date: Mon, 27-Nov-2017
by K. Arul Deepa; C. Deisy
International Journal of Advanced Intelligence Paradigms (IJAIP), Vol. 9, No. 5/6, 2017
Abstract: Anaphora resolution is an important task to be achieved in many natural language understanding (NLU) applications including machine translation. This paper proposes learning-based system to resolve pronouns in Tamil text built around various classification algorithms. To improve learning accuracy, the system is built in two folds. First is feature vector production where mentions are identified, characterised then a feature vectors of lexical, syntactic and semantic features are produced. Next is the pair pruning module where, number of non-target class pairs is reduced by deep statistical analysis of feature vector. Incorporating deeper pair pruning module dramatically increases the f-measure score when compared to training the same models but without the pruning module. On the tourism dataset of TDIL we trained the system with various classification algorithms and obtained encouraging results for a challenging language, Tamil. We discuss how varying the ratio of f-measure, precision and recall is between with and without the pruning module in comparative model.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Advanced Intelligence Paradigms (IJAIP):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com