Hybrid singular value decomposition; a model of human text classification
Online publication date: Thu, 14-Dec-2006
by Amirali Noorinaeini, Mark R. Lehto
International Journal of Human Factors Modelling and Simulation (IJHFMS), Vol. 1, No. 1, 2006
Abstract: The objective of this study was to investigate and compare the accuracy of three Singular Value Decomposition (SVD) based models in classifying injury narratives into external-cause-of-injury and poisoning code (E-code) categories. Two SVD-Bayesian models and one SVD-Regression model were developed for free-text classification. The study used injury narratives and the corresponding E-codes assigned by human experts from the 1997 and 1998 US National Health Interview Survey (NHIS). Sensitivity, specificity and positive predictive value were measured by comparing the results of all three models with the E-code categories assigned by experts. The performance of the equidistant Bayes model and the regression model improved as more SVD vectors were used as input. The regression model was also compared to the fuzzy Bayes model. It was concluded that all three models are capable of learning from human experts to accurately categorise cause-of-injury codes from injury narratives, with the regression-based model being the strongest.
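The general idea of an SVD-plus-regression text classifier, and the three evaluation measures named in the abstract, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy term-count matrix, the least-squares fit and the 0.5 threshold are all assumptions made for illustration, and the two SVD-Bayesian variants are not shown.

```python
import numpy as np

def svd_features(X, k):
    """Project documents (rows of X, a documents x terms matrix) onto the
    top-k right singular vectors. Returns the features and the basis."""
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    Vk = Vt[:k].T                      # terms x k basis
    return X @ Vk, Vk

def _with_intercept(Z):
    """Append a column of ones so the linear model has an intercept."""
    return np.hstack([Z, np.ones((Z.shape[0], 1))])

def fit_regression(Z, y):
    """Ordinary least-squares fit of binary labels y on SVD features Z."""
    w, *_ = np.linalg.lstsq(_with_intercept(Z), y, rcond=None)
    return w

def predict(Z, w, threshold=0.5):
    """Assign the category when the fitted score reaches the threshold
    (the threshold value is an assumption, not taken from the paper)."""
    return (_with_intercept(Z) @ w >= threshold).astype(int)

def metrics(y_true, y_pred):
    """Sensitivity, specificity and positive predictive value for one
    category, treating the expert-assigned label as the reference."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return sensitivity, specificity, ppv

# Hypothetical toy data: 4 narratives x 4 terms, two narratives per category.
X_toy = np.array([[2, 1, 0, 0],
                  [1, 2, 0, 0],
                  [0, 0, 2, 1],
                  [0, 0, 1, 2]], dtype=float)
y_toy = np.array([1, 1, 0, 0], dtype=float)

Z_toy, V_toy = svd_features(X_toy, k=2)
w_toy = fit_regression(Z_toy, y_toy)
pred_toy = predict(Z_toy, w_toy)
print(metrics(y_toy, pred_toy))
```

Increasing `k` corresponds to giving the model more SVD vectors as input, which is the setting the abstract reports as improving performance. New narratives would be projected with the same basis (`X_new @ V_toy`) before calling `predict`.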