Title: Predictive mining for stock market based on live news TF-IDF features

Authors: Vaishali Ingle; Sachin Deshmukh

Addresses: Department of Computer Science and IT, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, Maharashtra-431002, India ' Department of Computer Science and IT, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, Maharashtra-431002, India

Abstract: The various machine learning algorithms are used for prediction of stock market movement. The data collected for stock market is in the form of breaking news from various finance websites. The TF-IDF features extracted from online news data are used for creation of HMM model along with log likelihood values. The next day's stock price is predicted as either higher or lower than current day's stock price. Results obtained from proposed model is compared with results from other machine learning predictive techniques such as random forest, KNN, multiple regression, bagging and boosting. The proposed model produces approximately 70% of accurate prediction. The captured features are graphically represented with word cloud. The results can be further improved with the use of deep learning ensemble methods.

Keywords: text mining; stock market; HMM; bagging; boosting; multiple regression; random forest; finance news; TF-IDF; word cloud; autonomic computing.

DOI: 10.1504/IJAC.2017.089703

International Journal of Autonomic Computing, 2017 Vol.2 No.4, pp.341 - 365

Received: 07 Dec 2016
Accepted: 22 Sep 2017

Published online: 07 Feb 2018 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article