Title: Building multi-factor stock selection models using balanced split regression trees with sorting normalisation and hybrid variables

Authors: I-Cheng Yeh; Che-Hui Lien; Tao-Ming Ting

Addresses: Department of Civil Engineering, Tamkang University, New Taipei City, Taiwan; Department of Marketing, International Business, and Entrepreneurship, Thompson Rivers University, Kamloops, BC, Canada; Department of Senior Citizen Service Management, Ching Kuo Institute of Management and Health, Keelung City, Taiwan

Abstract: This research used regression trees to build predictive models of portfolio rate of return and tested them in an empirical study of the Taiwan stock market. The study applied sorting normalisation to both the independent and dependent variables and used balanced split regression trees to address the shortcomings of traditional regression trees. The results show that (a) models built on sorting-normalised independent and dependent variables predict the portfolio rate of return better; (b) the balanced split regression trees perform well except when the training period is 1999 to 2000, possibly because the dot-com bubble peaked in 2000 and changed investors' behaviour; and (c) models trained on bull-market data predict better than models trained on bear-market data.
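
As a rough illustration of the sorting normalisation idea named in the abstract, the sketch below replaces each raw value with its rank within one cross-section, scaled to the interval [0, 1]. The abstract does not give the paper's exact formula, so the function name `rank_normalise`, the [0, 1] scaling, and the average-rank treatment of ties are all assumptions made for this example.

```python
def rank_normalise(values):
    """Map each value to its rank within the list, scaled to [0, 1].

    Hypothetical helper: the scaling and the tie handling (average rank)
    are assumptions, not the paper's stated procedure.
    """
    n = len(values)
    if n == 0:
        return []
    if n == 1:
        return [0.5]  # a single observation has no relative position

    # Indices of the observations, ordered from smallest to largest value.
    order = sorted(range(n), key=lambda i: values[i])

    ranks = [0.0] * n
    i = 0
    while i < n:
        # Extend j over a run of tied values.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0  # zero-based average rank of the run
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1

    # Smallest value maps to 0.0, largest to 1.0.
    return [r / (n - 1) for r in ranks]


if __name__ == "__main__":
    # Made-up factor values for three stocks in one period.
    print(rank_normalise([10, 30, 20]))
```

Applied to every factor and to the return variable in each period, a transform of this kind puts all variables on a common scale and makes them insensitive to outliers, which is one plausible reason a rank-based normalisation would help a tree model.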

Keywords: stock markets; stock selection models; multi-factor selection models; balanced split regression trees; sorting normalisation; hybrid variables; Taiwan; modelling; bull markets; bear markets.

DOI: 10.1504/IJFIP.2015.070081

International Journal of Foresight and Innovation Policy, 2015 Vol.10 No.1, pp.48 - 74

Received: 14 Oct 2014
Accepted: 29 Dec 2014

Published online: 25 Jun 2015
