Title: Computational prediction of toxicity
Authors: Meenakshi Mishra; Hongliang Fei; Jun Huan
Addresses: Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, 66047-7621 KS, USA ' Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, 66047-7621 KS, USA ' Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, 66047-7621 KS, USA
Abstract: With increasing number of chemicals produced each year, it still remains a daunting task to keep up with the toxicity profile of each chemical. In this paper, we attempt to predict toxicity of compounds using computational techniques, where results from certain in vitro assays applied on 309 chemicals, along with computed properties of chemicals are used to predict the toxicity caused by them at a particular endpoint. We show that both Random Forest (RF) and Naïve Bayes (NB) have a good performance. We also show that using small and related trees in RF helps to further improve the performance.
Keywords: toxicity prediction; random forest; machine learning; computational prediction; EPA; TOXCAST; computational learning methods; L1 norm; L2 norm; graph boosting; bioinformatics; chemicals; chemical toxicity; naive Bayes.
DOI: 10.1504/IJDMB.2013.056082
International Journal of Data Mining and Bioinformatics, 2013 Vol.8 No.3, pp.338 - 348
Received: 09 May 2011
Accepted: 09 May 2011
Published online: 20 Oct 2014 *