You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Rashmi Raghu (JIRA)" <ji...@apache.org> on 2016/03/09 02:38:40 UTC
[jira] [Created] (MADLIB-976) Random Forest - extremely long
training time
Rashmi Raghu created MADLIB-976:
-----------------------------------
Summary: Random Forest - extremely long training time
Key: MADLIB-976
URL: https://issues.apache.org/jira/browse/MADLIB-976
Project: Apache MADlib
Issue Type: Bug
Components: Module: Random Forest
Reporter: Rashmi Raghu
When running Random Forest training function on a modest data set it took a long time - much longer than expected or desired. The training data set has around 8000 rows and 400 features. Several models on similar data all took around 40,000 seconds to run. Each time we used 100 trees and 14 random features selected. Dataset shared offline.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)