You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Pralabh Kumar <pr...@gmail.com> on 2017/06/16 02:30:30 UTC
Re: featureSubsetStrategy parameter for GradientBoostedTreesModel
Hi everyone
Currently GBT doesn't expose featureSubsetStrategy as exposed by Random
Forest.
.
GradientBoostedTrees in Spark have hardcoded feature subset strategy to
"all" while calling random forest in DecisionTreeRegressor.scala
val trees = RandomForest.run(data, oldStrategy, numTrees = 1,
featureSubsetStrategy = "all",
It should provide functionality to the user to set featureSubsetStrategy
("auto", "all" ,"sqrt" , "log2" , "onethird") ,
the way random forest does.
This will help GBT to have randomness at feature level.
Jira SPARK-20199 <https://issues.apache.org/jira/browse/SPARK-20199>
Please let me know , if my understanding is correct.
Regards
Pralabh Kumar
On Fri, Jun 16, 2017 at 7:53 AM, Pralabh Kumar <pr...@gmail.com>
wrote:
> Hi everyone
>
> Currently GBT doesn't expose featureSubsetStrategy as exposed by Random
> Forest.
>
> .
> GradientBoostedTrees in Spark have hardcoded feature subset strategy to
> "all" while calling random forest in DecisionTreeRegressor.scala
>
> val trees = RandomForest.run(data, oldStrategy, numTrees = 1,
> featureSubsetStrategy = "all",
>
>
> It should provide functionality to the user to set featureSubsetStrategy
> ("auto", "all" ,"sqrt" , "log2" , "onethird") ,
> the way random forest does.
>
> This will help GBT to have randomness at feature level.
>
> Jira SPARK-20199 <https://issues.apache.org/jira/browse/SPARK-20199>
>
> Please let me know , if my understanding is correct.
>
> Regards
> Pralabh Kumar
>