You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Xusen Yin (JIRA)" <ji...@apache.org> on 2016/04/01 09:26:25 UTC

[jira] [Updated] (SPARK-14299) Scala ML examples code merge and clean up

     [ https://issues.apache.org/jira/browse/SPARK-14299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xusen Yin updated SPARK-14299:
------------------------------
    Description: 
Duplicated code that I found in scala/examples/ml:

* scala/ml
** CrossValidatorExample.scala --> ModelSelectionViaCrossValidationExample
** TrainValidationSplitExample.scala --> ModelSelectionViaTrainValidationSplitExample
** DeveloperApiExample.scala --> I delete it for now because it's only about how to create your own classifieri, etc, which can be learned easily from other examples and ml codes.
** SimpleParamsExample.scala --> merge with LogisticRegressionSummaryExample.scala
** SimpleTextClassificationPipeline.scala --> ModelSelectionViaCrossValidationExample
** DataFrameExample.scala --> merge with LogisticRegressionSummaryExample.scala

* Intend to reserve with command-line support:
** DecisionTreeExample.scala --> DecisionTreeRegressionExample, DecisionTreeClassificationExample
** GBTExample.scala --> GradientBoostedTreeClassifierExample, GradientBoostedTreeRegressorExample
** LinearRegressionExample.scala --> LinearRegressionWithElasticNetExample
** LogisticRegressionExample.scala --> LogisticRegressionWithElasticNetExample, LogisticRegressionSummaryExample
** RandomForestExample.scala --> RandomForestRegressorExample, RandomForestClassifierExample

When merging and cleaning those code, be sure not disturb the previous example on and off blocks.

I'll take this one as an example. 

  was:
Duplicated code that I found in scala/examples/ml:

* scala/ml
** CrossValidatorExample.scala --> ModelSelectionViaCrossValidationExample
** DecisionTreeExample.scala --> DecisionTreeRegressionExample, DecisionTreeClassificationExample
** GBTExample.scala --> GradientBoostedTreeClassifierExample, GradientBoostedTreeRegressorExample
** LinearRegressionExample.scala --> LinearRegressionWithElasticNetExample
** LogisticRegressionExample.scala --> LogisticRegressionWithElasticNetExample, LogisticRegressionSummaryExample
** RandomForestExample.scala --> RandomForestRegressorExample, RandomForestClassifierExample
** TrainValidationSplitExample.scala --> ModelSelectionViaTrainValidationSplitExample
** DeveloperApiExample.scala --> I delete it for now because it's only about how to create your own classifieri, etc, which can be learned easily from other examples and ml codes.
** SimpleParamsExample.scala --> merge with LogisticRegressionSummaryExample.scala
** SimpleTextClassificationPipeline.scala --> ModelSelectionViaCrossValidationExample
** DataFrameExample.scala --> merge with LogisticRegressionSummaryExample.scala

When merging and cleaning those code, be sure not disturb the previous example on and off blocks.

I'll take this one as an example. 


> Scala ML examples code merge and clean up
> -----------------------------------------
>
>                 Key: SPARK-14299
>                 URL: https://issues.apache.org/jira/browse/SPARK-14299
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Examples
>            Reporter: Xusen Yin
>            Priority: Minor
>              Labels: starter
>
> Duplicated code that I found in scala/examples/ml:
> * scala/ml
> ** CrossValidatorExample.scala --> ModelSelectionViaCrossValidationExample
> ** TrainValidationSplitExample.scala --> ModelSelectionViaTrainValidationSplitExample
> ** DeveloperApiExample.scala --> I delete it for now because it's only about how to create your own classifieri, etc, which can be learned easily from other examples and ml codes.
> ** SimpleParamsExample.scala --> merge with LogisticRegressionSummaryExample.scala
> ** SimpleTextClassificationPipeline.scala --> ModelSelectionViaCrossValidationExample
> ** DataFrameExample.scala --> merge with LogisticRegressionSummaryExample.scala
> * Intend to reserve with command-line support:
> ** DecisionTreeExample.scala --> DecisionTreeRegressionExample, DecisionTreeClassificationExample
> ** GBTExample.scala --> GradientBoostedTreeClassifierExample, GradientBoostedTreeRegressorExample
> ** LinearRegressionExample.scala --> LinearRegressionWithElasticNetExample
> ** LogisticRegressionExample.scala --> LogisticRegressionWithElasticNetExample, LogisticRegressionSummaryExample
> ** RandomForestExample.scala --> RandomForestRegressorExample, RandomForestClassifierExample
> When merging and cleaning those code, be sure not disturb the previous example on and off blocks.
> I'll take this one as an example. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org