Posted to issues@spark.apache.org by "Yanbo Liang (JIRA)" <ji...@apache.org> on 2015/09/30 18:15:04 UTC

[jira] [Comment Edited] (SPARK-9695) Add random seed Param to ML Pipeline

    [ https://issues.apache.org/jira/browse/SPARK-9695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14937046#comment-14937046 ] 

Yanbo Liang edited comment on SPARK-9695 at 9/30/15 4:15 PM:
-------------------------------------------------------------

[~josephkb]
I agree with making HasSeed the main API for deciding whether an algorithm takes a seed.
I propose that Pipeline extend HasSeed, so that users can call setSeed on the whole pipeline. Once a seed is set on the pipeline, it is passed on to every meta-algorithm that implements HasSeed.
That gives us reproducible results for entire pipelines. Please correct me if I have misunderstood anything.
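
To make the proposal above concrete, here is a minimal standalone sketch in plain Python. It does not use Spark and is not the Spark ML API: the classes HasSeed, Pipeline, Tokenizer and RandomSplitter, and the helper run, are hypothetical stand-ins written only to illustrate the idea that setting a seed on the pipeline forwards it to every stage that mixes in HasSeed, while stages without a seed Param are left untouched.

```python
import random


class HasSeed:
    """Toy stand-in for the HasSeed shared Param trait (not the Spark class)."""

    def __init__(self):
        self.seed = None

    def set_seed(self, seed):
        self.seed = seed
        return self


class Tokenizer:
    """Hypothetical deterministic stage: it has no seed Param at all."""


class RandomSplitter(HasSeed):
    """Hypothetical randomized stage that draws from its own seeded generator."""

    def sample(self, n):
        rng = random.Random(self.seed)
        return [rng.random() for _ in range(n)]


class Pipeline(HasSeed):
    """Toy pipeline that also mixes in HasSeed, as proposed in the comment."""

    def __init__(self, stages):
        super().__init__()
        self.stages = stages

    def set_seed(self, seed):
        # Record the seed on the pipeline itself, then forward it to every
        # stage that declares a seed. A nested Pipeline is also a HasSeed,
        # so the seed propagates recursively.
        super().set_seed(seed)
        for stage in self.stages:
            if isinstance(stage, HasSeed):
                stage.set_seed(seed)
        return self


def run(seed):
    """Build a pipeline, seed it once, and collect draws from seeded stages."""
    pipeline = Pipeline([Tokenizer(), RandomSplitter(), RandomSplitter()])
    pipeline.set_seed(seed)
    return [s.sample(3) for s in pipeline.stages if isinstance(s, HasSeed)]
```

Under these assumptions, calling run twice with the same seed returns the same draws. Because every seeded stage receives the identical seed, the two RandomSplitter stages in this sketch also produce identical draws; whether that is desirable, or whether per-stage seeds should be derived from the pipeline seed, is part of what would need discussion.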


was (Author: yanboliang):
[~josephkb]
I agree with making HasSeed the main API for deciding whether an algorithm takes a seed.
I propose that PipelineStage extend HasSeed, so that all meta-algorithms get the same seed and entire pipelines produce reproducible results. Please correct me if I have misunderstood anything.

> Add random seed Param to ML Pipeline
> ------------------------------------
>
>                 Key: SPARK-9695
>                 URL: https://issues.apache.org/jira/browse/SPARK-9695
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Joseph K. Bradley
>
> Note this will require some discussion about whether to make HasSeed the main API for whether an algorithm takes a seed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
