You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2014/09/26 23:21:34 UTC

[jira] [Commented] (SPARK-3251) Clarify learning interfaces

    [ https://issues.apache.org/jira/browse/SPARK-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14150000#comment-14150000 ] 

Joseph K. Bradley commented on SPARK-3251:
------------------------------------------

I just linked JIRA related to this; that JIRA covers a variety of learning interfaces mentioned in a few JIRAs and PRs.  It would be great to get feedback about developing a high-level view of learning interfaces, especially in light of the MLlib API update being discussed (in the linked JIRA's parent).  Thanks!

>  Clarify learning interfaces
> ----------------------------
>
>                 Key: SPARK-3251
>                 URL: https://issues.apache.org/jira/browse/SPARK-3251
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 1.1.0, 1.1.1
>            Reporter: Christoph Sawade
>
> *Make threshold mandatory*
> Currently, the output of predict for an example is either the score
> or the class. This side-effect is caused by clearThreshold. To
> clarify that behaviour three different types of predict (predictScore,
> predictClass, predictProbabilty) were introduced; the threshold is not
> longer optional.
> *Clarify classification interfaces*
> Currently, some functionality is spreaded over multiple models.
> In order to clarify the structure and simplify the implementation of
> more complex models (like multinomial logistic regression), two new
> classes are introduced:
> - BinaryClassificationModel: for all models that derives a binary classification from a single weight vector. Comprises the tresholding functionality to derive a prediction from a score. It basically captures SVMModel and LogisticRegressionModel.
> - ProbabilitistClassificaitonModel: This trait defines the interface for models that return a calibrated confidence score (aka probability).
> *Misc*
> - some renaming
> - add test for probabilistic output



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org