You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2015/10/27 16:03:27 UTC

[jira] [Assigned] (SPARK-11349) Support transform string label for RFormula

     [ https://issues.apache.org/jira/browse/SPARK-11349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-11349:
------------------------------------

    Assignee: Apache Spark

> Support transform string label for RFormula
> -------------------------------------------
>
>                 Key: SPARK-11349
>                 URL: https://issues.apache.org/jira/browse/SPARK-11349
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Yanbo Liang
>            Assignee: Apache Spark
>
> Currently RFormula can only handle label with NumericType or BinaryType (cast it to DoubleType as the label of Linear Regression training), we should also support label of StringType which is needed for Logistic Regression (glm with family = "binomial"). 
> For label of StringType, we should use StringIndexer to transform it to 0-based index.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org