
Posted to issues@spark.apache.org by "Yanbo Liang (JIRA)" <ji...@apache.org> on 2015/07/09 03:45:04 UTC

[jira] [Commented] (SPARK-8345) Add an SQL node as a feature transformer

    [ https://issues.apache.org/jira/browse/SPARK-8345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14619725#comment-14619725 ] 

Yanbo Liang commented on SPARK-8345:
------------------------------------

[~mengxr] I will work on it.

> Add an SQL node as a feature transformer
> ----------------------------------------
>
>                 Key: SPARK-8345
>                 URL: https://issues.apache.org/jira/browse/SPARK-8345
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Xiangrui Meng
>
> Some simple feature transformations can leverage SQL operators, so users do not need to create an ML transformer for each of them. We can have an SQL transformer that executes an SQL statement operating on the input DataFrame.
> {code}
> val sql = new SQL()
>   .setStatement("SELECT *, length(text) AS text_length FROM __THIS__")
> {code}
> where "__THIS__" will be replaced by the name of a temp table that represents the input DataFrame.
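
The "__THIS__" substitution described in the issue could be sketched roughly as follows. This is only an illustration of the string-rewriting step, with no Spark dependency; the object name, `randomTableName`, and `resolveStatement` are hypothetical helpers made up for this sketch and are not taken from the issue or from any Spark API. A real transformer would additionally register the input DataFrame under the generated name before running the resolved statement.

```scala
import java.util.UUID

object ThisPlaceholderSketch {
  // Placeholder token as written in the issue description.
  val TableIdentifier = "__THIS__"

  // Hypothetical helper: build a unique temp table name so that
  // concurrent transformations do not collide on the same name.
  def randomTableName(): String =
    "sql_transformer_" + UUID.randomUUID().toString.replace("-", "")

  // Hypothetical helper: substitute every occurrence of the placeholder
  // with the given table name. Fails fast if the placeholder is absent,
  // since the statement would then not reference the input at all.
  def resolveStatement(statement: String, tableName: String): String = {
    require(statement.contains(TableIdentifier),
      s"statement must reference $TableIdentifier")
    statement.replace(TableIdentifier, tableName)
  }

  def main(args: Array[String]): Unit = {
    val statement = "SELECT *, length(text) AS text_length FROM __THIS__"
    // prints: SELECT *, length(text) AS text_length FROM tmp_input
    println(resolveStatement(statement, "tmp_input"))
  }
}
```

Generating the table name per call, rather than using a fixed one, is a design assumption of this sketch; the issue itself only says that the placeholder is replaced by a temp table.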


