You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2016/09/20 14:57:20 UTC

[jira] [Commented] (SPARK-10835) Change Output of NGram to Array(String, True)

    [ https://issues.apache.org/jira/browse/SPARK-10835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15506806#comment-15506806 ] 

Sean Owen commented on SPARK-10835:
-----------------------------------

[~yuhaoyan] just reviving this one because it came up on the list. I think it's of course straightforward to also change NGram, but it's also not going to actually output null arrays. It seems like we could have just fixed Word2Vec to not reject non-nullable array input types? or was the point that a similar problem would still exist for UDFs?

> Change Output of NGram to Array(String, True)
> ---------------------------------------------
>
>                 Key: SPARK-10835
>                 URL: https://issues.apache.org/jira/browse/SPARK-10835
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>            Reporter: Sumit Chawla
>            Assignee: yuhao yang
>            Priority: Minor
>
> Currently output type of NGram is Array(String, false), which is not compatible with LDA  since their input type is Array(String, true). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org