Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 03:59:34 UTC

[jira] [Updated] (SPARK-20632) Allow 'Column.getItem()' API to accept Vector columns

     [ https://issues.apache.org/jira/browse/SPARK-20632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon updated SPARK-20632:
---------------------------------
    Labels: bulk-closed  (was: )

> Allow 'Column.getItem()' API to accept Vector columns
> -----------------------------------------------------
>
>                 Key: SPARK-20632
>                 URL: https://issues.apache.org/jira/browse/SPARK-20632
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core
>    Affects Versions: 1.6.3, 2.1.1
>            Reporter: Kevin Ushey
>            Priority: Minor
>              Labels: bulk-closed
>
> The 'getItem()' API is handy for extracting values from Dataset columns of type 'ArrayType'. It would be useful if it also accepted 'Vector' columns, e.g. the probability columns generated by the various MLlib routines.
> If I understand correctly, users currently have to define a custom UDF to handle this case, and the types such a UDF needs for vector columns are not always obvious.
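
For reference, the UDF workaround the description mentions could look roughly like the PySpark sketch below. The names element_at and vector_item are hypothetical helpers, not part of Spark, and the column name "probability" in the usage comment is only an assumed example. The sketch assumes the UDF should return a DoubleType and converts the element to a plain Python float first, since a vector element may come back as a NumPy scalar rather than a Python float.

```python
from typing import Optional, Sequence


def element_at(vector: Optional[Sequence[float]], index: int) -> Optional[float]:
    """Return vector[index] as a plain Python float, or None if it is unavailable.

    Works on anything that supports len() and integer indexing; the sketch
    assumes that includes the vector values Spark passes to a Python UDF.
    """
    if vector is None:
        return None
    if not 0 <= index < len(vector):
        return None
    # Convert explicitly so the result matches a DoubleType return value.
    return float(vector[index])


try:
    # PySpark is optional here so the helper above can be tried without Spark.
    from pyspark.sql.functions import udf
    from pyspark.sql.types import DoubleType
except ImportError:
    udf = None

if udf is not None:
    def vector_item(index: int):
        """Build a UDF that extracts one element from a vector column (hypothetical helper)."""
        return udf(lambda v: element_at(v, index), DoubleType())

    # Assumed usage, given a DataFrame df with a vector column "probability":
    #   df = df.withColumn("p1", vector_item(1)(df["probability"]))
```

On Spark 3.0 and later, pyspark.ml.functions.vector_to_array can convert a vector column to an array column, after which 'getItem()' applies directly; the UDF above is only a fallback sketch for versions such as those listed in this issue.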



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
