You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2017/03/15 15:47:41 UTC
[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926393#comment-15926393 ]
Sean Owen commented on SPARK-19962:
-----------------------------------
Can you describe what this does, and give an example of input/output? I'm not sure what this describes or whether it already exists in Spark.
> add DictVectorizor for DataFrame
> --------------------------------
>
> Key: SPARK-19962
> URL: https://issues.apache.org/jira/browse/SPARK-19962
> Project: Spark
> Issue Type: Wish
> Components: ML
> Affects Versions: 2.1.0
> Reporter: yu peng
> Labels: features
>
> it's really useful to have something like sklearn.feature_extraction.DictVectorizor
> Since out features lives in json/data frame like format and classifier/regressors only take vector input. so there is a gap between them.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org