You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2015/05/01 19:12:05 UTC
[jira] [Commented] (SPARK-6293) SQLContext.implicits should provide
automatic conversion for RDD[Row]
[ https://issues.apache.org/jira/browse/SPARK-6293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14523469#comment-14523469 ]
Joseph K. Bradley commented on SPARK-6293:
------------------------------------------
I don't think it will be an issue once the RDD methods are added to DataFrame
> SQLContext.implicits should provide automatic conversion for RDD[Row]
> ---------------------------------------------------------------------
>
> Key: SPARK-6293
> URL: https://issues.apache.org/jira/browse/SPARK-6293
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Affects Versions: 1.3.0
> Reporter: Joseph K. Bradley
>
> When a DataFrame is converted to an RDD[Row], it should be easier to convert it back to a DataFrame via toDF. E.g.:
> {code}
> val df: DataFrame = myRDD.toDF("col1", "col2") // This works for types like RDD[scala.Tuple2[...]]
> val splits = df.rdd.randomSplit(...)
> val split0: RDD[Row] = splits(0)
> val df0 = split0.toDF("col1", "col2") // This fails
> {code}
> The failure happens because SQLContext.implicits does not provide an automatic conversion for Rows. (It does handle Products, but Row does not implement Product.)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org