Posted to issues@spark.apache.org by "Yin Huai (JIRA)" <ji...@apache.org> on 2015/04/15 06:01:58 UTC

[jira] [Commented] (SPARK-5881) RDD remains cached after the table gets overridden by "CACHE TABLE"

    [ https://issues.apache.org/jira/browse/SPARK-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14495621#comment-14495621 ] 

Yin Huai commented on SPARK-5881:
---------------------------------

[~lian cheng] [~marmbrus] I think we need to make some major changes to our cache manager to fix this. I am inclined to bump the target version.

> RDD remains cached after the table gets overridden by "CACHE TABLE"
> -------------------------------------------------------------------
>
>                 Key: SPARK-5881
>                 URL: https://issues.apache.org/jira/browse/SPARK-5881
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>            Reporter: Yin Huai
>            Priority: Critical
>
> {code}
> val rdd = sc.parallelize((1 to 10).map(i => s"""{"a":$i, "b":"str${i}"}"""))
> sqlContext.jsonRDD(rdd).registerTempTable("jt")
> sqlContext.sql("CACHE TABLE foo AS SELECT * FROM jt")
> sqlContext.sql("CACHE TABLE foo AS SELECT a FROM jt")
> {code}
> After the second CACHE TABLE command, the RDD cached for the first definition of foo remains in the cache.
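
One way to observe the behavior described above is to count the persistent RDDs registered with the SparkContext after each CACHE TABLE command. This is only a sketch: it assumes a Spark 1.3-era spark-shell where sc and sqlContext are predefined, and the names afterFirst and afterSecond are illustrative. The exact counts depend on what else is cached in the session, so no expected output is given.

```scala
// Assumption: run inside spark-shell, where `sc` and `sqlContext` already exist.
val rdd = sc.parallelize((1 to 10).map(i => s"""{"a":$i, "b":"str${i}"}"""))
sqlContext.jsonRDD(rdd).registerTempTable("jt")

sqlContext.sql("CACHE TABLE foo AS SELECT * FROM jt")
// Number of RDDs currently marked persistent in this SparkContext.
val afterFirst = sc.getPersistentRDDs.size

sqlContext.sql("CACHE TABLE foo AS SELECT a FROM jt")
val afterSecond = sc.getPersistentRDDs.size

// If the RDD backing the first definition of foo had been released,
// the count would not grow between the two commands.
println(s"persistent RDDs: $afterFirst -> $afterSecond")
```

The Storage tab of the Spark web UI should show the same information, with the RDD for the first definition of foo still listed.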


