You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Edwin <al...@yahoo.com> on 2014/11/05 23:38:09 UTC

cache function is not working on RDD from parallelize

Hi,
On a 5 node cluster, say I have data on the driver application node,
and then I call parallelize on the data, I get a rdd back. 
However, when I call cache on the rdd the rdd won't be cached (I checked
that through timing on count the realized-cached rdd, take as long as before
it was realized). So does anyone have any idea on this?
Thanks
Edwin



--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/cache-function-is-not-working-on-RDD-from-parallelize-tp18219.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org