You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Yann Luppo <Ya...@LiveNation.com> on 2013/10/23 23:01:55 UTC
JavaPairRDD unpersist
Hi,
I've noticed that in 0.8.0 JavaRDD got a new "unpersist" method. I was wondering if this method was going to be ported eventually to JavaPairRDD.
Is this somewhere on the road map? Or should we just change all of our JavaPairRDD to JavaRDDs w/ tuples for now?
Thanks,
Yann
Re: JavaPairRDD unpersist
Posted by Yann Luppo <Ya...@LiveNation.com>.
Perfect! Thank you Josh.
From: Josh Rosen <ro...@gmail.com>>
Reply-To: "user@spark.incubator.apache.org<ma...@spark.incubator.apache.org>" <us...@spark.incubator.apache.org>>
Date: Wednesday, October 23, 2013 2:19 PM
To: "user@spark.incubator.apache.org<ma...@spark.incubator.apache.org>" <us...@spark.incubator.apache.org>>
Subject: Re: JavaPairRDD unpersist
This is a bug; all three Java*RDD classes should have had unpersist() methods added to them, instead of just JavaRDD (https://github.com/apache/incubator-spark/commit/4a318774088f829fe54c3ef0b5f565a845631b4e). I'll submit a pull request to fix this. In 0.8, you can access the underlying Scala RDD and call unpersist() on that:
javaPairRDD.rdd().unpersist()
If you want to know the details of why we need to add unpersist() to each Java*RDD class, rather than adding it once in JavaRDDLike, check out the "implementation traits" section of https://cwiki.apache.org/confluence/display/SPARK/Java+API+Internals
- Josh
On Wed, Oct 23, 2013 at 2:01 PM, Yann Luppo <Ya...@livenation.com>> wrote:
Hi,
I've noticed that in 0.8.0 JavaRDD got a new "unpersist" method. I was wondering if this method was going to be ported eventually to JavaPairRDD.
Is this somewhere on the road map? Or should we just change all of our JavaPairRDD to JavaRDDs w/ tuples for now?
Thanks,
Yann
Re: JavaPairRDD unpersist
Posted by Josh Rosen <ro...@gmail.com>.
This is a bug; all three Java*RDD classes should have had unpersist()
methods added to them, instead of just JavaRDD (
https://github.com/apache/incubator-spark/commit/4a318774088f829fe54c3ef0b5f565a845631b4e).
I'll submit a pull request to fix this. In 0.8, you can access the
underlying Scala RDD and call unpersist() on that:
javaPairRDD.rdd().unpersist()
If you want to know the details of why we need to add unpersist() to each
Java*RDD class, rather than adding it once in JavaRDDLike, check out the
"implementation traits" section of
https://cwiki.apache.org/confluence/display/SPARK/Java+API+Internals
- Josh
On Wed, Oct 23, 2013 at 2:01 PM, Yann Luppo <Ya...@livenation.com>wrote:
> Hi,
>
> I've noticed that in 0.8.0 JavaRDD got a new "unpersist" method. I was
> wondering if this method was going to be ported eventually to JavaPairRDD.
> Is this somewhere on the road map? Or should we just change all of our
> JavaPairRDD to JavaRDDs w/ tuples for now?
>
> Thanks,
> Yann
>