You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spark.apache.org by Debasish Das <de...@gmail.com> on 2014/11/14 01:19:30 UTC
TimSort in 1.2
Hi,
I am noticing the first step for Spark jobs does a TimSort in 1.2
branch...and there is some time spent doing the TimSort...Is this assigning
the RDD blocks to different nodes based on a sort order ?
Could someone please point to a JIRA about this change so that I can read
more about it ?
Thanks.
Deb
Re: TimSort in 1.2
Posted by Reza Zadeh <re...@databricks.com>.
See https://issues.apache.org/jira/browse/SPARK-2045
and https://issues.apache.org/jira/browse/SPARK-3280
On Thu, Nov 13, 2014 at 4:19 PM, Debasish Das <de...@gmail.com>
wrote:
> Hi,
>
> I am noticing the first step for Spark jobs does a TimSort in 1.2
> branch...and there is some time spent doing the TimSort...Is this assigning
> the RDD blocks to different nodes based on a sort order ?
>
> Could someone please point to a JIRA about this change so that I can read
> more about it ?
>
> Thanks.
> Deb
>