You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by "1427357147@qq.com" <14...@qq.com> on 2018/03/20 07:59:29 UTC

the meaining of "samplePointsPerPartitionHint" in RangePartitioner

HI  all,

The belowing is the code of RangePartitioner.
class RangePartitioner[K : Ordering : ClassTag, V](
    partitions: Int,
    rdd: RDD[_ <: Product2[K, V]],
    private var ascending: Boolean = true,
    val samplePointsPerPartitionHint: Int = 20)
I feel puzzled about the samplePointsPerPartitionHint.
My issue is :
    what is the samplePointsPerPartitionHint used for please?
If I set samplePointsPerPartitionHint as 1000000 or 20,what will happed please?

Thanks.

Robin Shao




1427357147@qq.com