You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@spark.apache.org by Sea aj <sa...@gmail.com> on 2017/06/22 15:58:16 UTC

How does Spark deal with Data Skewness?

Hi everyone,

I have read about some interesting ideas on how to manage skew but I was
not sure if any of these techniques are being used in Spark 2.x versions or
not? To name a few, "Salting the Data" and "Dynamic Repartitioning" are
techniques introduced in Spark Summits. I am really curious to know whether
if Spark takes care of skew at all or not?





  <https://mailtrack.io/> Sent with Mailtrack
<https://mailtrack.io/install?source=signature&lang=en&referral=saj3saj@gmail.com&idSignature=22>