You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@pig.apache.org by Alan Gates <al...@gmail.com> on 2016/07/06 16:05:13 UTC
Re: Optimally assigning reducers
My first guess is that your join has significant skew in the keys, so many are getting assigned to a single reducer. Have you tried the skew join algorithm[1]?
Alan.
1. https://pig.apache.org/docs/r0.16.0/perf.html#skewed-joins
> On Jul 6, 2016, at 08:55, Nigam, Vibhor <Vi...@comcast.com> wrote:
>
> Hi
>
> I am facing a problem in my pig script. It has a simple inner join and a grouping. However after around 70% of the script gets processed all the reduction process gets assigned to one reducer, which in turn increases the complete time of the script heavily.
>
> I need to use this script for automating the process which under given circumstances seems problematic. Kindly, let me know how can I overcome this and assign reducers optimally
>
> Best Regards
> Vibhor Nigam
> Product Engineer III
> TnP, Comcast
> 1717 Arch Street, Philadelphia
>
>