You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-user@hadoop.apache.org by Viswanathan J <ja...@gmail.com> on 2013/08/29 12:14:37 UTC

Fwd: Pig GROUP operator - Data is shuffled and wind up together for the same grouping key

Appreciate the response.  I'm facing this issue in prod.

---------- Forwarded message ----------
From: Viswanathan J <ja...@gmail.com>
Date: Thu, Aug 29, 2013 at 2:00 PM
Subject: Pig GROUP operator - Data is shuffled and wind up together for the
same grouping key
To: "user@pig.apache.org" <us...@pig.apache.org>


Hi,

I'm using pig version 0.11.0

While using GROUP operator in Pig all the data is shuffled, so that rows in
different partitions that have the same grouping key wind up together and
got wrong results for grouping.

While storing the result data, it is share work between multiple
calculations.

How to solve this? Please advice.

-- 
Regards,
Viswa.J



-- 
Regards,
Viswa.J