Posted to user@hive.apache.org by Steven Wong <sw...@netflix.com> on 2011/11/01 08:44:06 UTC

hive.map.aggr

I have a query doing JOIN and GROUP BY:

SELECT ...
FROM x JOIN y ON (...)
GROUP BY ...;

In the first MR job, the reduce phase performs hash aggregation when hive.map.aggr=true, but not when hive.map.aggr=false. Why does hive.map.aggr affect reduce-phase behavior when its name and description say it controls map-side aggregation?

<property>
  <name>hive.map.aggr</name>
  <value>true</value>
  <description>Whether to use map-side aggregation in Hive Group By queries</description>
</property>
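
To reproduce the comparison, something like the following should work. This is only a sketch: the columns x.k and y.k are hypothetical stand-ins for the parts of the query elided above, and COUNT(*) is just an example aggregate.

-- Plan with map-side aggregation enabled
SET hive.map.aggr=true;
EXPLAIN
SELECT x.k, COUNT(*)
FROM x JOIN y ON (x.k = y.k)
GROUP BY x.k;

-- Plan with map-side aggregation disabled
SET hive.map.aggr=false;
EXPLAIN
SELECT x.k, COUNT(*)
FROM x JOIN y ON (x.k = y.k)
GROUP BY x.k;

The thing to compare is the Group By Operator listed under the reduce side of the first stage in each plan, in particular the aggregation mode it reports. The exact layout of the EXPLAIN output may vary by Hive version.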

Thanks.
Steven