You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Steven Wong <sw...@netflix.com> on 2011/11/01 08:44:06 UTC
hive.map.aggr
I have a query doing JOIN and GROUP BY:
SELECT ...
FROM x JOIN y ON (...)
GROUP BY ...;
In the first MR job, the reduce phase performs hash aggregation when hive.map.aggr=true, but the reduce phase doesn't perform hash aggregation when hive.map.aggr=false. Why does hive.map.aggr affect the reduce phase behavior when its name and description indicate it controls map-side aggregation?
<property>
<name>hive.map.aggr</name>
<value>true</value>
<description>Whether to use map-side aggregation in Hive Group By queries</description>
</property>
Thanks.
Steven