You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Lalitha Viswanathan (JIRA)" <ji...@apache.org> on 2016/07/01 23:13:10 UTC
[jira] [Commented] (TEZ-3206) Have unordered partitioned KV output
send partition stats via VertexManagerEvent
[ https://issues.apache.org/jira/browse/TEZ-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15359810#comment-15359810 ]
Lalitha Viswanathan commented on TEZ-3206:
------------------------------------------
Hi, I wanted hive.tez.auto.reducer.parallelism=true feature, in shuffle hash join (hive.optimize.dynamic.partition.hashjoin=true). Cloned the master branch, compiled and deployed the 0.9-SNAPSHOT binaries of tez, and re-tried. But I don't see the feature taking effect. Am I missing something? Thanks!
> Have unordered partitioned KV output send partition stats via VertexManagerEvent
> ---------------------------------------------------------------------------------
>
> Key: TEZ-3206
> URL: https://issues.apache.org/jira/browse/TEZ-3206
> Project: Apache Tez
> Issue Type: New Feature
> Reporter: Ming Ma
> Assignee: Ming Ma
> Fix For: 0.9.0
>
> Attachments: TEZ-3206-2.patch, TEZ-3206-3.patch, TEZ-3206-4.patch, TEZ-3206.patch
>
>
> As part of the auto-parallelism feature, ordered partitioned KV output's partition stats are sent to ShuffleVertexManager via VertexManagerEvent. But this isn't available for unordered partitioned output. Having {{UnorderedPartitionedKVWriter}} send partition stats will enable the auto-parallelism support for unordered KV or other custom data routing mechanisms that depend on partition size.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)