You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2019/05/04 00:03:00 UTC

[jira] [Commented] (TEZ-3310) Handle splits grouping better when locality information is not available (or only when localhost is available)

    [ https://issues.apache.org/jira/browse/TEZ-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16832940#comment-16832940 ] 

Todd Lipcon commented on TEZ-3310:
----------------------------------

Sure enough, this is causing problems for pseudo-distributed-cluster testing. The "min split length" config gets ignored because all of the splits are on localhost, and thus queries have different behavior on this cluster than on a remote one.

> Handle splits grouping better when locality information is not available (or only when localhost is available)
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: TEZ-3310
>                 URL: https://issues.apache.org/jira/browse/TEZ-3310
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Rajesh Balamohan
>            Priority: Minor
>
> This is a follow up JIRA to TEZ-3291. TEZ-3291 tries to handle the case when only localhost is specified in the locations. It would be good to improve handling of splits grouping when Tez does not have enough information about the locality.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)