You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Robert Kanter (JIRA)" <ji...@apache.org> on 2017/09/27 20:59:02 UTC

[jira] [Created] (YARN-7262) Add a hierarchy into the ZKRMStateStore for delegation token znodes to prevent jute buffer overflow

Robert Kanter created YARN-7262:
-----------------------------------

             Summary: Add a hierarchy into the ZKRMStateStore for delegation token znodes to prevent jute buffer overflow
                 Key: YARN-7262
                 URL: https://issues.apache.org/jira/browse/YARN-7262
             Project: Hadoop YARN
          Issue Type: Improvement
    Affects Versions: 2.6.0
            Reporter: Robert Kanter
            Assignee: Robert Kanter


We've seen users who are running into a problem where the RM is storing so many delegation tokens in the {{ZKRMStateStore}} that the _listing_ of those znodes is higher than the jute buffer. This is fine during operations, but becomes a problem on a fail over because the RM will try to read in all of the token znodes (i.e. call {{getChildren}} on the parent znode).  This is particularly bad because everything appears to be okay, but then if a failover occurs you end up with no active RMs.

There was a similar problem with the Yarn application data that was fixed in YARN-2962 by adding a (configurable) hierarchy of znodes so the RM could pull subchildren without overflowing the jute buffer (though it's off by default).
We should add a hierarchy similar to that of YARN-2962, but for the delegation token znodes.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org