You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@slider.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2017/11/06 22:34:06 UTC

[jira] [Commented] (SLIDER-1199) Blacklist nodes that exceed the node failure threshold for a role

    [ https://issues.apache.org/jira/browse/SLIDER-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16241063#comment-16241063 ] 

Hudson commented on SLIDER-1199:
--------------------------------

SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #13193 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/13193/])
YARN-6185. Apply SLIDER-1199 to yarn native services for blacklisting (jianhe: rev 500695d7260689ef77f075c64eef69f684722b29)
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/actions/ResetFailureWindow.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/providers/AbstractProviderService.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/state/AppState.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/operations/AsyncRMOperationHandler.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/state/NodeInstance.java
* (add) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/operations/UpdateBlacklistOperation.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/state/RoleHistory.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/operations/ProviderNotifyingOperationHandler.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/operations/RMOperationHandlerActions.java
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-slider/hadoop-yarn-slider-core/src/main/java/org/apache/slider/server/appmaster/SliderAppMaster.java


> Blacklist nodes that exceed the node failure threshold for a role
> -----------------------------------------------------------------
>
>                 Key: SLIDER-1199
>                 URL: https://issues.apache.org/jira/browse/SLIDER-1199
>             Project: Slider
>          Issue Type: Bug
>          Components: appmaster
>            Reporter: Billie Rinaldi
>            Assignee: Billie Rinaldi
>             Fix For: Slider 0.92
>
>         Attachments: SLIDER-1199.1.patch, SLIDER-1199.2.patch, SLIDER-1199.3.patch, SLIDER-1199.4.patch, SLIDER-1199.5.patch
>
>
> From the code, it seems like when the node failure threshold for a role is exceeded, that node is no longer suggested for placement. But there is nothing preventing the RM from selecting the node again. If the node were blacklisted, perhaps that would prevent new allocations on problem nodes.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)