You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "jianjin qin (Jira)" <ji...@apache.org> on 2021/05/10 05:27:00 UTC
[jira] [Updated] (YARN-10765) Fair scheduler continuous scheduling
thread crashes while sorting nodes
[ https://issues.apache.org/jira/browse/YARN-10765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
jianjin qin updated YARN-10765:
-------------------------------
Attachment: (was: 0001-fix-fair-scheduler-continuous-scheduling-thread-cras.patch)
> Fair scheduler continuous scheduling thread crashes while sorting nodes
> -----------------------------------------------------------------------
>
> Key: YARN-10765
> URL: https://issues.apache.org/jira/browse/YARN-10765
> Project: Hadoop YARN
> Issue Type: Bug
> Components: scheduler
> Affects Versions: 2.7.0, 2.7.1
> Reporter: jianjin qin
> Priority: Major
> Fix For: 3.0.0
>
> Attachments: YARN-10765-001.patch
>
>
> NodeAvailableResourceComparator use variable nodes which contains to compare, but nodeIdList is not same with nodes.keySet() during sort. when node removed before sort, such as n2,n3 are removed, compare(n2, n3)=1 and compare(n3, n2)=1, this violates the reflexivity of comparator, we may get the exception: java.lang.IllegalArgumentException: Comparison method violates its general contract!
> {code:java}
> 2021-01-19 10:22:55,102 ERROR org.apache.hadoop.yarn.YarnUncaughtExceptionHandler: Thread Thread[FairSchedulerContinuousScheduling,5,main] threw an Exception.
> java.lang.IllegalArgumentException: Comparison method violates its general contract!
> at java.util.TimSort.mergeLo(TimSort.java:747)
> at java.util.TimSort.mergeAt(TimSort.java:483)
> at java.util.TimSort.mergeCollapse(TimSort.java:410)
> at java.util.TimSort.sort(TimSort.java:214)
> at java.util.TimSort.sort(TimSort.java:173)
> at java.util.Arrays.sort(Arrays.java:659)
> at java.util.Collections.sort(Collections.java:217)
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.continuousSchedulingAttempt(FairScheduler.java:1069)
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler$ContinuousSchedulingThread.run(FairScheduler.java:322)
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org