You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Andrew Wong (Jira)" <ji...@apache.org> on 2020/07/08 01:00:03 UTC

[jira] [Comment Edited] (KUDU-3154) RangerClientTestBase.TestLogging sometimes fails

    [ https://issues.apache.org/jira/browse/KUDU-3154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17153156#comment-17153156 ] 

Andrew Wong edited comment on KUDU-3154 at 7/8/20, 12:59 AM:
-------------------------------------------------------------

I was able to reproduce this on an Ubuntu 14.04 container with openJDK 8.

Running jstack (seeĀ [^kudu-3154_jstacks.txt] on the subprocess, I saw the following:
{code:java}
"policyDownloadTimer" #14 daemon prio=5 os_prio=0 tid=0x00007f71c0941000 nid=0x4c8de in Object.wait() [0x00007f716f4e2000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000006c0235468> (a org.apache.ranger.plugin.util.DownloadTrigger)
        at java.lang.Object.wait(Object.java:502)
        at org.apache.ranger.plugin.util.DownloadTrigger.waitForCompletion(DownloadTrigger.java:27)
        - locked <0x00000006c0235468> (a org.apache.ranger.plugin.util.DownloadTrigger)
        at org.apache.ranger.plugin.util.DownloaderTask.run(DownloaderTask.java:42)
        at java.util.TimerThread.mainLoop(Timer.java:555)
        at java.util.TimerThread.run(Timer.java:505)
{code}
This appears to be [waiting|https://github.com/apache/ranger/blob/master/agents-common/src/main/java/org/apache/ranger/plugin/util/DownloadTrigger.java#L25] for the [download trigger to fire|https://github.com/apache/ranger/blob/95a1aa57a96b392227a4f0aa4e7ecc396dca2261/agents-common/src/main/java/org/apache/ranger/plugin/util/PolicyRefresher.java#L214], but for some reason that isn't happening. It's true that the policy refresher in my logs doesn't successfully refresh the policy, but the trigger should still fire.


was (Author: andrew.wong):
I was able to reproduce this on an Ubuntu 14.04 container with openJDK 8.

Running jstack (seeĀ [^kudu-3154_jstacks.txt] on the subprocess, I saw the following:
{code:java}
"policyDownloadTimer" #14 daemon prio=5 os_prio=0 tid=0x00007f71c0941000 nid=0x4c8de in Object.wait() [0x00007f716f4e2000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000006c0235468> (a org.apache.ranger.plugin.util.DownloadTrigger)
        at java.lang.Object.wait(Object.java:502)
        at org.apache.ranger.plugin.util.DownloadTrigger.waitForCompletion(DownloadTrigger.java:27)
        - locked <0x00000006c0235468> (a org.apache.ranger.plugin.util.DownloadTrigger)
        at org.apache.ranger.plugin.util.DownloaderTask.run(DownloaderTask.java:42)
        at java.util.TimerThread.mainLoop(Timer.java:555)
        at java.util.TimerThread.run(Timer.java:505)
{code}
This appears to be waiting for the [download trigger to fire|https://github.com/apache/ranger/blob/95a1aa57a96b392227a4f0aa4e7ecc396dca2261/agents-common/src/main/java/org/apache/ranger/plugin/util/PolicyRefresher.java#L214], but for some reason that isn't happening. It's true that the policy refresher in my logs doesn't successfully refresh the policy, but the trigger should still fire.

> RangerClientTestBase.TestLogging sometimes fails
> ------------------------------------------------
>
>                 Key: KUDU-3154
>                 URL: https://issues.apache.org/jira/browse/KUDU-3154
>             Project: Kudu
>          Issue Type: Bug
>          Components: ranger, test
>    Affects Versions: 1.13.0
>            Reporter: Alexey Serbin
>            Priority: Major
>         Attachments: kudu-3154_jstacks.txt, ranger_client-test.txt, ranger_client-test.txt.xz
>
>
> The {{RangerClientTestBase.TestLogging}} scenario of the {{ranger_client-test}} sometimes fails (all types of builds) with error message like below:
> {noformat}
> src/kudu/ranger/ranger_client-test.cc:398: Failure
> Failed                                                                          
> Bad status: Timed out: timed out while in flight                                
> I0620 07:06:02.907177  1140 server.cc:247] Received an EOF from the subprocess  
> I0620 07:06:02.910923  1137 server.cc:317] get failed, inbound queue shut down: Aborted:
> I0620 07:06:02.910964  1141 server.cc:380] outbound queue shut down: Aborted:   
> I0620 07:06:02.910995  1138 server.cc:317] get failed, inbound queue shut down: Aborted:
> I0620 07:06:02.910984  1139 server.cc:317] get failed, inbound queue shut down: Aborted:
> {noformat}
> The log is attached.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)