You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-dev@hadoop.apache.org by "Sahil Takiar (Jira)" <ji...@apache.org> on 2019/09/18 20:16:00 UTC

[jira] [Resolved] (HDFS-14846) libhdfs tests are failing on trunk due to jni usage bugs

     [ https://issues.apache.org/jira/browse/HDFS-14846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sahil Takiar resolved HDFS-14846.
---------------------------------
    Fix Version/s: 3.3.0
       Resolution: Fixed

> libhdfs tests are failing on trunk due to jni usage bugs
> --------------------------------------------------------
>
>                 Key: HDFS-14846
>                 URL: https://issues.apache.org/jira/browse/HDFS-14846
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: libhdfs, native
>            Reporter: Sahil Takiar
>            Assignee: Sahil Takiar
>            Priority: Major
>             Fix For: 3.3.0
>
>
> While working on HDFS-14564, I noticed that the libhdfs tests are failing on trunk (both on Hadoop QA and locally). I did some digging and found out that the {{-Xcheck:jni}} flag is causing a bunch of crashes. I haven't been able to pinpoint what caused this regression, but my best guess is that an upgrade in the JDK we use in Hadoop QA started causing these failures. I looked back at some old JIRAs and it looks like the tests work on Java 1.8.0_212, but Hadoop QA is running 1.8.0_222 (as is my local env) (I couldn't confirm this theory because I'm having trouble getting Java 1.8.0_212 installed next to 1.8.0_222 on my Ubuntu machine) (even after re-winding the commit history back to a known good commit where the libhdfs passed, the tests still fail, so I don't think a code change caused the regressions).
> The failures are a bunch of "FATAL ERROR in native method: Bad global or local ref passed to JNI" errors. After doing some debugging, it looks like {{-Xcheck:jni}} now errors out if any code tries to pass a local ref to {{DeleteLocalRef}} twice (previously it looked like it didn't complain) (we have some checks to avoid this, but it looks like they don't work as expected).
> There are a few places in the libhdfs code where this pattern causes a crash, as well as one place in {{JniBasedUnixGroupsMapping}}.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org