Posted to issues-all@impala.apache.org by "Joe McDonnell (Jira)" <ji...@apache.org> on 2021/09/30 21:00:03 UTC

[jira] [Updated] (IMPALA-10944) Impala crashes with SIGSEGV in TmpFileGroup::Close() in release build

     [ https://issues.apache.org/jira/browse/IMPALA-10944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joe McDonnell updated IMPALA-10944:
-----------------------------------
    Priority: Critical  (was: Major)

> Impala crashes with SIGSEGV in TmpFileGroup::Close() in release build
> ---------------------------------------------------------------------
>
>                 Key: IMPALA-10944
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10944
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: Impala 4.1.0
>            Reporter: Joe McDonnell
>            Priority: Critical
>
> When running a workload on release docker images, the Impala coordinator crashed with the following stack:
> {code:java}
> Crash reason:  SIGSEGV
> Crash address: 0x58
> Process uptime: not available
>
> Thread 514 (crashed)
>  0  impalad!impala::TmpFileGroup::Close() [atomic.h : 93 + 0x0]
>     rax = 0x0000000000000000   rdx = 0x0000000000000000
>     rcx = 0x000000000000000c   rbx = 0x000000000e72ec00
>     rsi = 0x000000000e72ec00   rdi = 0x000000000a00d100
>     rbp = 0x00007f3ef6c73010   rsp = 0x00007f3ef6c73000
>      r8 = 0x0000000000000000    r9 = 0x0000000000000000
>     r10 = 0x000000000478ff4c   r11 = 0x0000000000000000
>     r12 = 0x0000000000000000   r13 = 0x000000000ae61600
>     r14 = 0x00007f405ec8c7d0   r15 = 0x00007f3ff7d52d38
>     rip = 0x000000000126e46b
>     Found by: given as instruction pointer in context
>  1  impalad!impala::QueryState::ReleaseBackendResources() [query-state.cc : 137 + 0x5]
>     rbx = 0x000000000a418400   rbp = 0x00007f3ef6c73030
>     rsp = 0x00007f3ef6c73020   r12 = 0x000000000a418400
>     r13 = 0x000000000ae61600   r14 = 0x00007f405ec8c7d0
>     r15 = 0x00007f3ff7d52d38   rip = 0x000000000122632b
>     Found by: call frame info
>  2  impalad!impala::QueryExecMgr::ExecuteQueryHelper(impala::QueryState*) [query-exec-mgr.cc : 178 + 0x8]
>     rbx = 0x000000000a418400   rbp = 0x00007f3ef6c73870
>     rsp = 0x00007f3ef6c73040   r12 = 0x000000000475a6c4
>     r13 = 0x000000000ae61600   r14 = 0x00007f405ec8c7d0
>     r15 = 0x00007f3ff7d52d38   rip = 0x0000000001221751
>     Found by: call frame info
>  3  impalad!impala::Thread::SuperviseThread(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, boost::function<void ()>, impala::ThreadDebugInfo const*, impala::Promise<long, (impala::PromiseMode)0>*) [function_template.hpp : 770 + 0x7]
>     rbx = 0x00007f3ef6c73900   rbp = 0x00007f3ef6c73b70
>     rsp = 0x00007f3ef6c73880   r12 = 0x00007f3ef6c738e0
>     r13 = 0x0000000006670ec0   r14 = 0x00007f405ec8c7d0
>     r15 = 0x00007f3ff7d52d38   rip = 0x00000000017103e2
>     Found by: call frame info
>  4  impalad!boost::detail::thread_data<boost::_bi::bind_t<void, void (*)(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, boost::function<void ()>, impala::ThreadDebugInfo const*, impala::Promise<long, (impala::PromiseMode)0>*), boost::_bi::list5<boost::_bi::value<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, boost::_bi::value<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, boost::_bi::value<boost::function<void ()> >, boost::_bi::value<impala::ThreadDebugInfo*>, boost::_bi::value<impala::Promise<long, (impala::PromiseMode)0>*> > > >::run() [bind.hpp : 531 + 0xc]
>     rbx = 0x0000000009a10ac0   rbp = 0x00007f3ef6c73bd0
>     rsp = 0x00007f3ef6c73b80   r12 = 0x00007f3ef6c73b80
>     r13 = 0x00000000017100e0   r14 = 0x00007f3ff7d53900
>     r15 = 0x00007f3ff7d52d00   rip = 0x000000000171185b
>     Found by: call frame info
>  5  impalad!thread_proxy + 0x72
>     rbx = 0x0000000009a10ac0   rbp = 0x000000000a00ce20
>     rsp = 0x00007f3ef6c73be0   r12 = 0x0000000000000000
>     r13 = 0x0000000000a01000   r14 = 0x0000000000000000
>     r15 = 0x00007f3ef6c74700   rip = 0x0000000001f960c2
>     Found by: call frame info
>  6  libpthread-2.17.so + 0x7ea5
>     rbx = 0x0000000000000000   rbp = 0x0000000000000000
>     rsp = 0x00007f3ef6c73c20   r12 = 0x0000000000000000
>     r13 = 0x0000000000a01000   r14 = 0x0000000000000000
>     r15 = 0x00007f3ef6c74700   rip = 0x00007f405ec87ea5
>     Found by: call frame info
>  7  libc-2.17.so + 0xfe9fd
>     rsp = 0x00007f3ef6c73cc0   rip = 0x00007f405b6e09fd
>     Found by: stack scanning
> {code}
> We've seen this only once so far, so it appears to be intermittent. The impalad coordinator logs don't show anything interesting:
> {noformat}
> Wrote minidump to /opt/impala/logs/minidumps/impalad/d55e48bf-9973-477e-f6bafabc-c77ece24.dmp
> A fatal error has been detected by the Java Runtime Environment:
>   SIGSEGV (0xb) at pc=0x000000000126e46b, pid=1, tid=0x00007f3ef6c74700
> JRE version: OpenJDK Runtime Environment (8.0_302-b08) (build 1.8.0_302-b08)
> Java VM: OpenJDK 64-Bit Server VM (25.302-b08 mixed mode linux-amd64 compressed oops)
> Problematic frame:
> C  [impalad+0xe6e46b]  impala::TmpFileGroup::Close()+0x7b
> Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
> An error report file with more information is saved as:
> /opt/impala/hs_err_pid1.log
> {noformat}
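
As a reading aid for the dump above, here is a minimal C++ sketch that cross-checks two numbers from the report: the faulting pc against the module-relative offset in the "Problematic frame" line, and the crash address against a simple low-address heuristic. The helper names and the 4096-byte threshold are assumptions made for illustration; none of this is Impala code, and it does not identify which pointer in TmpFileGroup::Close() was involved.

```cpp
#include <cassert>
#include <cstdint>

// Values copied verbatim from the report.
constexpr std::uint64_t kCrashPc = 0x000000000126e46bULL;  // rip in frame 0 / pc in the SIGSEGV line
constexpr std::uint64_t kModuleOffset = 0xe6e46bULL;       // "impalad+0xe6e46b" in "Problematic frame"
constexpr std::uint64_t kCrashAddress = 0x58ULL;           // "Crash address"

// Hypothetical helper: the load base implied by the absolute pc and the
// module-relative offset reported for the same frame.
constexpr std::uint64_t ImpliedLoadBase(std::uint64_t pc, std::uint64_t offset) {
  return pc - offset;
}

// Hypothetical heuristic: a fault address inside the first page (4096 bytes
// is an assumed page size) is what a small-offset access through a null
// pointer would typically produce. This is a hint, not a diagnosis.
constexpr bool LooksLikeNullPlusOffset(std::uint64_t fault_addr) {
  return fault_addr < 4096;
}

// The two lines of the report agree with each other: both place the fault
// at the same instruction, with impalad mapped at 0x400000.
static_assert(ImpliedLoadBase(kCrashPc, kModuleOffset) == 0x400000ULL,
              "pc and module offset should describe the same instruction");
```

With these values, `LooksLikeNullPlusOffset(kCrashAddress)` evaluates to true, which fits the zeroed rax and rdx in frame 0 but does not by itself show which object was null.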


