You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Yida Wu (Jira)" <ji...@apache.org> on 2021/12/21 00:57:00 UTC
[jira] [Resolved] (IMPALA-10944) Impala crashes with SIGSEGV in TmpFileGroup::Close() in release build
[ https://issues.apache.org/jira/browse/IMPALA-10944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yida Wu resolved IMPALA-10944.
------------------------------
Resolution: Fixed
The issue is due to IMPALA-10945, an unfavorable behavior change on the remote temporary path configuration, since IMPALA-10945 is fixed, the problem is resolved.
> Impala crashes with SIGSEGV in TmpFileGroup::Close() in release build
> ---------------------------------------------------------------------
>
> Key: IMPALA-10944
> URL: https://issues.apache.org/jira/browse/IMPALA-10944
> Project: IMPALA
> Issue Type: Bug
> Components: Backend
> Affects Versions: Impala 4.1.0
> Reporter: Joe McDonnell
> Assignee: Yida Wu
> Priority: Critical
>
> When running a workload on release docker images, the Impala coordinator crashed with the following stack:
> {code:java}
> Crash reason: SIGSEGV
> Crash address: 0x58
> Process uptime: not availableThread 514 (crashed)
> 0 impalad!impala::TmpFileGroup::Close() [atomic.h : 93 + 0x0]
> rax = 0x0000000000000000 rdx = 0x0000000000000000
> rcx = 0x000000000000000c rbx = 0x000000000e72ec00
> rsi = 0x000000000e72ec00 rdi = 0x000000000a00d100
> rbp = 0x00007f3ef6c73010 rsp = 0x00007f3ef6c73000
> r8 = 0x0000000000000000 r9 = 0x0000000000000000
> r10 = 0x000000000478ff4c r11 = 0x0000000000000000
> r12 = 0x0000000000000000 r13 = 0x000000000ae61600
> r14 = 0x00007f405ec8c7d0 r15 = 0x00007f3ff7d52d38
> rip = 0x000000000126e46b
> Found by: given as instruction pointer in context
> 1 impalad!impala::QueryState::ReleaseBackendResources() [query-state.cc : 137 + 0x5]
> rbx = 0x000000000a418400 rbp = 0x00007f3ef6c73030
> rsp = 0x00007f3ef6c73020 r12 = 0x000000000a418400
> r13 = 0x000000000ae61600 r14 = 0x00007f405ec8c7d0
> r15 = 0x00007f3ff7d52d38 rip = 0x000000000122632b
> Found by: call frame info
> 2 impalad!impala::QueryExecMgr::ExecuteQueryHelper(impala::QueryState*) [query-exec-mgr.cc : 178 + 0x8]
> rbx = 0x000000000a418400 rbp = 0x00007f3ef6c73870
> rsp = 0x00007f3ef6c73040 r12 = 0x000000000475a6c4
> r13 = 0x000000000ae61600 r14 = 0x00007f405ec8c7d0
> r15 = 0x00007f3ff7d52d38 rip = 0x0000000001221751
> Found by: call frame info
> 3 impalad!impala::Thread::SuperviseThread(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, boost::function<void ()>, impala::ThreadDebugInfo const*, impala::Promise<long, (impala::PromiseMode)0>*) [function_template.hpp : 770 + 0x7]
> rbx = 0x00007f3ef6c73900 rbp = 0x00007f3ef6c73b70
> rsp = 0x00007f3ef6c73880 r12 = 0x00007f3ef6c738e0
> r13 = 0x0000000006670ec0 r14 = 0x00007f405ec8c7d0
> r15 = 0x00007f3ff7d52d38 rip = 0x00000000017103e2
> Found by: call frame info
> 4 impalad!boost::detail::thread_data<boost::_bi::bind_t<void, void (*)(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, boost::function<void ()>, impala::ThreadDebugInfo const*, impala::Promise<long, (impala::PromiseMode)0>*), boost::_bi::list5<boost::_bi::value<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, boost::_bi::value<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, boost::_bi::value<boost::function<void ()> >, boost::_bi::value<impala::ThreadDebugInfo*>, boost::_bi::value<impala::Promise<long, (impala::PromiseMode)0>*> > > >::run() [bind.hpp : 531 + 0xc]
> rbx = 0x0000000009a10ac0 rbp = 0x00007f3ef6c73bd0
> rsp = 0x00007f3ef6c73b80 r12 = 0x00007f3ef6c73b80
> r13 = 0x00000000017100e0 r14 = 0x00007f3ff7d53900
> r15 = 0x00007f3ff7d52d00 rip = 0x000000000171185b
> Found by: call frame info
> 5 impalad!thread_proxy + 0x72
> rbx = 0x0000000009a10ac0 rbp = 0x000000000a00ce20
> rsp = 0x00007f3ef6c73be0 r12 = 0x0000000000000000
> r13 = 0x0000000000a01000 r14 = 0x0000000000000000
> r15 = 0x00007f3ef6c74700 rip = 0x0000000001f960c2
> Found by: call frame info
> 6 libpthread-2.17.so + 0x7ea5
> rbx = 0x0000000000000000 rbp = 0x0000000000000000
> rsp = 0x00007f3ef6c73c20 r12 = 0x0000000000000000
> r13 = 0x0000000000a01000 r14 = 0x0000000000000000
> r15 = 0x00007f3ef6c74700 rip = 0x00007f405ec87ea5
> Found by: call frame info
> 7 libc-2.17.so + 0xfe9fd
> rsp = 0x00007f3ef6c73cc0 rip = 0x00007f405b6e09fd
> Found by: stack scanning
> {code}
> We've seen this once so far, and it is intermittent. The Impalad coordinator logs don't show anything interesting:
> {noformat}
> Wrote minidump to /opt/impala/logs/minidumps/impalad/d55e48bf-9973-477e-f6bafabc-c77ece24.dmp
> A fatal error has been detected by the Java Runtime Environment:
> SIGSEGV (0xb) at pc=0x000000000126e46b, pid=1, tid=0x00007f3ef6c74700
> JRE version: OpenJDK Runtime Environment (8.0_302-b08) (build 1.8.0_302-b08)
> Java VM: OpenJDK 64-Bit Server VM (25.302-b08 mixed mode linux-amd64 compressed oops)
> Problematic frame:
> C [impalad+0xe6e46b] impala::TmpFileGroup::Close()+0x7b
> Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
> An error report file with more information is saved as:
> /opt/impala/hs_err_pid1.log{noformat}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)