Posted to issues@impala.apache.org by "Tim Armstrong (JIRA)" <ji...@apache.org> on 2018/11/28 21:25:00 UTC
[jira] [Created] (IMPALA-7906) Crash in JVM PSPromotionManager::copy_to_survivor_space
Tim Armstrong created IMPALA-7906:
-------------------------------------
Summary: Crash in JVM PSPromotionManager::copy_to_survivor_space
Key: IMPALA-7906
URL: https://issues.apache.org/jira/browse/IMPALA-7906
Project: IMPALA
Issue Type: Bug
Components: Backend
Affects Versions: Impala 3.2.0
Reporter: Tim Armstrong
Assignee: Tim Armstrong
Attachments: hs_err_pid6290.log
{noformat}
#0 0x00007f44ca5261f7 in raise () from /lib64/libc.so.6
#1 0x00007f44ca5278e8 in abort () from /lib64/libc.so.6
#2 0x00007f44cd726185 in os::abort(bool) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#3 0x00007f44cd8c8593 in VMError::report_and_die() () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#4 0x00007f44cd8c8a7e in crash_handler(int, siginfo*, void*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#5 0x00007f44cd724f72 in os::Linux::chained_handler(int, siginfo*, void*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#6 0x00007f44cd72b5f6 in JVM_handle_linux_signal () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#7 0x00007f44cd721be3 in signalHandler(int, siginfo*, void*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#8 <signal handler called>
#9 0x00007f44cd713e95 in oopDesc::print_on(outputStream*) const () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#10 0x00007f44cd72afdb in os::print_register_info(outputStream*, void*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#11 0x00007f44cd8c6c13 in VMError::report(outputStream*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#12 0x00007f44cd8c818a in VMError::report_and_die() () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#13 0x00007f44cd72b68f in JVM_handle_linux_signal () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#14 0x00007f44cd721be3 in signalHandler(int, siginfo*, void*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#15 <signal handler called>
#16 0x00007f44cd78f562 in oopDesc* PSPromotionManager::copy_to_survivor_space<false>(oopDesc*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#17 0x00007f44cd7924a5 in PSRootsClosure<false>::do_oop(oopDesc**) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#18 0x00007f44cd716a96 in InterpreterOopMap::iterate_oop(OffsetClosure*) const () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#19 0x00007f44cd38f789 in frame::oops_interpreted_do(OopClosure*, CLDClosure*, RegisterMap const*, bool) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#20 0x00007f44cd86eaa1 in JavaThread::oops_do(OopClosure*, CLDClosure*, CodeBlobClosure*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#21 0x00007f44cd79270f in ThreadRootsTask::do_it(GCTaskManager*, unsigned int) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#22 0x00007f44cd3d7ecf in GCTaskThread::run() () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#23 0x00007f44cd727338 in java_start(Thread*) () from /usr/java/jdk1.8.0_144/jre/lib/amd64/server/libjvm.so
#24 0x00007f44ca8bbe25 in start_thread () from /lib64/libpthread.so.0
#25 0x00007f44ca5e934d in clone () from /lib64/libc.so.6
{noformat}
These are the tests that were running at the time of the crash:
{noformat}
06:53:04 [gw1] PASSED query_test/test_mem_usage_scaling.py::TestQueryMemLimitScaling::test_mem_usage_scaling[mem_limit: -1 | protocol: beeswax | exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0} | table_format: parquet/none]
06:53:07 query_test/test_mem_usage_scaling.py::TestQueryMemLimitScaling::test_mem_usage_scaling[mem_limit: 400m | protocol: beeswax | exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0} | table_format: parquet/none]
06:53:07 [gw5] PASSED query_test/test_analytic_tpcds.py::TestAnalyticTpcds::test_analytic_functions_tpcds[batch_size: 1 | protocol: beeswax | exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0} | table_format: parquet/none]
06:53:08 query_test/test_cancellation.py::TestCancellationParallel::test_cancel_select[protocol: beeswax | table_format: text/gzip/block | exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0} | query_type: SELECT | wait_action: 0:GETNEXT:WAIT | cancel_delay: 0.01 | cpu_limit_s: 100000 | query: select * from lineitem limit 50 | fail_rpc_action: COORD_CANCEL_QUERY_FINSTANCES_RPC:FAIL | join_before_close: True | buffer_pool_limit: 0]
06:53:08 [gw5] PASSED query_test/test_cancellation.py::TestCancellationParallel::test_cancel_select[protocol: beeswax | table_format: text/gzip/block | exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0} | query_type: SELECT | wait_action: 0:GETNEXT:WAIT | cancel_delay: 0.01 | cpu_limit_s: 100000 | query: select * from lineitem limit 50 | fail_rpc_action: COORD_CANCEL_QUERY_FINSTANCES_RPC:FAIL | join_before_close: True | buffer_pool_limit: 0]
06:53:08 [gw2] PASSED query_test/test_decimal_casting.py::TestDecimalCasting::test_min_max_zero_null[cast_from: number | decimal_type: (31, 14) | exec_option: {'decimal_v2': 'true'}]
06:53:09 query_test/test_decimal_casting.py::TestDecimalCasting::test_min_max_zero_null[cast_from: number | decimal_type: (31, 22) | exec_option: {'decimal_v2': 'true'}]
06:54:07 query_test/test_cancellation.py::TestCancellationParallel::test_cancel_select[protocol: beeswax | table_format: kudu/none | exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0} | query_type: SELECT | wait_action: 0:GETNEXT:WAIT | cancel_delay: 0 | cpu_limit_s: 100000 | query: compute stats lineitem | fail_rpc_action: COORD_CANCEL_QUERY_FINSTANCES_RPC:FAIL | join_before_close: True | buffer_pool_limit: 0]
06:54:08 [gw6] FAILED query_test/test_decimal_fuzz.py::TestDecimalFuzz::test_decimal_ops[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 5000, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0}]
06:54:08 query_test/test_decimal_fuzz.py::TestDecimalFuzz::test_width_bucket[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 5000, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0}]
06:54:08 [gw6] FAILED query_test/test_decimal_fuzz.py::TestDecimalFuzz::test_width_bucket[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 5000, 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 'exec_single_node_rows_threshold': 0}]
06:54:08 query_test/test_decimal_queries.py::TestDecimalQueries::test_queries[protocol: beeswax | exec_option: {'disable_codegen_rows_threshold': 0, 'disable_codegen': 'false', 'decimal_v2': 'false', 'batch_size': 0} | table_format: text/none]
06:54:08 [gw6] ERROR query_test/test_decimal_queries.py::TestDecimalQueries::test_queries[protocol: beeswax | exec_option: {'disable_codegen_rows_threshold': 0, 'disable_codegen': 'false', 'decimal_v2': 'false', 'batch_size': 0} | table_format: text/none]
06:54:08 query_test/test_decimal_queries.py::TestDecimalQueries::test_queries[protocol: hs2 | exec_option: {'disable_codegen_rows_threshold': 0, 'disable_codegen': 'true', 'decimal_v2': 'false', 'batch_size': 0} | table_format: parquet/none]
{noformat}
One thing that's a little interesting is that it was running select repeat('AZ', 128 * 1024 * 1024), which passes a large string from the backend to the frontend. Maybe something went wrong there?
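For a sense of scale, the size of that query's result can be worked out with a few lines of Python. This is only a back-of-the-envelope sketch: it assumes each character of 'AZ' occupies one byte, and the variable names are illustrative, not taken from the Impala code or tests.

```python
# Size arithmetic for: select repeat('AZ', 128 * 1024 * 1024)
# Assumption: each character of the pattern is a single byte (ASCII/UTF-8).
pattern = "AZ"
repeat_count = 128 * 1024 * 1024

# Total bytes in the repeated string, and the same figure in MiB.
result_bytes = len(pattern.encode("utf-8")) * repeat_count
result_mib = result_bytes / (1024 * 1024)

print(f"{result_bytes} bytes = {result_mib:.0f} MiB")
```

Under that assumption the string is 268435456 bytes (256 MiB), before any extra copies made while handing it from the backend to the frontend.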
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)