Posted to issues@impala.apache.org by "Sailesh Mukil (JIRA)" <ji...@apache.org> on 2017/09/08 21:09:00 UTC

[jira] [Resolved] (IMPALA-5724) test_union hangs in exhaustive test run

     [ https://issues.apache.org/jira/browse/IMPALA-5724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sailesh Mukil resolved IMPALA-5724.
-----------------------------------
    Resolution: Duplicate

> test_union hangs in exhaustive test run
> ---------------------------------------
>
>                 Key: IMPALA-5724
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5724
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Distributed Exec
>    Affects Versions: Impala 2.10.0
>         Environment: rhel7
>            Reporter: Matthew Jacobs
>            Assignee: Sailesh Mukil
>            Priority: Critical
>              Labels: statestore
>         Attachments: statestored.log
>
>
> On a recent exhaustive Jenkins run (on RHEL 7), TestStatestore timed out:
> {code}
> 08:41:31 [gw2] PASSED unittests/test_file_parser.py::TestTestFileParser::test_parse_commented_out_test_as_comment 
> 08:41:31 unittests/test_result_verifier.py::TestResultVerifier::test_result_row_indexing[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: text/none] 
> 08:41:31 [gw2] PASSED unittests/test_result_verifier.py::TestResultVerifier::test_result_row_indexing[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: text/none] 
> 08:41:31 [gw3] PASSED statestore/test_statestore.py::TestStatestore::test_update_is_delta 
> 08:41:45 [gw0] PASSED statestore/test_statestore.py::TestStatestore::test_failure_detected 
> Build timed out (after 1,440 minutes). Marking the build as failed.
> 20:56:59 Build was aborted
> 20:56:59 Archiving artifacts
> 20:56:59 
> 20:56:59 [gw0] node down: Not properly terminated
> 20:56:59 [gw0] FAILED statestore/test_statestore.py::TestStatestore::test_topic_persistence 
> 20:56:59 Replacing crashed slave gw0
> 20:57:05 Recording test results
> 20:57:08 Email was triggered for: Failure
> 20:57:08 Sending email for trigger: Failure
> 20:57:08 Sending email to: impala-jenkins@cloudera.com
> 20:57:08 
> 20:57:08 Deleting project workspace... 
> 20:57:08 done
> 20:57:08 
> 20:57:08 Finished: FAILURE
> {code}
> The statestore logs show many errors like the following:
> {code}
> I0726 08:24:01.102785 30978 statestore.cc:526] Preparing initial test_skipped_b1501e92-7215-11e7-a5fa-02581563417c topic update for python-test-client-b1507018-7215-11e7-a5fa-02581563417c. Size = 8.00 B
> I0726 08:24:01.103085 30978 thrift-util.cc:123] TSocket::open() connect() <Host: localhost Port: 45518>Connection refused
> I0726 08:24:01.415092 30978 status.cc:55] RPC Error: Client for localhost:45518 hits an unexpected exception: TProtocolException: Invalid data, type: N6apache6thrift8protocol18TProtocolExceptionE rpc send completed: true
>     @          0x12590d6  impala::Status::Status()
>     @          0x15ee502  impala::ClientConnection<>::DoRpc<>()
>     @          0x15e7431  impala::Statestore::SendTopicUpdate()
>     @          0x15e9610  impala::Statestore::DoSubscriberUpdate()
>     @          0x15fecfe  boost::_mfi::mf3<>::operator()()
>     @          0x15fd5a5  boost::_bi::list4<>::operator()<>()
>     @          0x15fb3de  boost::_bi::bind_t<>::operator()<>()
>     @          0x15f88d3  boost::detail::function::void_function_obj_invoker2<>::invoke()
>     @          0x15f4dfd  boost::function2<>::operator()()
>     @          0x15f06ef  impala::ThreadPool<>::WorkerThread()
>     @          0x160038d  boost::_mfi::mf1<>::operator()()
>     @          0x15ffe17  boost::_bi::list2<>::operator()<>()
>     @          0x15fedfd  boost::_bi::bind_t<>::operator()()
>     @          0x15fd88c  boost::detail::function::void_function_obj_invoker0<>::invoke()
>     @          0x13d6148  boost::function0<>::operator()()
>     @          0x16a7031  impala::Thread::SuperviseThread()
>     @          0x16afb38  boost::_bi::list4<>::operator()<>()
>     @          0x16afa7b  boost::_bi::bind_t<>::operator()()
>     @          0x16afa3e  boost::detail::thread_data<>::run()
>     @          0x1ba055a  thread_proxy
>     @     0x7f23cdfa9df3  start_thread
>     @     0x7f23cdcd71ad  __clone
> I0726 08:24:01.415179 30978 client-cache.cc:170] Broken Connection, destroy client for localhost:45518
> I0726 08:24:01.415273 30978 statestore.cc:697] Unable to send topic update message to subscriber python-test-client-b1507018-7215-11e7-a5fa-02581563417c, received error: RPC Error: Client for localhost:45518 hits an unexpected exception: TProtocolException: Invalid data, type: N6apache6thrift8protocol18TProtocolExceptionE rpc send completed: true
> {code}
> I've attached the full statestored log.


