You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "David Knupp (JIRA)" <ji...@apache.org> on 2017/12/11 20:48:00 UTC

[jira] [Created] (IMPALA-6306) Impalad becomes unreachable trying to load tpch_nested_parquet

David Knupp created IMPALA-6306:
-----------------------------------

             Summary: Impalad becomes unreachable trying to load tpch_nested_parquet
                 Key: IMPALA-6306
                 URL: https://issues.apache.org/jira/browse/IMPALA-6306
             Project: IMPALA
          Issue Type: Bug
          Components: Frontend
    Affects Versions: Impala 2.11.0
            Reporter: David Knupp


I've been trying (unsuccessfully) to load the tpch_nested_parquet table onto a remote cluster using Impala's standard data load scripts. I finally confirmed that I can complete the data load process if I simply comment out the lines from {{testdata/bin/create-load-data.sh}} that calls load_nested.py:
{noformat}
# run-step "Loading nested data" load-nested.log \
#   ${IMPALA_HOME}/testdata/bin/load_nested.py ${LOAD_NESTED_ARGS:-}
{noformat}
With the all of the other data completely loaded, I tried to run load_nested.py by hand, and saw this error:
{noformat}
systest@remote-joe:~$ Impala/testdata/bin/load_nested.py --cm-host impala-dataload-testing-1.vpc.cloudera.com
2017-12-10 13:45:12,663 INFO:db_connection[234]:Creating database tpch_nested_parquet
2017-12-10 13:45:12,965 INFO:load_nested[98]:Creating temp orders (chunk 1 of 1)
2017-12-10 13:45:33,724 INFO:load_nested[128]:Creating temp customers (chunk 1 of 1)
Traceback (most recent call last):
  File "Impala/testdata/bin/load_nested.py", line 320, in <module>
    load()
  File "Impala/testdata/bin/load_nested.py", line 130, in load
    impala.execute("CREATE TABLE tmp_customer_string AS " + tmp_customer_sql)
  File "/data1/systest/Impala/tests/comparison/db_connection.py", line 206, in execute
    return self._cursor.execute(sql, *args, **kwargs)
  File "/data1/systest/Impala/infra/python/env/local/lib/python2.7/site-packages/impala/hiveserver2.py", line 304, in execute
    self._wait_to_finish()  # make execute synchronous
  File "/data1/systest/Impala/infra/python/env/local/lib/python2.7/site-packages/impala/hiveserver2.py", line 380, in _wait_to_finish
    raise OperationalError(resp.errorMessage)
impala.error.OperationalError: Cancelled due to unreachable impalad(s): impala-dataload-testing-2.vpc.cloudera.com:22000
{noformat}

From the impalad log:
{noformat}
I1210 13:45:12.356262 17040 Frontend.java:909] Compiling query: DESCRIBE tpch_nested_parquet.part
I1210 13:45:12.358700 17040 Frontend.java:948] Compiled query.
I1210 13:45:12.358832 17040 jni-util.cc:211] org.apache.impala.common.AnalysisException: Could not resolve path: 'tpch_nested_parquet.part'
        at org.apache.impala.analysis.Analyzer.resolvePath(Analyzer.java:800)
        at org.apache.impala.analysis.Analyzer.resolvePath(Analyzer.java:753)
        at org.apache.impala.analysis.DescribeTableStmt.analyze(DescribeTableStmt.java:106)
        at org.apache.impala.analysis.AnalysisContext.analyze(AnalysisContext.java:388)
        at org.apache.impala.analysis.AnalysisContext.analyze(AnalysisContext.java:369)
        at org.apache.impala.service.Frontend.analyzeStmt(Frontend.java:920)
        at org.apache.impala.service.Frontend.createExecRequest(Frontend.java:1069)
        at org.apache.impala.service.JniFrontend.createExecRequest(JniFrontend.java:156)
I1210 13:45:12.371624 17040 status.cc:125] AnalysisException: Could not resolve path: 'tpch_nested_parquet.part'
    @           0x9597f9  impala::Status::Status()
    @           0xc9df62  impala::JniUtil::GetJniExceptionMsg()
    @           0xba2a7b  impala::Frontend::GetExecRequest()
    @           0xbc0558  impala::ImpalaServer::ExecuteInternal()
    @           0xbc6858  impala::ImpalaServer::Execute()
    @           0xc2244e  impala::ImpalaServer::ExecuteStatement()
    @          0x10a8326  apache::hive::service::cli::thrift::TCLIServiceProcessor::process_ExecuteStatement()
    @          0x10a1f44  apache::hive::service::cli::thrift::TCLIServiceProcessor::dispatchCall()
    @           0x929ecc  apache::thrift::TDispatchProcessor::process()
    @           0xafa43f  apache::thrift::server::TAcceptQueueServer::Task::run()
    @           0xaf4d35  impala::ThriftThread::RunRunnable()
    @           0xaf5b12  boost::detail::function::void_function_obj_invoker0<>::invoke()
    @           0xd10b63  impala::Thread::SuperviseThread()
    @           0xd112a4  boost::detail::thread_data<>::run()
    @          0x128afda  (unknown)
    @     0x7f2a6a7a8e25  start_thread
    @     0x7f2a6a4d634d  __clone
I1210 13:45:12.371713 17040 impala-server.cc:992] UnregisterQuery(): query_id=2748b77529da2004:7602cd6d00000000
I1210 13:45:12.371726 17040 impala-server.cc:1075] Cancel(): query_id=2748b77529da2004:7602cd6d00000000
I1210 13:45:12.373303 17040 impala-hs2-server.cc:381] CloseSession(): request=TCloseSessionReq {
  01: sessionHandle (struct) = TSessionHandle {
    01: sessionId (struct) = THandleIdentifier {
      01: guid (string) = ";\xad\xe0P\xb8hH\xda\x82v+\xeft\xb1h9",
      02: secret (string) = "w\xb9!#\xcd:A\xeb\xbf\x83g\xec<\x91\xf4\xd7",
    },
  },
}
I1210 13:45:12.374457 17040 impala-server.cc:1796] Connection from client ::ffff:172.26.22.57:59617 closed, closing 1 associated session(s)
I1210 13:45:12.375746 17041 impala-hs2-server.cc:381] CloseSession(): request=TCloseSessionReq {
  01: sessionHandle (struct) = TSessionHandle {
    01: sessionId (struct) = THandleIdentifier {
      01: guid (string) = ";\xad\xe0P\xb8hH\xda\x82v+\xeft\xb1h9",
      02: secret (string) = "w\xb9!#\xcd:A\xeb\xbf\x83g\xec<\x91\xf4\xd7",
    },
  },
}
I1210 13:45:12.387533 17041 status.cc:125] Invalid session id: da4868b850e0ad3b:3968b174ef2b7682
    @           0x9597f9  impala::Status::Status()
    @           0xbc5d1e  impala::ImpalaServer::CloseSessionInternal()
    @           0xc1884e  impala::ImpalaServer::CloseSession()
    @          0x10961e6  apache::hive::service::cli::thrift::TCLIServiceProcessor::process_CloseSession()
    @          0x10a1f44  apache::hive::service::cli::thrift::TCLIServiceProcessor::dispatchCall()
    @           0x929ecc  apache::thrift::TDispatchProcessor::process()
    @           0xafa43f  apache::thrift::server::TAcceptQueueServer::Task::run()
    @           0xaf4d35  impala::ThriftThread::RunRunnable()
    @           0xaf5b12  boost::detail::function::void_function_obj_invoker0<>::invoke()
    @           0xd10b63  impala::Thread::SuperviseThread()
    @           0xd112a4  boost::detail::thread_data<>::run()
    @          0x128afda  (unknown)
    @     0x7f2a6a7a8e25  start_thread
    @     0x7f2a6a4d634d  __clone
I1210 13:45:12.399276 17041 status.cc:125] Invalid session id: da4868b850e0ad3b:3968b174ef2b7682
    @           0x9597f9  impala::Status::Status()
    @           0xbc5d1e  impala::ImpalaServer::CloseSessionInternal()
    @           0xc189ac  impala::ImpalaServer::CloseSession()
    @          0x10961e6  apache::hive::service::cli::thrift::TCLIServiceProcessor::process_CloseSession()
    @          0x10a1f44  apache::hive::service::cli::thrift::TCLIServiceProcessor::dispatchCall()
    @           0x929ecc  apache::thrift::TDispatchProcessor::process()
    @           0xafa43f  apache::thrift::server::TAcceptQueueServer::Task::run()
    @           0xaf4d35  impala::ThriftThread::RunRunnable()
    @           0xaf5b12  boost::detail::function::void_function_obj_invoker0<>::invoke()
    @           0xd10b63  impala::Thread::SuperviseThread()
    @           0xd112a4  boost::detail::thread_data<>::run()
    @          0x128afda  (unknown)
    @     0x7f2a6a7a8e25  start_thread
    @     0x7f2a6a4d634d  __clone
{noformat}




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)