You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Riza Suminto (Jira)" <ji...@apache.org> on 2022/03/11 17:52:00 UTC

[jira] [Work started] (IMPALA-11177) crash in useAsyncIoForStream due to unknown orc::StreamKind

     [ https://issues.apache.org/jira/browse/IMPALA-11177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on IMPALA-11177 started by Riza Suminto.
---------------------------------------------
> crash in useAsyncIoForStream due to unknown orc::StreamKind
> -----------------------------------------------------------
>
>                 Key: IMPALA-11177
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11177
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: Impala 4.1.0
>            Reporter: Quanlong Huang
>            Assignee: Riza Suminto
>            Priority: Critical
>
> Hit a DCHECK in useAsyncIoForStream() in an irrelevant build: [https://jenkins.impala.io/job/ubuntu-16.04-dockerised-tests/5391]
> Stacktrace:
> {code:java}
> F0310 11:55:24.087487 15832 hdfs-orc-scanner.cc:183] 0b4eaeb37f8e5d6b:84d11bab00000003] Check failed: false 
> *** Check failure stack trace: ***
>     @          0x574005c  google::LogMessage::Fail()
>     @          0x574190c  google::LogMessage::SendToLog()
>     @          0x573f9ba  google::LogMessage::Flush()
>     @          0x5743578  google::LogMessageFatal::~LogMessageFatal()
>     @          0x2c0d5e3  impala::useAsyncIoForStream()
>     @          0x2c0d77f  impala::HdfsOrcScanner::StartColumnReading()
>     @          0x2c15c36  impala::HdfsOrcScanner::NextStripe()
>     @          0x2c1503f  impala::HdfsOrcScanner::GetNextInternal()
>     @          0x2c13e12  impala::HdfsOrcScanner::ProcessSplit()
>     @          0x2d7266a  impala::HdfsScanNode::ProcessSplit()
>     @          0x2d719ec  impala::HdfsScanNode::ScannerThread()
>     @          0x2d70d49  _ZZN6impala12HdfsScanNode22ThreadTokenAvailableCbEPNS_18ThreadResourcePoolEENKUlvE_clEv
>     @          0x2d73975  _ZN5boost6detail8function26void_function_obj_invoker0IZN6impala12HdfsScanNode22ThreadTokenAvailableCbEPNS3_18ThreadResourcePoolEEUlvE_vE6invokeERNS1_15function_bufferE
>     @          0x23df6b3  boost::function0<>::operator()()
>     @          0x2abe062  impala::Thread::SuperviseThread()
>     @          0x2ac6a9a  boost::_bi::list5<>::operator()<>()
>     @          0x2ac69be  boost::_bi::bind_t<>::operator()()
>     @          0x2ac697f  boost::detail::thread_data<>::run()
>     @          0x43e9dd0  thread_proxy
>     @     0x7f15bf27e6b9  start_thread
>     @     0x7f15bbd8451c  clone 
>  {code}
> The query is
> {code}
> I0310 11:55:23.958173 29215 Frontend.java:1636] 0b4eaeb37f8e5d6b:84d11bab00000000] Analyzing query: select count(*) from (select distinct * from test_fuzz_alltypes_b98dffcf.alltypes) q db: functional_orc_def
> {code}
> Come from test_scanners_fuzz.py::TestScannersFuzzing::test_fuzz_alltypes
> {code}
>   74: client_identifier (string) = "query_test/test_scanners_fuzz.py::TestScannersFuzzing::()::test_fuzz_alltypes[protocol:beeswax|exec_option:{'debug_action':'-1:OPEN:SET_DENY_RESERVATION_PROBABILITY@0.5';'abort_on_error':False;'mem_limit':'512m';'num_nodes':0}|table_format:orc/def/block]",
> {code}
> The underlying ORC files are malformed. I think we should return false in such cases and let the ORC lib returns errors later.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org