You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2020/10/30 12:22:00 UTC
[jira] [Assigned] (IMPALA-10145) UnicodeDecodeError in Thrift
0.11.0 generated files
[ https://issues.apache.org/jira/browse/IMPALA-10145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Quanlong Huang reassigned IMPALA-10145:
---------------------------------------
Assignee: Quanlong Huang
> UnicodeDecodeError in Thrift 0.11.0 generated files
> ---------------------------------------------------
>
> Key: IMPALA-10145
> URL: https://issues.apache.org/jira/browse/IMPALA-10145
> Project: IMPALA
> Issue Type: Bug
> Reporter: Adam Tamas
> Assignee: Quanlong Huang
> Priority: Major
>
> If there is a string with undecodable characters in the query results, then an error will happen during the fetching while thrift 0.11.0 generated python files were in use which results in an UnicodeDecodeError.
> Depending on which protocol is in use with the impala-shell, the error will happen in different places.
> Examples for hs2-http and hs2 protocolls:
> {code:java}
> [localhost:28000] default> select unhex('aa');
> Query: select unhex('aa')
> Query submitted at: 2020-09-04 12:41:14 (Coordinator: http://tadam-OptiPlex-7070:25000)
> Query progress can be monitored at: http://tadam-OptiPlex-7070:25000/query_plan?query_id=d041ab999f597fec:46a8b51800000000
> Caught exception 'utf8' codec can't decode byte 0xaa in position 0: invalid start byte, type=<type 'exceptions.UnicodeDecodeError'> in FetchResults.
> Unknown Exception : 'utf8' codec can't decode byte 0xaa in position 0: invalid start byte
> Traceback (most recent call last):
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/impala_shell.py", line 1183, in _execute_stmt
> for rows in rows_fetched:
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/lib/impala_client.py", line 781, in fetch
> resp = self._do_hs2_rpc(FetchResults)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/lib/impala_client.py", line 942, in _do_hs2_rpc
> return rpc()
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/lib/impala_client.py", line 778, in FetchResults
> return self.imp_service.FetchResults(req)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/TCLIService.py", line 717, in FetchResults
> return self.recv_FetchResults()
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/TCLIService.py", line 736, in recv_FetchResults
> result.read(iprot)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/TCLIService.py", line 3593, in read
> self.success.read(iprot)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/ttypes.py", line 5888, in read
> self.results.read(iprot)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/ttypes.py", line 2670, in read
> _elem115.read(iprot)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/ttypes.py", line 2556, in read
> self.stringVal.read(iprot)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/ttypes.py", line 2352, in read
> _elem95 = iprot.readString().decode('utf-8') if sys.version_info[0] == 2 else iprot.readString()
> File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
> return codecs.utf_8_decode(input, errors, True)
> UnicodeDecodeError: 'utf8' codec can't decode byte 0xaa in position 0: invalid start byte
> [Not connected] >
> {code}
> {code:java}
> [localhost:21050] default> select unhex('aa');
> Query: select unhex('aa')
> Query submitted at: 2020-09-04 12:42:22 (Coordinator: http://tadam-OptiPlex-7070:25000)
> Query progress can be monitored at: http://tadam-OptiPlex-7070:25000/query_plan?query_id=3a481e2a0581ea7c:a6e1901800000000
> Caught exception 'utf8' codec can't decode byte 0xaa in position 0: invalid start byte, type=<type 'exceptions.UnicodeDecodeError'> in FetchResults.
> Unknown Exception : 'utf8' codec can't decode byte 0xaa in position 0: invalid start byte
> Traceback (most recent call last):
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/impala_shell.py", line 1183, in _execute_stmt
> for rows in rows_fetched:
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/lib/impala_client.py", line 781, in fetch
> resp = self._do_hs2_rpc(FetchResults)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/lib/impala_client.py", line 942, in _do_hs2_rpc
> return rpc()
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/lib/impala_client.py", line 778, in FetchResults
> return self.imp_service.FetchResults(req)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/TCLIService.py", line 717, in FetchResults
> return self.recv_FetchResults()
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/TCLIService.py", line 736, in recv_FetchResults
> result.read(iprot)
> File "/home/tadam/imp/impala/shell/build/impala-shell-4.0.0-SNAPSHOT/gen-py/TCLIService/TCLIService.py", line 3583, in read
> iprot._fast_decode(self, iprot, [self.__class__, self.thrift_spec])
> UnicodeDecodeError: 'utf8' codec can't decode byte 0xaa in position 0: invalid start byte
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org