You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Fang-Yu Rao (Jira)" <ji...@apache.org> on 2021/01/20 16:55:00 UTC

[jira] [Commented] (IMPALA-10434) impala-shell crash in parsing multiline queries that contain UTF-8 characters

    [ https://issues.apache.org/jira/browse/IMPALA-10434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17268692#comment-17268692 ] 

Fang-Yu Rao commented on IMPALA-10434:
--------------------------------------

Similar with IMPALA-1130 but the error details are different. Current error is:
{code:java}
[localhost:21050] default> select 'è'
                         > ;
Traceback (most recent call last):
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2063, in <module>
    impala_shell_main()
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2028, in impala_shell_main
    shell.cmdloop(intro)
  File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py", line 141, in cmdloop
    line = self.precmd(line)
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 632, in precmd
    args = self.sanitise_input(args.decode('utf-8'))  # python2
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 436, in sanitise_input
    tokens = args.strip().split(' ')
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 8: ordinal not in range(128)
{code}
I also verified that impala-3.4 doesn't have this issue. So maybe some changes in impala-4.0 cause this.


> impala-shell crash in parsing multiline queries that contain UTF-8 characters
> -----------------------------------------------------------------------------
>
>                 Key: IMPALA-10434
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10434
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Clients
>    Affects Versions: Impala 4.0
>            Reporter: Fang-Yu Rao
>            Assignee: Quanlong Huang
>            Priority: Critical
>
> I'm at master branch (commit a6a244099502329d9193b316ea26d5fd6451b6bd) and hit this error:
> {code:java}
> [localhost:21050] default> select "你好";
> Query: select "你好"
> Query submitted at: 2020-12-30 11:00:40 (Coordinator: http://quanlong-OptiPlex-BJ:25000)
> Query progress can be monitored at: http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=554d2348a28884c6:30835a4800000000
> +--------+
> | '你好' |
> +--------+
> | 你好   |
> +--------+
> Fetched 1 row(s) in 0.12s
> [localhost:21050] default> select
>                          > "你好";
> Traceback (most recent call last):
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2062, in <module>
>     impala_shell_main()
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2027, in impala_shell_main
>     shell.cmdloop(intro)
>   File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py", line 141, in cmdloop
>     line = self.precmd(line)
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 631, in precmd
>     args = self.sanitise_input(args.decode('utf-8'))  # python2
>   File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 435, in sanitise_input
>     tokens = args.strip().split(' ')
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 8: ordinal not in range(128) {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org