You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Fang-Yu Rao (Jira)" <ji...@apache.org> on 2022/02/18 19:23:00 UTC

[jira] [Created] (IMPALA-11133) compare_branches.py could fail if the author of a commit contains non-unicode characters

Fang-Yu Rao created IMPALA-11133:
------------------------------------

             Summary: compare_branches.py could fail if the author of a commit contains non-unicode characters
                 Key: IMPALA-11133
                 URL: https://issues.apache.org/jira/browse/IMPALA-11133
             Project: IMPALA
          Issue Type: Bug
          Components: Infrastructure
            Reporter: Fang-Yu Rao
            Assignee: Fang-Yu Rao
             Fix For: Impala 4.1.0


We found that compare_branches.py could fail if the author of a commit contains non-unicode characters.

For instance, if we try to compare a branch '{{{}apache-ref-master{}}}' consisting of a recently committed patch [https://gerrit.cloudera.org/c/18215/] (IMPALA-11113 and IMPALA-11114: fixed single_node_perf_run.py for TPCDS) to an older branch '{{{}cdw-master-staging{}}}' by "{{{}$IMPALA_HOME/bin/compare_branches.py --source_remote_name="" --source_branch apache-ref-master --target_remote_name="" --target_branch cdw-master-staging{}}}", we will get the following error message. Because the author field, i.e., "Gergely Fürnstáhl", contains no-unicode characters.
{code:java}
...
Traceback (most recent call last):
  File "./bin/compare_branches.py", line 287, in <module>
    main()
  File "./bin/compare_branches.py", line 270, in main
    print u'{0} {1} ({2}) - {3}'.format(commit_hash, msg, date, author)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 9: ordinal not in range(128)
{code}
One solution to resolving the issue is to surround '{{{}author{}}}' with '{{{}repr(){}}}'.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)