You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Gergely Fürnstáhl (Jira)" <ji...@apache.org> on 2022/02/09 09:43:00 UTC

[jira] [Created] (IMPALA-11113) single_node_perf_run.py throws UnicodeDecodeError for TPCDS dataset

Gergely Fürnstáhl created IMPALA-11113:
------------------------------------------

             Summary: single_node_perf_run.py throws UnicodeDecodeError for TPCDS dataset
                 Key: IMPALA-11113
                 URL: https://issues.apache.org/jira/browse/IMPALA-11113
             Project: IMPALA
          Issue Type: Bug
            Reporter: Gergely Fürnstáhl
            Assignee: Gergely Fürnstáhl


Possible fix:

[https://stackoverflow.com/questions/19872773/unicodedecodeerror-while-using-json-dumps]

Exception:

Traceback (most recent call last):
  File "/home/gfurnstahl/Impala/bin/run-workload.py", line 280, in <module>
    json.dump(result_map, f, cls=CustomJSONEncoder)
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/__init__.py", line 189, in dump
    for chunk in iterable:
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 434, in _iterencode
    for chunk in _iterencode_dict(o, _current_indent_level):
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 408, in _iterencode_dict
    for chunk in chunks:
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 332, in _iterencode_list
    for chunk in chunks:
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 443, in _iterencode
    for chunk in _iterencode(o, _current_indent_level):
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 434, in _iterencode
    for chunk in _iterencode_dict(o, _current_indent_level):
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 408, in _iterencode_dict
    for chunk in chunks:
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/json/encoder.py", line 313, in _iterencode_list
    yield buf + _encoder(value)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xc9 in position 47: invalid continuation byte
Traceback (most recent call last):
  File "./bin/single_node_perf_run.py", line 359, in <module>
    main()
  File "./bin/single_node_perf_run.py", line 349, in main
    perf_ab_test(options, args)
  File "./bin/single_node_perf_run.py", line 256, in perf_ab_test
    run_workload(temp_dir, workloads, options)
  File "./bin/single_node_perf_run.py", line 154, in run_workload
    configured_call(run_workload)
  File "./bin/single_node_perf_run.py", line 94, in configured_call
    return subprocess.check_call(["bash", "-c", cmd])
  File "/home/gfurnstahl/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/subprocess.py", line 190, in check_call
    raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['bash', '-c', 'source /home/gfurnstahl/Impala/bin/impala-config.sh && /home/gfurnstahl/Impala/bin/run-workload.py --workloads=tpcds:10 --impalads=localhost:21000 --results_json_file=/home/gfurnstahl/Impala/perf_results/perf_run_l1WHcn/27a1b4c1203fd1fc7929d23659eed0861703e9e1.json --query_iterations=3 --table_formats=parquet/none --plan_first']' returned non-zero exit status 1



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org