You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Gergely Fürnstáhl (Jira)" <ji...@apache.org> on 2022/09/29 10:18:00 UTC

[jira] [Resolved] (IMPALA-11576) query_test.test_iceberg.test_multiple_storage_locations fails on S3

     [ https://issues.apache.org/jira/browse/IMPALA-11576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gergely Fürnstáhl resolved IMPALA-11576.
----------------------------------------
    Target Version: Impala 4.2.0
        Resolution: Fixed

> query_test.test_iceberg.test_multiple_storage_locations fails on S3
> -------------------------------------------------------------------
>
>                 Key: IMPALA-11576
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11576
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Laszlo Gaal
>            Assignee: Gergely Fürnstáhl
>            Priority: Blocker
>              Labels: broken-build, impala-iceberg
>
> The test seems to fail on a badly constructed file name.
> Stack trace:{code}
> query_test.test_iceberg.TestIcebergTable.test_multiple_storage_locations[protocol: beeswax | exec_option: {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: parquet/none]
> query_test/test_iceberg.py:785: in test_multiple_storage_locations
>     vector, unique_database)
> common/impala_test_suite.py:706: in run_test_case
>     result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
> common/impala_test_suite.py:644: in __exec_in_impala
>     result = self.__execute_query(target_impalad_client, query, user=user)
> common/impala_test_suite.py:980: in __execute_query
>     return impalad_client.execute(query, user=user)
> common/impala_connection.py:212: in execute
>     return self.__beeswax_client.execute(sql_stmt, user=user)
> beeswax/impala_beeswax.py:189: in execute
>     handle = self.__execute_query(query_string.strip(), user=user)
> beeswax/impala_beeswax.py:367: in __execute_query
>     self.wait_for_finished(handle)
> beeswax/impala_beeswax.py:388: in wait_for_finished
>     raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
> E   ImpalaBeeswaxException: ImpalaBeeswaxException:
> E    Query aborted:Disk I/O error on impala-ec2-centos79-m6i-4xlarge-ondemand-1313.vpc.cloudera.com:27001: Failed to open HDFS file s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet
> E   Error(2): No such file or directory
> E   Root cause: FileNotFoundException: No such file or directory: s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a:/impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet{code}
> Here the file name (this is a single, contuguous string despite the apparent line breaks!)
> s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet
> contains the s3a: protocol specifier in the middle of the string, which seems to be a result of a badly set up concatenation.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)