You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Michael Brown (JIRA)" <ji...@apache.org> on 2017/04/11 18:01:41 UTC

[jira] [Created] (IMPALA-5197) COMPUTE STATS with MT_DOP occasionally claim "Corrupt Parquet file"

Michael Brown created IMPALA-5197:
-------------------------------------

             Summary: COMPUTE STATS with MT_DOP occasionally claim "Corrupt Parquet file"
                 Key: IMPALA-5197
                 URL: https://issues.apache.org/jira/browse/IMPALA-5197
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
    Affects Versions: Impala 2.9.0
            Reporter: Michael Brown
            Priority: Critical


With IMPALA-5186, [~dhecht] noticed messages like:

{noformat}
I0407 12:57:05.306138 85140 status.cc:114] Corrupt Parquet file 'hdfs://vc0332.halxg.cloudera.com:8020/user/hive/warehouse/tpch_100_parquet.db/partsupp/3444dbb2ccec395e-45da764500000007_1009013170_data.0.parq': column 'ps_partkey' had 1024 remaining values but expected 0
{noformat}

I spent a bit more time investigating this, and it seems possible but difficult to reproduce this, though it's non-deterministic from what I can tell.

The stress test executes various {{COMPUTE STATS}} statements on the tables under test, with different {{MT_DOP}} settings. This is also in conjunction with a memory limit which the stress test applies to each statement.

Sometimes, it's possible to trigger these corrupt parquet file warnings. When that happens, the {{COMPUTE STATS}} fails with "memory limit exceeded".

For example, these queries reproduced the problem on the first try:

{noformat}
set mem_limit=1225m;
set mt_dop=16;
compute stats tpcds_300_decimal_parquet.store_sales;

set mem_limit=527m;
set mt_dop=4;
compute stats tpcds_300_decimal_parquet.store_sales;
{noformat}

These memory limits are right on the edge of the apparent limits of the statement. Sometimes the statement would appear to completely succeed; other times it would not be able to under the memory limits, but no corrupt messages were printed.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)