You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Riza Suminto (Jira)" <ji...@apache.org> on 2022/05/26 07:27:00 UTC
[jira] [Assigned] (IMPALA-5845) Impala should de-duplicate row parsing error
[ https://issues.apache.org/jira/browse/IMPALA-5845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Riza Suminto reassigned IMPALA-5845:
------------------------------------
Assignee: Riza Suminto
> Impala should de-duplicate row parsing error
> --------------------------------------------
>
> Key: IMPALA-5845
> URL: https://issues.apache.org/jira/browse/IMPALA-5845
> Project: IMPALA
> Issue Type: Bug
> Components: Backend
> Affects Versions: Impala 3.1.0
> Reporter: Juan Yu
> Assignee: Riza Suminto
> Priority: Major
> Labels: ramp-up, supportability
>
> Impala log file grew very quickly with lots of error like
> I0824 10:44:46.527885 8679 runtime-state.cc:217] Error from query 804d64b80df65fda:a5349b0700000000: Error parsing row: file: hdfs://nameservice1/user/hive/tpcds.db/store_sales/00005.parq, before offset: 120795952
> There are 622000 errors for only 141 unique files
> Impala already de-duplicate similar error in lots of scenarios, could the row parsing error be de-duplicated as well to reduce log size and easier troubleshooting?
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org