You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Prashant Wason (Jira)" <ji...@apache.org> on 2021/06/15 06:00:00 UTC

[jira] [Created] (HUDI-2013) Fallback to file listing may lead to data loss

Prashant Wason created HUDI-2013:
------------------------------------

             Summary: Fallback to file listing may lead to data loss
                 Key: HUDI-2013
                 URL: https://issues.apache.org/jira/browse/HUDI-2013
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Prashant Wason
            Assignee: Prashant Wason


When fallback to file listing mode is enabled (hoodie.metadata.fallback.enable, default is true), then if listing from the metadata table leads to an exception the normal file-system listing used.

Metadata table listing may fail if the table is inconsistent or due to bugs. Falling back to file listing has the following downsides:
 # It masks the issue as the commit does not fail (only an exception is logged).
 # By the time the issue is discovered, logs may have been lost
 # There is no guarantee that all the commits wrote/updated the correct files.

Since listing from metadata table is per-partition, the issue is further complicated when listing for some partitions succeeds (file-list retrieved from metadata table) and fails for other partitions (file list retrieved from filesystem).

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)