Posted to commits@hudi.apache.org by "Hang HOU (Jira)" <ji...@apache.org> on 2022/05/20 02:45:00 UTC

[jira] [Created] (HUDI-4127) Make the function of run_sync_tool.sh called periodically in special cases.

Hang HOU created HUDI-4127:
------------------------------

             Summary: Make the function of run_sync_tool.sh called periodically in special cases.
                 Key: HUDI-4127
                 URL: https://issues.apache.org/jira/browse/HUDI-4127
             Project: Apache Hudi
          Issue Type: Wish
          Components: meta-sync
            Reporter: Hang HOU


When executing queries in Hive, once the Hudi table generates a new partition (for example, when the table is partitioned by timestamp), the queries do not seem to return the latest data of the Hudi table unless run_sync_tool.sh is run again.
I realised that this behavior does not suit some situations. Maybe we could make this feature more optional, so that in some cases we could ignore the update of table partitions and still get accurate rows when querying the Hudi table in Hive, without having to run run_sync_tool.sh frequently.
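
As an illustration of the workaround described above (re-running the sync script at a fixed interval), here is a minimal sketch in Python. It is not part of Hudi: the helper name run_periodically, the interval, and the run count are all assumptions made for this example, and the arguments that run_sync_tool.sh needs are deliberately left out because they depend on the deployment.

```python
import subprocess
import time
from typing import Callable, List, Sequence


def run_periodically(
    command: Sequence[str],
    interval_seconds: float,
    max_runs: int,
    runner: Callable[..., subprocess.CompletedProcess] = subprocess.run,
    sleep: Callable[[float], None] = time.sleep,
) -> List[int]:
    """Run `command` `max_runs` times, pausing `interval_seconds` between runs.

    Hypothetical helper for illustration only. Returns the exit code of
    each run, in order.
    """
    exit_codes: List[int] = []
    for attempt in range(max_runs):
        # check=False: a failed sync should not prevent later attempts.
        result = runner(list(command), check=False)
        exit_codes.append(result.returncode)
        # Sleep only between runs, not after the last one.
        if attempt < max_runs - 1:
            sleep(interval_seconds)
    return exit_codes
```

It could be called as run_periodically(["./run_sync_tool.sh", ...], interval_seconds=300, max_runs=12), with the usual sync arguments in place of the "...". A cron entry would achieve the same thing; either way this only automates the repeated sync and does not remove the need for it, which is what this issue asks for.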



--
This message was sent by Atlassian Jira
(v8.20.7#820007)