You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Hang HOU (Jira)" <ji...@apache.org> on 2022/05/20 02:45:00 UTC
[jira] [Created] (HUDI-4127) Make the function of run_sync_tool.sh called periodically in special cases.
Hang HOU created HUDI-4127:
------------------------------
Summary: Make the function of run_sync_tool.sh called periodically in special cases.
Key: HUDI-4127
URL: https://issues.apache.org/jira/browse/HUDI-4127
Project: Apache Hudi
Issue Type: Wish
Components: meta-sync
Reporter: Hang HOU
When execute querys in hive,once the hudi table generate a new partition(like partition devided by timestamp),querys in hive seems can't get the latest data of the hudi table,unless use run_sync_tool.sh again.
I realised this “target” is not satisfy some situation,maybe we could make this feature more optional, so in some cases we could ignore the update of table partitions and get accurate rows when query hudi table in hive without do run_sync_tool.sh frequently.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)