You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Carl (Jira)" <ji...@apache.org> on 2021/08/11 05:52:00 UTC
[jira] [Created] (FLINK-23719) Support switch WAL in Flink SQL DDL
options
Carl created FLINK-23719:
----------------------------
Summary: Support switch WAL in Flink SQL DDL options
Key: FLINK-23719
URL: https://issues.apache.org/jira/browse/FLINK-23719
Project: Flink
Issue Type: New Feature
Components: Connectors / HBase
Affects Versions: 1.13.2
Reporter: Carl
My application scenario is like this:
Use flick SQL to write the batch data of hive table to a new HBase table every day.
In this case, I think it is appropriate to disable WAL for the following reasons:
1. If an error occurs, i can re-run it. No high security is required.
2. The write speed will be greatly improved. No additional writing to the wal disk log
3. Save disk IO resources of the cluster
However, the HBase official website is described below,
h4. 70.6.10. Disabling the WAL
It is possible to disable the WAL, to improve performance in certain specific situations. However, disabling the WAL puts your data at risk. The only situation where this is recommended is during a bulk load. This is because, in the event of a problem, the bulk load can be re-run with no risk of data loss.
The WAL is disabled by calling the HBase client field {{Mutation.writeToWAL(false)}}. Use the {{Mutation.setDurability(Durability.SKIP_WAL)}} and Mutation.getDurability() methods to set and get the field’s value. There is no way to disable the WAL for only a specific table.
[http://hbase.apache.org/2.3/book.html#hbase_default_configurations]
So i think it needs API level support, not properties support.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)