Posted to commits@hudi.apache.org by si...@apache.org on 2023/03/15 22:10:26 UTC
[hudi] branch asf-site updated: [HUDI-5912] Update snapshot_exporter.md (#8142)
This is an automated email from the ASF dual-hosted git repository.
sivabalan pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 7620e6b346b [HUDI-5912] Update snapshot_exporter.md (#8142)
7620e6b346b is described below
commit 7620e6b346b6b8cd93d140a30c3a67db0e423091
Author: clp007 <ch...@gmail.com>
AuthorDate: Thu Mar 16 06:10:19 2023 +0800
[HUDI-5912] Update snapshot_exporter.md (#8142)
Replace with the latest version
---
.../version-0.13.0/snapshot_exporter.md | 20 ++++++++++----------
1 file changed, 10 insertions(+), 10 deletions(-)
diff --git a/website/versioned_docs/version-0.13.0/snapshot_exporter.md b/website/versioned_docs/version-0.13.0/snapshot_exporter.md
index b72a65dc763..168f3a81543 100644
--- a/website/versioned_docs/version-0.13.0/snapshot_exporter.md
+++ b/website/versioned_docs/version-0.13.0/snapshot_exporter.md
@@ -28,10 +28,10 @@ query, perform any repartitioning if required and will write the data as Hudi, p
Exporter scans the source dataset and then makes a copy of it to the target output path.
```bash
spark-submit \
- --jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar" \
+ --jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
- packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
+ /opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/hudi/" \
--output-format "hudi"
@@ -42,12 +42,12 @@ The Exporter can also convert the source dataset into other formats. Currently o
```bash
spark-submit \
- --jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar" \
+ --jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
- packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
+ /opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
- --target-output-path "/tmp/exported/json/" \
+ --target-output-path "/tmp/exported/hudi/" \
--output-format "json" # or "parquet"
```
@@ -60,10 +60,10 @@ By default, if no partitioning parameters are given, the output dataset will hav
Example:
```bash
spark-submit \
- --jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar" \
+ --jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
- packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
+ /opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/json/" \
--output-format "json" \
@@ -105,11 +105,11 @@ After putting this class in `my-custom.jar`, which is then placed on the job cla
```bash
spark-submit \
- --jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar,my-custom.jar" \
+ --jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar,my-custom.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
- packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
+ /opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/json/" \
--output-format "json" \
- --output-partitioner "com.foo.bar.MyPartitioner"
\ No newline at end of file
+ --output-partitioner "com.foo.bar.MyPartitioner"