You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by sr...@apache.org on 2023/10/21 21:39:24 UTC
[spark] branch master updated: [SPARK-45484][SQL][FOLLOWUP][DOCS] Update the document of parquet compression codec
This is an automated email from the ASF dual-hosted git repository.
srowen pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 4023ec9bb44 [SPARK-45484][SQL][FOLLOWUP][DOCS] Update the document of parquet compression codec
4023ec9bb44 is described below
commit 4023ec9bb4471efee36afcec041c114a4b86a2c8
Author: Jiaan Geng <be...@163.com>
AuthorDate: Sat Oct 21 16:39:13 2023 -0500
[SPARK-45484][SQL][FOLLOWUP][DOCS] Update the document of parquet compression codec
### What changes were proposed in this pull request?
This PR follows up https://github.com/apache/spark/pull/43310 to update the document of parquet compression codec.
### Why are the changes needed?
Update the document of parquet compression codec.
### Does this PR introduce _any_ user-facing change?
'No'.
### How was this patch tested?
N/A
### Was this patch authored or co-authored using generative AI tooling?
'No'.
Closes #43464 from beliefer/SPARK-45484_followup.
Authored-by: Jiaan Geng <be...@163.com>
Signed-off-by: Sean Owen <sr...@gmail.com>
---
docs/sql-data-sources-parquet.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/docs/sql-data-sources-parquet.md b/docs/sql-data-sources-parquet.md
index 925e47504e5..c2af58248ea 100644
--- a/docs/sql-data-sources-parquet.md
+++ b/docs/sql-data-sources-parquet.md
@@ -423,7 +423,7 @@ Data source options of Parquet can be set via:
<tr>
<td><code>compression</code></td>
<td><code>snappy</code></td>
- <td>Compression codec to use when saving to file. This can be one of the known case-insensitive shorten names (none, uncompressed, snappy, gzip, lzo, brotli, lz4, and zstd). This will override <code>spark.sql.parquet.compression.codec</code>.</td>
+ <td>Compression codec to use when saving to file. This can be one of the known case-insensitive shorten names (none, uncompressed, snappy, gzip, lzo, brotli, lz4, lz4_raw, and zstd). This will override <code>spark.sql.parquet.compression.codec</code>.</td>
<td>write</td>
</tr>
</table>
@@ -484,7 +484,7 @@ Configuration of Parquet can be done using the `setConf` method on `SparkSession
Sets the compression codec used when writing Parquet files. If either <code>compression</code> or
<code>parquet.compression</code> is specified in the table-specific options/properties, the precedence would be
<code>compression</code>, <code>parquet.compression</code>, <code>spark.sql.parquet.compression.codec</code>. Acceptable values include:
- none, uncompressed, snappy, gzip, lzo, brotli, lz4, zstd.
+ none, uncompressed, snappy, gzip, lzo, brotli, lz4, lz4_raw, zstd.
Note that <code>brotli</code> requires <code>BrotliCodec</code> to be installed.
</td>
<td>1.1.1</td>
---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org