Posted to commits@parquet.apache.org by ap...@apache.org on 2022/02/15 15:34:22 UTC

[parquet-testing] branch master updated: Add parquet file with optional metadata (#22)

This is an automated email from the ASF dual-hosted git repository.

apitrou pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git


The following commit(s) were added to refs/heads/master by this push:
     new 3edb72a  Add parquet file with optional metadata (#22)
3edb72a is described below

commit 3edb72ae36305b24bf45dde4af41c92e54be85cf
Author: Shani Solomon <84...@users.noreply.github.com>
AuthorDate: Tue Feb 15 17:34:14 2022 +0200

    Add parquet file with optional metadata (#22)
    
    Co-authored-by: Shani Solomon <84...@users.noreply.github.com>
---
 data/README.md                               |  15 ++++++++-------
 data/data_index_bloom_encoding_stats.parquet | Bin 0 -> 1643 bytes
 2 files changed, 8 insertions(+), 7 deletions(-)

diff --git a/data/README.md b/data/README.md
index 37f4289..78bb2ba 100644
--- a/data/README.md
+++ b/data/README.md
@@ -19,13 +19,14 @@
 
 # Test data files for Parquet compatibility and regression testing
 
-| File  | Description |
-|---|---|
-| delta_byte_array.parquet | string columns with DELTA_BYTE_ARRAY encoding. See [delta_byte_array.md](delta_byte_array.md) for details. |
-| delta_binary_packed.parquet | INT32 and INT64 columns with DELTA_BINARY_PACKED encoding. See [delta_binary_packed.md](delta_binary_packed.md) for details. |
-| delta_encoding_required_column.parquet | required INT32 and STRING columns with delta encoding. See [delta_encoding_required_column.md](delta_encoding_required_column.md) for details. |
-| delta_encoding_optional_column.parquet | optional INT64 and STRING columns with delta encoding. See [delta_encoding_optional_column.md](delta_encoding_optional_column.md) for details. |
-| nested_structs.rust.parquet | Used to test that the Rust Arrow reader can lookup the correct field from a nested struct. See [ARROW-11452](https://issues.apache.org/jira/browse/ARROW-11452) |
+| File                                         | Description                                                                                                                                                     |
+|----------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| delta_byte_array.parquet                     | string columns with DELTA_BYTE_ARRAY encoding. See [delta_byte_array.md](delta_byte_array.md) for details.                                                      |
+| delta_binary_packed.parquet                  | INT32 and INT64 columns with DELTA_BINARY_PACKED encoding. See [delta_binary_packed.md](delta_binary_packed.md) for details.                                    |
+| delta_encoding_required_column.parquet       | required INT32 and STRING columns with delta encoding. See [delta_encoding_required_column.md](delta_encoding_required_column.md) for details.                  |
+| delta_encoding_optional_column.parquet       | optional INT64 and STRING columns with delta encoding. See [delta_encoding_optional_column.md](delta_encoding_optional_column.md) for details.                  |
+| nested_structs.rust.parquet                  | Used to test that the Rust Arrow reader can lookup the correct field from a nested struct. See [ARROW-11452](https://issues.apache.org/jira/browse/ARROW-11452) |
+| data_index_bloom_encoding_stats.parquet | optional STRING column. Contains optional metadata: bloom filters, column index, offset index and encoding stats.                                               |
 
 TODO: Document what each file is in the table above.
 
diff --git a/data/data_index_bloom_encoding_stats.parquet b/data/data_index_bloom_encoding_stats.parquet
new file mode 100644
index 0000000..14ad191
Binary files /dev/null and b/data/data_index_bloom_encoding_stats.parquet differ