You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@arrow.apache.org by ap...@apache.org on 2022/07/21 18:23:01 UTC

[arrow-testing] branch master updated: ARROW-17100: Add example of Arrow 2.0 DataPageV2 compression issue (#80)

This is an automated email from the ASF dual-hosted git repository.

apitrou pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/arrow-testing.git


The following commit(s) were added to refs/heads/master by this push:
     new 5bab2f2  ARROW-17100: Add example of Arrow 2.0 DataPageV2 compression issue (#80)
5bab2f2 is described below

commit 5bab2f264a23f5af68f69ea93d24ef1e8e77fc88
Author: Will Jones <wi...@gmail.com>
AuthorDate: Thu Jul 21 11:22:57 2022 -0700

    ARROW-17100: Add example of Arrow 2.0 DataPageV2 compression issue (#80)
    
    * add example of data page v2 data
    
    * feat: add description of file
---
 data/parquet/ARROW-17100.parquet | Bin 0 -> 1342 bytes
 data/parquet/README.md           |  24 ++++++++++++++++++++++++
 2 files changed, 24 insertions(+)

diff --git a/data/parquet/ARROW-17100.parquet b/data/parquet/ARROW-17100.parquet
new file mode 100644
index 0000000..24c07a7
Binary files /dev/null and b/data/parquet/ARROW-17100.parquet differ
diff --git a/data/parquet/README.md b/data/parquet/README.md
new file mode 100644
index 0000000..00502d8
--- /dev/null
+++ b/data/parquet/README.md
@@ -0,0 +1,24 @@
+<!--
+  ~ Licensed to the Apache Software Foundation (ASF) under one
+  ~ or more contributor license agreements.  See the NOTICE file
+  ~ distributed with this work for additional information
+  ~ regarding copyright ownership.  The ASF licenses this file
+  ~ to you under the Apache License, Version 2.0 (the
+  ~ "License"); you may not use this file except in compliance
+  ~ with the License.  You may obtain a copy of the License at
+  ~
+  ~   http://www.apache.org/licenses/LICENSE-2.0
+  ~
+  ~ Unless required by applicable law or agreed to in writing,
+  ~ software distributed under the License is distributed on an
+  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+  ~ KIND, either express or implied.  See the License for the
+  ~ specific language governing permissions and limitations
+  ~ under the License.
+  -->
+
+# Parquet Test Files
+
+| File | Description |
+| --- | --- |
+| ARROW-17100.parquet | Parquet file written by PyArrow 2.0 with DataPageV2 and compressed columns. Prior to PyArrow 3.0, pages were compressed even if the is_compressed flag was 0. This was fixed in ARROW-10353, but for backwards compatibility readers may wish to support such a file. |
\ No newline at end of file