You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@parquet.apache.org by we...@apache.org on 2020/08/05 19:09:15 UTC

[parquet-testing] 01/01: PARQUET-1878: [C++] lz4 codec is not compatible with Hadoop Lz4Codec (#13)

This is an automated email from the ASF dual-hosted git repository.

wesm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git

commit 60deed44bb750dd584c7307690b450400e916cb6
Merge: 40379b3 25c8ce4
Author: Wes McKinney <we...@users.noreply.github.com>
AuthorDate: Wed Aug 5 14:09:04 2020 -0500

    PARQUET-1878: [C++] lz4 codec is not compatible with Hadoop Lz4Codec (#13)
    
    This patch adds two parquet files used for testing in https://github.com/apache/arrow/pull/7789
    
    File contents (same for both files):
    ```
    $ parquet-tools dump hadoop_snappy_348283e35cc107e1a1f6b1bb00e397fe576bf4bad16ec3216799aa52a7552357.parquet
    row group 0
    --------------------------------------------------------------------------------
    c0:   INT64 SNAPPY DO:0 FPO:4 SZ:97/93/0.96 VC:4 ENC:PLAIN_DICTIONARY,BIT_PACKED [more]...
    c1:   BINARY SNAPPY DO:0 FPO:101 SZ:65/61/0.94 VC:4 ENC:PLAIN_DICTIONA [more]...
    v11:  DOUBLE SNAPPY DO:0 FPO:166 SZ:110/108/0.98 VC:4 ENC:RLE,PLAIN_DI [more]...
        c0 TV=4 RL=0 DL=0 DS:  2 DE:PLAIN_DICTIONARY
        ----------------------------------------------------------------------------
        page 0:                 DLE:BIT_PACKED RLE:BIT_PACKED VLE:PLAIN_DICTIONARY [more]... VC:4
        c1 TV=4 RL=0 DL=0 DS:  2 DE:PLAIN_DICTIONARY
        ----------------------------------------------------------------------------
        page 0:                 DLE:BIT_PACKED RLE:BIT_PACKED VLE:PLAIN_DICTIONARY [more]... VC:4
        v11 TV=4 RL=0 DL=1 DS: 3 DE:PLAIN_DICTIONARY
        ----------------------------------------------------------------------------
        page 0:                 DLE:RLE RLE:BIT_PACKED VLE:PLAIN_DICTIONARY [more]... VC:4
    INT64 c0
    --------------------------------------------------------------------------------
    *** row group 1 of 1, values 1 to 4 ***
    value 1: R:0 D:0 V:1593604800
    value 2: R:0 D:0 V:1593604800
    value 3: R:0 D:0 V:1593604801
    value 4: R:0 D:0 V:1593604801
    BINARY c1
    --------------------------------------------------------------------------------
    *** row group 1 of 1, values 1 to 4 ***
    value 1: R:0 D:0 V:0x616263
    value 2: R:0 D:0 V:0x646566
    value 3: R:0 D:0 V:0x616263
    value 4: R:0 D:0 V:0x646566
    DOUBLE v11
    --------------------------------------------------------------------------------
    *** row group 1 of 1, values 1 to 4 ***
    value 1: R:0 D:1 V:42.0
    value 2: R:0 D:1 V:7.7
    value 3: R:0 D:1 V:42.125
    value 4: R:0 D:1 V:7.7
    ```

 data/hadoop_lz4_compressed.parquet     | Bin 0 -> 702 bytes
 data/non_hadoop_lz4_compressed.parquet | Bin 0 -> 2795 bytes
 2 files changed, 0 insertions(+), 0 deletions(-)