You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by GitBox <gi...@apache.org> on 2021/04/10 17:34:27 UTC

[GitHub] [parquet-format] jorgecarleitao commented on a change in pull request #170: PARQUET-2021: Add description of RLE/bit-packed encoder

jorgecarleitao commented on a change in pull request #170:
URL: https://github.com/apache/parquet-format/pull/170#discussion_r611072057



##########
File path: rle-bitpacked.md
##########
@@ -0,0 +1,78 @@
+# RLE-Bitpacked hybrid encoder
+
+The RLE-Bitpacked hybrid encoder is a parquet-specific encoder that combines two well known encoding strategies,
+[RLE](https://en.wikipedia.org/wiki/Run-length_encoding) and bitpacking. Note that "combine" here means this encoder allows both encodings within the same stream, and, during encoding, it can switch between them.
+
+This encoder is only used to encode integer values that may either represent definition levels, representation levels or ids of dictionary-encoded pages. Note that this encoder supports integers that can be represented in less than 8 bits.

Review comment:
       Actually, it seems that booleans are just bitpacked and therefore have no header nor RLE encoding.
   




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org