Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/06/27 12:39:55 UTC

[GitHub] [arrow] westonpace commented on a diff in pull request #13333: ARROW-16773: [Docs][Format] Document Run-Length encoding in Arrow columnar format

westonpace commented on code in PR #13333:
URL: https://github.com/apache/arrow/pull/13333#discussion_r907339549


##########
docs/source/format/Columnar.rst:
##########
@@ -957,7 +1016,7 @@ The ``Buffer`` Flatbuffers value describes the location and size of a
 piece of memory. Generally these are interpreted relative to the
 **encapsulated message format** defined below.
 
-The ``size`` field of ``Buffer`` is not required to account for padding
+The ``size`` field of ``Buffer`` is not required to account for padding

Review Comment:
   ```suggestion
   The ``size`` field of ``Buffer`` is not required to account for padding
   ```



##########
docs/source/format/Columnar.rst:
##########
@@ -765,6 +765,65 @@ application.
 We discuss dictionary encoding as it relates to serialization further
 below.
 
+.. _run-length-encoded-layout:
+
+Run-Length-encoded Layout
+-------------------------
+
+Run-Length is a data representation that represents data as sequences of the
+same value, called runs. Each run is represented as a value, and an integer
+describing how often this value is repeated.
+
+Any array can be run-length-encoded. A run-length encoded array has a single

Review Comment:
   ```suggestion
   Any array can be run-length encoded. A run-length encoded array has a single
   ```



##########
docs/source/format/Columnar.rst:
##########
@@ -765,6 +765,65 @@ application.
 We discuss dictionary encoding as it relates to serialization further
 below.
 
+.. _run-length-encoded-layout:
+
+Run-Length-encoded Layout

Review Comment:
   ```suggestion
   Run-Length Encoded Layout
   ```



##########
docs/source/format/Columnar.rst:
##########
@@ -765,6 +765,65 @@ application.
 We discuss dictionary encoding as it relates to serialization further
 below.
 
+.. _run-length-encoded-layout:
+
+Run-Length-encoded Layout
+-------------------------
+
+Run-Length is a data representation that represents data as sequences of the
+same value, called runs. Each run is represented as a value, and an integer
+describing how often this value is repeated.
+
+Any array can be run-length-encoded. A run-length encoded array has a single
+buffer holding as many signed 32-bit integers, as there are runs. The actual

Review Comment:
   ```suggestion
   buffer holding a signed 32-bit integer for each run. The actual
   ```
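
For illustration only (not part of the PR or the Arrow API): a minimal Python sketch of the representation as the draft text above describes it, with one value per run and one integer saying how often that value repeats. The names `rle_encode` and `rle_decode` are hypothetical, and plain Python lists stand in for the Arrow buffers.

```python
from itertools import groupby


def rle_encode(values):
    """Split a sequence into runs; return (run_values, run_lengths)."""
    run_values, run_lengths = [], []
    for value, group in groupby(values):
        run_values.append(value)
        # Count how many consecutive times this value repeats.
        run_lengths.append(sum(1 for _ in group))
    return run_values, run_lengths


def rle_decode(run_values, run_lengths):
    """Expand (run_values, run_lengths) back into the original sequence."""
    decoded = []
    for value, length in zip(run_values, run_lengths):
        decoded.extend([value] * length)
    return decoded
```

Under this sketch the lengths list has exactly one integer per run, which matches the "a signed 32-bit integer for each run" wording suggested above.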
   


