Posted to dev@arrow.apache.org by Antoine Pitrou <an...@python.org> on 2023/01/09 16:00:59 UTC

Re: [VOTE] Add RLE Arrays to Arrow Format

I've commented on the PR. I'm +1 on the principle and on the proposed 
format / layout additions.

Regards

Antoine.


On 14/12/2022 at 17:27, Matt Topol wrote:
> Hello,
> 
> I'd like to propose adding the RLE type based on earlier discussions[1][2]
> to the Arrow format:
> - Columnar Format description:
> https://github.com/apache/arrow/pull/13333/files#diff-8b68cf6859e881f2357f5df64bb073135d7ff6eeb51f116418660b3856564c60
> - Flatbuffers changes:
> https://github.com/apache/arrow/pull/14176/files#diff-e54b4f5d2d279acc5d1df5df9a7636f0142a8041fe02f07034e0d8be48444b07
> 
> There is a proposed implementation available in both C++ (written by Tobias
> Zagorni) and Go[3][4]. Both implementations have mostly the same tests
> implemented and were tested to be compatible over IPC with an archery test.
> In both cases, the implementations are split out among several Draft PRs so
> that they can be easily reviewed piecemeal if the vote is approved, with
> each Draft PR including the changes of the one before it. The links
> provided are the Draft PRs with the entirety of the changes included.
> 
> The vote will be open for at least 72 hours.
> 
> [ ] +1 add the proposed RLE type to the Apache Arrow format
> [ ] -1 do not add the proposed RLE type to the Apache Arrow format
> because...
> 
> Thanks much, and please let me know if any more information or links are
> needed (I've never proposed a vote before on here!)
> 
> --Matt
> 
> [1] https://lists.apache.org/thread/bfz3m5nyf7flq7n6q9b1bx3jhcn4wq29
> [2] https://lists.apache.org/thread/xb7c723csrtwt0md3m4p56bt0193n7jq
> [3] https://github.com/apache/arrow/pull/14179
> [4] https://github.com/apache/arrow/pull/14223
> 

Re: [VOTE] Add RLE Arrays to Arrow Format

Posted by Matt Topol <zo...@gmail.com>.
Thanks Antoine! I'll go respond to your comments now!
