Posted to dev@drill.apache.org by GitBox <gi...@apache.org> on 2019/11/15 20:33:18 UTC

[GitHub] [drill] paul-rogers opened a new pull request #1899: DRILL-7445: Create batch copier based on result set framework

URL: https://github.com/apache/drill/pull/1899
 
 
   The result set framework now provides both a reader and writer.
   This PR provides a copier that copies batches using this
   framework. Such a copier can:
   
   - Copy selected records
   - Copy all records, such as for an SV2 or SV4
   
   The copier uses the result set loader to create uniformly-sized
   output batches from input batches of any size. It does this
   by merging or splitting input batches as needed.
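
As a rough illustration of the merge-or-split idea, here is a minimal sketch in plain Java. It is not the result set loader: `BatchResizer` and `resize` are hypothetical names, batches are modeled as lists of integers, and "size" is assumed to mean a row count.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Drill's API: regroups rows from input batches of
// arbitrary size into output batches of a fixed target row count.
public class BatchResizer {

  public static List<List<Integer>> resize(List<List<Integer>> inputs, int targetRows) {
    if (targetRows <= 0) {
      throw new IllegalArgumentException("targetRows must be positive");
    }
    List<List<Integer>> outputs = new ArrayList<>();
    List<Integer> current = new ArrayList<>();
    for (List<Integer> batch : inputs) {
      for (Integer row : batch) {
        current.add(row);
        // A full output batch is emitted; the rest of the input batch
        // spills into the next output batch (a "split").
        if (current.size() == targetRows) {
          outputs.add(current);
          current = new ArrayList<>();
        }
      }
      // A partially filled output batch carries over to the next
      // input batch (a "merge").
    }
    if (!current.isEmpty()) {
      outputs.add(current); // Final, possibly short, batch.
    }
    return outputs;
  }
}
```

In this sketch only the last output batch can be shorter than the target; every earlier batch holds exactly `targetRows` rows.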
   
   Since the result set reader handles both SV2 and SV4s, the
   copier can filter or reorder rows based on the SV associated
   with the input batch.
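
The effect of a selection vector can be sketched as follows. This is an assumption-laden simplification: `SelectionVectorCopy` and `copySelected` are hypothetical names, and the selection vector is modeled as a plain `int[]` of row indexes into a single batch, which is closer to an SV2 than to an SV4.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Drill's API: copies only the rows named by a
// selection vector, in the order the vector lists them.
public class SelectionVectorCopy {

  public static List<String> copySelected(List<String> rows, int[] selection) {
    List<String> out = new ArrayList<>(selection.length);
    for (int index : selection) {
      // Rows absent from the vector are filtered out; the order of the
      // indexes determines the order of the output rows.
      out.add(rows.get(index));
    }
    return out;
  }
}
```

With rows `[a, b, c, d]` and a selection of `{3, 1}`, the sketch produces `[d, b]`: two rows are dropped and the remaining two are reordered.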
   
   This version assumes a single stream of input batches, and handles
   any schema changes in that input by creating output batches
   that track the input schema.
   
   Adds a "copy" method to the column writers.  This first version
   uses a simple, but perhaps slow, way to copy values: it
   materializes the value using the reader, then writes
   those materialized values using the writer. A future improvement
   can do direct buffer-to-buffer copies.
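
The "materialize, then write" approach might look roughly like the sketch below. The interfaces here are hypothetical stand-ins invented for illustration and are not the actual column reader or writer interfaces; the only point is that `copy` reads a value as an object and hands it to the writer.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Drill's API: a writer-side copy() that
// materializes the value through a reader and then writes it.
public class CopySketch {

  // Stand-in for a column reader positioned on one value.
  interface ValueReader {
    Object getObject();
  }

  // Stand-in for a column writer.
  interface ValueWriter {
    void setObject(Object value);

    // Simple but potentially slow: the value becomes a Java object first,
    // instead of being moved buffer-to-buffer.
    default void copy(ValueReader reader) {
      setObject(reader.getObject());
    }
  }

  // Writer that appends to an in-memory list, for demonstration only.
  static class ListWriter implements ValueWriter {
    final List<Object> values = new ArrayList<>();

    @Override
    public void setObject(Object value) {
      values.add(value);
    }
  }

  public static List<Object> copyAll(List<?> source) {
    ListWriter writer = new ListWriter();
    for (Object value : source) {
      writer.copy(() -> value); // The lambda acts as a one-value reader.
    }
    return writer.values;
  }
}
```

The per-value object creation in `copy` is the cost that a direct buffer-to-buffer copy would avoid.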
   
   A unit test verifies functionality for various use cases
   and data types.
