You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/05/25 22:56:00 UTC

[jira] [Commented] (DRILL-6147) Limit batch size for Flat Parquet Reader

    [ https://issues.apache.org/jira/browse/DRILL-6147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16491360#comment-16491360 ] 

ASF GitHub Bot commented on DRILL-6147:
---------------------------------------

sachouche opened a new pull request #1294: DRILL-6147: Adding Columnar Parquet Batch Sizing functionality
URL: https://github.com/apache/drill/pull/1294
 
 
   Implemented the Parquet Batching Sizing functionality for the Flat Reader; please find the associated functional and design specification [here](https://docs.google.com/document/d/1A6zFkjxnC_-9RwG4h0sI81KI5ZEvJ7HzgClCUFpB5WE/edit#heading=h.g2id8coli1bb).
   
   @parthchandra, can you please review this PR? 
   
   Thanks!
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Limit batch size for Flat Parquet Reader
> ----------------------------------------
>
>                 Key: DRILL-6147
>                 URL: https://issues.apache.org/jira/browse/DRILL-6147
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Parquet
>            Reporter: salim achouche
>            Assignee: salim achouche
>            Priority: Major
>             Fix For: 1.14.0
>
>
> The Parquet reader currently uses a hard-coded batch size limit (32k rows) when creating scan batches; there is no parameter nor any logic for controlling the amount of memory used. This enhancement will allow Drill to take an extra input parameter to control direct memory usage.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)