Posted to github@arrow.apache.org by "alamb (via GitHub)" <gi...@apache.org> on 2023/05/22 19:41:08 UTC

[GitHub] [arrow-rs] alamb commented on issue #3871: Avoid Buffering Arrow Data for Entire Row Group in parquet::ArrowWriter

alamb commented on issue #3871:
URL: https://github.com/apache/arrow-rs/issues/3871#issuecomment-1557840581

   This ticket will improve https://github.com/influxdata/influxdb_iox/issues/7783 -- thank you for filing it. 
    
   As part of this feature, I would like to request a user-definable, best-effort limit on how much memory the parquet writer will buffer (so that flushing is a function of both "max_row_group_size" and "buffer_limit").
   
   If for some reason that is not possible or advisable, exposing the currently buffered size would be OK too (so external users can implement the buffer limiting themselves).
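
A minimal sketch of the flush policy described above, written against a hypothetical trait rather than the real `parquet::ArrowWriter`. The names `BufferingWriter`, `MockWriter`, `LimitedWriter`, `buffered_rows`, and `buffered_bytes` are assumptions made up for illustration. Only "max_row_group_size" and "buffer_limit" come from the comment itself. The limit is best effort: it is checked after each write, so the buffer can overshoot by up to one batch.

```rust
/// Hypothetical interface for a writer that buffers data before
/// flushing a row group. Not part of the parquet crate.
trait BufferingWriter {
    fn write(&mut self, rows: usize, bytes: usize);
    fn buffered_rows(&self) -> usize;
    fn buffered_bytes(&self) -> usize;
    fn flush(&mut self);
}

/// Stand-in writer that only tracks counters, so the policy can be
/// exercised without any real encoding.
#[derive(Default)]
struct MockWriter {
    rows: usize,
    bytes: usize,
    flushes: usize,
}

impl BufferingWriter for MockWriter {
    fn write(&mut self, rows: usize, bytes: usize) {
        self.rows += rows;
        self.bytes += bytes;
    }
    fn buffered_rows(&self) -> usize {
        self.rows
    }
    fn buffered_bytes(&self) -> usize {
        self.bytes
    }
    fn flush(&mut self) {
        // Pretend a row group was written out and the buffer released.
        self.rows = 0;
        self.bytes = 0;
        self.flushes += 1;
    }
}

/// Wrapper that flushes when either the row count or the buffered
/// byte size reaches its limit.
struct LimitedWriter<W: BufferingWriter> {
    inner: W,
    max_row_group_size: usize,
    buffer_limit: usize,
}

impl<W: BufferingWriter> LimitedWriter<W> {
    fn new(inner: W, max_row_group_size: usize, buffer_limit: usize) -> Self {
        Self { inner, max_row_group_size, buffer_limit }
    }

    fn write(&mut self, rows: usize, bytes: usize) {
        self.inner.write(rows, bytes);
        // Best effort: the check runs after the write, so the buffer
        // may briefly exceed the limit by one batch.
        if self.inner.buffered_rows() >= self.max_row_group_size
            || self.inner.buffered_bytes() >= self.buffer_limit
        {
            self.inner.flush();
        }
    }
}
```

If the writer only exposed its currently buffered size, the same check could live in caller code, which is the fallback the comment asks for.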

