You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "StuartHadfield (via GitHub)" <gi...@apache.org> on 2023/02/21 13:27:37 UTC

[GitHub] [arrow] StuartHadfield commented on issue #34264: [Python] Control file size writing Parquet files

StuartHadfield commented on issue #34264:
URL: https://github.com/apache/arrow/issues/34264#issuecomment-1438491820

   Ah, yeah, so I get that you can control rows - I was wondering if there was some way I could tell the writer "Okay once you have written `n` bytes, flush" or something.
   
   I guess I could still make a rough inference on bytes-per-row if I know the total filesize, and the number of rows in the dataset as a whole.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org