Posted to github@arrow.apache.org by "tustvold (via GitHub)" <gi...@apache.org> on 2023/02/09 15:44:59 UTC

[GitHub] [arrow-rs] tustvold commented on issue #3680: Should Parquet pages begin on the start of a row?

tustvold commented on issue #3680:
URL: https://github.com/apache/arrow-rs/issues/3680#issuecomment-1424405441

   I'm not sure it is a bug per se, but I definitely think the APIs shouldn't split rows across pages unilaterally, as they currently do.
   
   I would support making `GenericColumnWriter::write_batch_internal` call `write_mini_batch` with `WriterProperties::write_batch_size` treated as a number of rows rather than a number of levels. To be honest, this is probably what most people assume it does anyway. I'll try to get something up today.
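
To illustrate the difference between counting rows and counting levels, here is a rough sketch of how a batch could be cut only at row boundaries. It is not the arrow-rs implementation: `row_aligned_chunks` and `rows_per_batch` are hypothetical names, and the only assumption taken from Parquet itself is that a repetition level of 0 marks the first level of a new row.

```rust
use std::ops::Range;

/// Hypothetical helper, not part of the parquet crate.
///
/// Splits a slice of repetition levels into index ranges, each covering at
/// most `rows_per_batch` complete rows. A repetition level of 0 is assumed
/// to mark the first level of a new row, so every range starts on a row
/// boundary and no row is split across two ranges.
fn row_aligned_chunks(rep_levels: &[i16], rows_per_batch: usize) -> Vec<Range<usize>> {
    assert!(rows_per_batch > 0, "rows_per_batch must be non-zero");

    let mut chunks = Vec::new();
    let mut start = 0; // index of the first level in the current chunk
    let mut rows_in_chunk = 0; // rows started since `start`

    for (i, &level) in rep_levels.iter().enumerate() {
        if level == 0 {
            // A new row begins at index `i`. If the current chunk is
            // already full, close it just before this row.
            if rows_in_chunk == rows_per_batch {
                chunks.push(start..i);
                start = i;
                rows_in_chunk = 0;
            }
            rows_in_chunk += 1;
        }
    }

    // Emit whatever is left as the final (possibly short) chunk.
    if start < rep_levels.len() {
        chunks.push(start..rep_levels.len());
    }
    chunks
}
```

For example, the levels `[0, 1, 1, 0, 0, 1]` describe three rows of a list column with 3, 1 and 2 values. With `rows_per_batch = 2` the helper yields the ranges `0..4` and `4..6`, whereas a cut after every two levels would land in the middle of the first row.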
   
   

