Posted to dev@parquet.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/11/29 21:46:00 UTC

[jira] [Commented] (PARQUET-869) Min/Max record counts for block size checks are not configurable

    [ https://issues.apache.org/jira/browse/PARQUET-869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17240374#comment-17240374 ] 

ASF GitHub Bot commented on PARQUET-869:
----------------------------------------

livelace commented on pull request #470:
URL: https://github.com/apache/parquet-mr/pull/470#issuecomment-735463112


   Well, I'm glad I found this bug before I started saving images into Parquet files.




> Min/Max record counts for block size checks are not configurable
> ----------------------------------------------------------------
>
>                 Key: PARQUET-869
>                 URL: https://issues.apache.org/jira/browse/PARQUET-869
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Pradeep Gollakota
>            Priority: Major
>
> The min/max record counts for the page size check are configurable, both via the ParquetOutputFormat.MIN_ROW_COUNT_FOR_PAGE_SIZE_CHECK and ParquetOutputFormat.MAX_ROW_COUNT_FOR_PAGE_SIZE_CHECK configs and via ParquetProperties directly. The min/max record counts for the block size check, however, are hard-coded inside InternalParquetRecordWriter.
> These two settings should also be configurable.
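
As a rough illustration of why configurable bounds matter for large records such as images, here is a minimal sketch of a block size check whose min/max record counts come from configuration. It is not the parquet-mr implementation: the class name, the property keys, the fallback values, and the scheduling formula are all assumptions made up for this example.

```java
import java.util.Map;

/** Simplified illustration only; NOT the InternalParquetRecordWriter code. */
final class BlockSizeCheckSketch {
    // Hypothetical property names invented for this sketch.
    static final String MIN_KEY = "example.block.size.row.check.min";
    static final String MAX_KEY = "example.block.size.row.check.max";

    // Assumed fallback values, used when a property is absent.
    static final long DEFAULT_MIN = 100L;
    static final long DEFAULT_MAX = 10_000L;

    final long minRecords;
    final long maxRecords;

    BlockSizeCheckSketch(Map<String, String> conf) {
        this.minRecords = Long.parseLong(conf.getOrDefault(MIN_KEY, String.valueOf(DEFAULT_MIN)));
        this.maxRecords = Long.parseLong(conf.getOrDefault(MAX_KEY, String.valueOf(DEFAULT_MAX)));
        if (minRecords < 1 || maxRecords < minRecords) {
            throw new IllegalArgumentException("expected 1 <= min <= max");
        }
    }

    /**
     * Returns the record count at which the buffered size should next be
     * compared against the target block size.
     */
    long nextCheckAt(long recordCount, long bufferedBytes, long targetBlockBytes) {
        // Average record size so far, guarded against division by zero.
        long avgRecordBytes = Math.max(1L, bufferedBytes / Math.max(1L, recordCount));
        // Estimated number of records that fit in one block.
        long projectedRecords = targetBlockBytes / avgRecordBytes;
        // Aim halfway between the current count and the projected limit.
        long halfway = (recordCount + projectedRecords) / 2;
        // Never check before minRecords, and never wait more than maxRecords further.
        return Math.min(Math.max(minRecords, halfway), recordCount + maxRecords);
    }

    public static void main(String[] args) {
        long tenMiB = 10L * 1024 * 1024;
        long target = 128L * 1024 * 1024;
        // Ten buffered records of 10 MiB each, e.g. images.
        System.out.println(new BlockSizeCheckSketch(Map.of()).nextCheckAt(10, 10 * tenMiB, target));
        System.out.println(new BlockSizeCheckSketch(Map.of(MIN_KEY, "1")).nextCheckAt(10, 10 * tenMiB, target));
    }
}
```

With the assumed fallback minimum of 100, the sketch schedules the next check at record 100 even though roughly 12 records of 10 MiB already fill a 128 MiB block. Lowering the minimum to 1 moves the next check to record 11.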


