Posted to dev@parquet.apache.org by "Gabor Szadovszky (Jira)" <ji...@apache.org> on 2021/09/13 10:24:00 UTC

[jira] [Commented] (PARQUET-2088) Different created_by field values for application and library

    [ https://issues.apache.org/jira/browse/PARQUET-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17414092#comment-17414092 ] 

Gabor Szadovszky commented on PARQUET-2088:
-------------------------------------------

Could you please list exactly which features you think parquet-mr enables or disables based on {{created_by}}? This field is used by the actual writer implementations (e.g. Impala, parquet-mr, parquet-cpp). The example already explains how to use it: {{impala version 1.0 (build 6cf94d29b2b7115df4de2c06e2ab4326d721eb55)}}
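
For illustration only, here is a minimal sketch of how a reader could split a value of that shape into its parts. It assumes the value follows the pattern "<application> version <version> (build <hash>)" seen in the example above. The regular expression, the {{CreatedBy}} tuple and {{parse_created_by}} are hypothetical names made up for this sketch; this is not the parser parquet-mr actually uses.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical pattern for "<application> version <version> (build <hash>)".
# The "(build ...)" part is treated as optional in this sketch.
_CREATED_BY = re.compile(
    r"^(?P<application>.+?)\s+version\s+(?P<version>[^\s(]+)"
    r"(?:\s*\(build\s+(?P<build>[^)]+)\))?\s*$"
)


class CreatedBy(NamedTuple):
    application: str      # name of the writer implementation, e.g. "impala"
    version: str          # version of that writer, e.g. "1.0"
    build: Optional[str]  # build hash, or None when absent


def parse_created_by(value: str) -> Optional[CreatedBy]:
    """Split a created_by string into its parts; return None if it does not match."""
    match = _CREATED_BY.match(value)
    if match is None:
        return None
    return CreatedBy(match["application"], match["version"], match["build"])


if __name__ == "__main__":
    example = "impala version 1.0 (build 6cf94d29b2b7115df4de2c06e2ab4326d721eb55)"
    print(parse_created_by(example))
```

A reader that wants to branch on the writer would look at the {{application}} and {{version}} parts together, which is why a single value naming the actual writer implementation is enough for that purpose.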

> Different created_by field values for application and library
> -------------------------------------------------------------
>
>                 Key: PARQUET-2088
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2088
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>    Affects Versions: format-2.9.0
>            Reporter: Joshua Howard
>            Priority: Minor
>
> There seems to be a discrepancy in how the Parquet format's created_by field should be filled out. The parquet-mr library uses this value to enable/disable features based on the parquet-mr version [here|#L64-L68]. Meanwhile, users are encouraged to put the application version in it [here|https://www.javadoc.io/doc/org.apache.parquet/parquet-format/latest/org/apache/parquet/format/FileMetaData.html]. It seems that separate fields are needed for the application version and the library version.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)