You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/04/04 19:50:00 UTC
[jira] [Commented] (PARQUET-2006) Column resolution by ID
[ https://issues.apache.org/jira/browse/PARQUET-2006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17517072#comment-17517072 ]
ASF GitHub Bot commented on PARQUET-2006:
-----------------------------------------
rdblue commented on code in PR #950:
URL: https://github.com/apache/parquet-mr/pull/950#discussion_r842085396
##########
parquet-column/src/main/java/org/apache/parquet/column/ColumnDescriptor.java:
##########
@@ -70,7 +71,20 @@ public ColumnDescriptor(String[] path, PrimitiveTypeName type,
* @param maxDef the maximum definition level for that path
*/
public ColumnDescriptor(String[] path, PrimitiveType type, int maxRep, int maxDef) {
+ this(path, null, type, maxRep, maxDef);
+ }
+
+ /**
+ * @param path the path to the leaf field in the schema
+ * @param id the id to the leaf field in the schema
+ * @param type the type of the field
+ * @param maxRep the maximum repetition level for that path
+ * @param maxDef the maximum definition level for that path
+ * @deprecated will be removed in 2.0.0; Use {@link #ColumnDescriptor(String[], PrimitiveType, int, int)}
Review Comment:
Is this correct? I would expect to deprecate the old one.
> Column resolution by ID
> -----------------------
>
> Key: PARQUET-2006
> URL: https://issues.apache.org/jira/browse/PARQUET-2006
> Project: Parquet
> Issue Type: New Feature
> Components: parquet-mr
> Reporter: Xinli Shang
> Assignee: Xinli Shang
> Priority: Major
>
> Parquet relies on the name. In a lot of usages e.g. schema resolution, this would be a problem. Iceberg uses ID and stored Id/name mappings.
> This Jira is to add column ID resolution support.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)