You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Dongjoon Hyun (Jira)" <ji...@apache.org> on 2021/02/15 01:25:00 UTC

[jira] [Commented] (PARQUET-1809) Add new APIs for nested predicate pushdown

    [ https://issues.apache.org/jira/browse/PARQUET-1809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284525#comment-17284525 ] 

Dongjoon Hyun commented on PARQUET-1809:
----------------------------------------

Hi, All.

According to the discussion history and the long pause (one year), it looks like being rejected due to the potential collision. 

Since Apache Parquet 1.12.0 RCs also do not have this, is this a final conclusion?

>  Add new APIs for nested predicate pushdown
> -------------------------------------------
>
>                 Key: PARQUET-1809
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1809
>             Project: Parquet
>          Issue Type: New Feature
>          Components: parquet-mr
>            Reporter: DB Tsai
>            Priority: Major
>
> Currently, Parquet's *org.apache.parquet.filter2.predicate.FilterApi* is using *dot* to split the column name into multi-parts of nested fields. The drawback is that this causes issues when the field name contains *dot*.
> The new APIs that will be added will take array of string directly for multi-parts of nested fields, so no confusion as using *dot* as a separator.  
> See https://github.com/apache/spark/pull/27728 and [SPARK-17636] for details.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)