You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Aitozi (Jira)" <ji...@apache.org> on 2022/11/10 12:45:00 UTC

[jira] [Commented] (FLINK-25113) Cleanup from Parquet and Orc the partition key handling logic

    [ https://issues.apache.org/jira/browse/FLINK-25113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17631632#comment-17631632 ] 

Aitozi commented on FLINK-25113:
--------------------------------

hi [~slinkydeveloper], I created a preceding ticket to improve the hive source to handle the partition keys. I'd like to work on it, can you help assign the ticket to me ?

> Cleanup from Parquet and Orc the partition key handling logic
> -------------------------------------------------------------
>
>                 Key: FLINK-25113
>                 URL: https://issues.apache.org/jira/browse/FLINK-25113
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
>            Reporter: Francesco Guardiani
>            Priority: Major
>
> After https://issues.apache.org/jira/browse/FLINK-24617 the partition key handling logic is encapsuled within {{FileInfoExtractorBulkFormat}}. We should cleanup this logic from orc and parquet formats, in order to simplify it. Note: Hive still depends on this logic, but it should rather use {{FileInfoExtractorBulkFormat}} or similar.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)