You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Aitozi (Jira)" <ji...@apache.org> on 2022/11/10 12:45:00 UTC
[jira] [Commented] (FLINK-25113) Cleanup from Parquet and Orc the partition key handling logic
[ https://issues.apache.org/jira/browse/FLINK-25113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17631632#comment-17631632 ]
Aitozi commented on FLINK-25113:
--------------------------------
hi [~slinkydeveloper], I created a preceding ticket to improve the hive source to handle the partition keys. I'd like to work on it, can you help assign the ticket to me ?
> Cleanup from Parquet and Orc the partition key handling logic
> -------------------------------------------------------------
>
> Key: FLINK-25113
> URL: https://issues.apache.org/jira/browse/FLINK-25113
> Project: Flink
> Issue Type: Sub-task
> Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
> Reporter: Francesco Guardiani
> Priority: Major
>
> After https://issues.apache.org/jira/browse/FLINK-24617 the partition key handling logic is encapsuled within {{FileInfoExtractorBulkFormat}}. We should cleanup this logic from orc and parquet formats, in order to simplify it. Note: Hive still depends on this logic, but it should rather use {{FileInfoExtractorBulkFormat}} or similar.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)