Posted to commits@hudi.apache.org by "Manoj Govindassamy (Jira)" <ji...@apache.org> on 2022/01/26 00:44:00 UTC
[jira] [Created] (HUDI-3324) Query Integration: Support returning file names matching the given columns and ranges
Manoj Govindassamy created HUDI-3324:
----------------------------------------
Summary: Query Integration: Support returning file names matching the given columns and ranges
Key: HUDI-3324
URL: https://issues.apache.org/jira/browse/HUDI-3324
Project: Apache Hudi
Issue Type: Task
Components: writer-core
Reporter: Manoj Govindassamy
Assignee: Manoj Govindassamy
Fix For: 0.11.0
Today, the ColumnStats keys are in IndexID format: hash IDs derived from the column name, partition name, and file name. For query integration, an aggregate query needs to return the names of the files that contain the requested ranges, but a file name cannot be recovered from its hash ID. There are two ways of solving this:
# We can store the file name directly in the key for now
# Add reverse lookup from FileIndexID to FileName
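
To make the two options concrete, here is a minimal sketch in Python. It is not Hudi code: the `index_id` hash, the tuple key layout, and the `ColumnStatsIndex` class are all hypothetical stand-ins chosen for illustration. It only shows how either option lets a range lookup return file names instead of hash IDs.

```python
import hashlib


def index_id(name: str) -> str:
    """Hypothetical stand-in for an IndexID: a fixed-length hash of a name."""
    return hashlib.sha256(name.encode("utf-8")).hexdigest()[:16]


class ColumnStatsIndex:
    """Toy column-stats index keyed by (column ID, partition ID, file part)."""

    def __init__(self, store_file_name_in_key: bool):
        # True  -> option 1: the key carries the plain file name.
        # False -> option 2: the key carries a file ID, resolved by reverse lookup.
        self.store_file_name_in_key = store_file_name_in_key
        self.stats = {}       # key -> (min_value, max_value)
        self.file_names = {}  # file ID -> file name (used by option 2 only)

    def put(self, column, partition, file_name, min_value, max_value):
        if self.store_file_name_in_key:
            file_part = file_name
        else:
            file_part = index_id(file_name)
            self.file_names[file_part] = file_name
        key = (index_id(column), index_id(partition), file_part)
        self.stats[key] = (min_value, max_value)

    def files_matching(self, column, low, high):
        """Return names of files whose [min, max] range overlaps [low, high]."""
        column_id = index_id(column)
        matches = []
        for (col, _partition, file_part), (lo, hi) in self.stats.items():
            if col == column_id and lo <= high and hi >= low:
                if self.store_file_name_in_key:
                    matches.append(file_part)
                else:
                    matches.append(self.file_names[file_part])
        return sorted(matches)


def build_index(store_file_name_in_key: bool) -> ColumnStatsIndex:
    # Sample data is made up for the example.
    index = ColumnStatsIndex(store_file_name_in_key)
    index.put("price", "2022/01/25", "file-a.parquet", 0, 10)
    index.put("price", "2022/01/25", "file-b.parquet", 50, 90)
    return index


if __name__ == "__main__":
    for option in (True, False):
        print(build_index(option).files_matching("price", 5, 20))
```

Both variants return the same file names for the same query. The trade-off in this sketch is that option 1 gives up fixed-length keys, while option 2 keeps them at the cost of maintaining a second mapping.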
--
This message was sent by Atlassian Jira
(v8.20.1#820001)