Posted to commits@hudi.apache.org by "Manoj Govindassamy (Jira)" <ji...@apache.org> on 2022/01/26 00:44:00 UTC

[jira] [Created] (HUDI-3324) Query Integration: Support returning file names matching the given columns and ranges

Manoj Govindassamy created HUDI-3324:
----------------------------------------

             Summary: Query Integration: Support returning file names matching the given columns and ranges
                 Key: HUDI-3324
                 URL: https://issues.apache.org/jira/browse/HUDI-3324
             Project: Apache Hudi
          Issue Type: Task
          Components: writer-core
            Reporter: Manoj Govindassamy
            Assignee: Manoj Govindassamy
             Fix For: 0.11.0


Today the ColumnStats keys are in IndexID format: hash IDs derived from the column name, partition name, and file name. For query integration, an aggregate query needs to return the names of the files that contain the given ranges, but a file name cannot be recovered from its hash ID. There are two ways of solving this:
 # We can store the file name directly in the key for now
 # Add reverse lookup from FileIndexID to FileName
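
To make the two options concrete, here is a minimal sketch. It is not Hudi code. The names index_id, hashed_key, key_with_file_name, file_name_from_key and FileNameLookup are hypothetical. A truncated SHA-256 hex digest stands in for whatever hash the real IndexID uses, and the key layout (column ID, then partition ID, then file part) is assumed for illustration only.

```python
import hashlib

ID_HEX_LEN = 16  # assumed fixed width of one hash ID, in hex characters


def index_id(name: str) -> str:
    """Hypothetical stand-in for an IndexID: a fixed-width hash of a name."""
    return hashlib.sha256(name.encode("utf-8")).hexdigest()[:ID_HEX_LEN]


def hashed_key(column: str, partition: str, file_name: str) -> str:
    """Key built only from hash IDs (assumed layout); the file name is not recoverable."""
    return index_id(column) + index_id(partition) + index_id(file_name)


# Option 1: store the file name directly in the key.
def key_with_file_name(column: str, partition: str, file_name: str) -> str:
    """Hash the column and partition, but keep the file name readable."""
    return index_id(column) + index_id(partition) + file_name


def file_name_from_key(key: str) -> str:
    """Strip the two fixed-width hash IDs to get the file name back."""
    return key[2 * ID_HEX_LEN:]


# Option 2: keep fully hashed keys and add a reverse lookup.
class FileNameLookup:
    """Maps a file's hash ID back to its file name."""

    def __init__(self) -> None:
        self._by_id: dict[str, str] = {}

    def register(self, file_name: str) -> str:
        file_id = index_id(file_name)
        self._by_id[file_id] = file_name
        return file_id

    def resolve(self, key: str) -> str:
        """Take the trailing file ID from a fully hashed key and look it up."""
        return self._by_id[key[-ID_HEX_LEN:]]
```

In this sketch, option 1 needs no extra state but makes keys variable-length, while option 2 keeps keys fixed-width at the cost of maintaining and consulting a second mapping.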



--
This message was sent by Atlassian Jira
(v8.20.1#820001)