Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2020/04/28 02:56:00 UTC

[jira] [Created] (IMPALA-9703) Skip loading partition meta and file meta for PB scale tables

Quanlong Huang created IMPALA-9703:
--------------------------------------

             Summary: Skip loading partition meta and file meta for PB scale tables
                 Key: IMPALA-9703
                 URL: https://issues.apache.org/jira/browse/IMPALA-9703
             Project: IMPALA
          Issue Type: New Feature
          Components: Catalog
            Reporter: Quanlong Huang


PB-scale tables with more than 100K partitions may hit catalog limitations. Caching all of their partitions is also wasteful, since queries usually need only a few of them. Queries that scan all partitions will probably fail with resource limit errors anyway, so they are out of scope here.

This JIRA tracks the work to skip caching partition metadata for such tables. Catalogd will cache only the HmsTable object and the partition list (partition names, e.g. "p1=a/p2=b", and the internal partition ids generated by Impala). Coordinators fetch partition metadata on demand when compiling queries.
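
To make the proposal concrete, here is a minimal Python sketch of the idea, not Impala's actual (Java) catalog code. All names (PartitionRef, LazyPartitionedTable, fetch_meta, prune, metadata_for) are hypothetical. The sketch also assumes that a coordinator can prune partitions from their names alone and that it memoizes whatever it fetches; neither detail is stated in this issue.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass(frozen=True)
class PartitionRef:
    """Lightweight entry kept for every partition (hypothetical type)."""
    partition_id: int  # internal id, generated by the catalog
    name: str          # partition name, e.g. "p1=a/p2=b"


def parse_partition_name(name: str) -> Dict[str, str]:
    """Split "p1=a/p2=b" into {"p1": "a", "p2": "b"}."""
    return dict(kv.split("=", 1) for kv in name.split("/"))


@dataclass
class LazyPartitionedTable:
    """Holds only the partition list; full metadata is loaded on demand."""
    table_name: str
    partition_list: List[PartitionRef]
    # Hypothetical callback standing in for the on-demand metadata fetch.
    fetch_meta: Callable[[PartitionRef], dict]
    # Assumption: fetched metadata is memoized by partition id.
    _loaded: Dict[int, dict] = field(default_factory=dict, init=False)

    def prune(self, predicate: Callable[[Dict[str, str]], bool]) -> List[PartitionRef]:
        """Select partitions using only their names; no metadata is fetched."""
        return [p for p in self.partition_list
                if predicate(parse_partition_name(p.name))]

    def metadata_for(self, refs: List[PartitionRef]) -> List[dict]:
        """Fetch metadata only for the given partitions, once per partition."""
        for ref in refs:
            if ref.partition_id not in self._loaded:
                self._loaded[ref.partition_id] = self.fetch_meta(ref)
        return [self._loaded[ref.partition_id] for ref in refs]


# Usage: only the two partitions matching p1=a trigger a fetch.
fetched_names: List[str] = []

def fake_fetch(ref: PartitionRef) -> dict:
    fetched_names.append(ref.name)
    return {"name": ref.name, "files": []}

table = LazyPartitionedTable(
    "big_table",
    [PartitionRef(1, "p1=a/p2=b"),
     PartitionRef(2, "p1=a/p2=c"),
     PartitionRef(3, "p1=z/p2=b")],
    fake_fetch,
)
needed = table.prune(lambda kv: kv["p1"] == "a")
table.metadata_for(needed)
```

The point of the sketch is that the cost of holding the table scales with the length of the partition list, while the cost of planning a query scales with the number of partitions that survive pruning.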


