Posted to issues@impala.apache.org by "Ben Breakstone (JIRA)" <ji...@apache.org> on 2017/08/24 15:17:00 UTC

[jira] [Created] (IMPALA-5835) Severe slowdown in catalogd startup after 2.1 → 2.5 upgrade with > 200,000 databases

Ben Breakstone created IMPALA-5835:
--------------------------------------

             Summary: Severe slowdown in catalogd startup after 2.1 → 2.5 upgrade with > 200,000 databases
                 Key: IMPALA-5835
                 URL: https://issues.apache.org/jira/browse/IMPALA-5835
             Project: IMPALA
          Issue Type: Bug
    Affects Versions: Impala 2.5.5
            Reporter: Ben Breakstone


After an upgrade from Impala 2.1 (CDH 5.3.9) to Impala 2.5 (CDH 5.7.5), starting up Catalog Server takes around eight to ten hours. It took around twenty minutes before the upgrade. 

There are over 200,000 databases in use. Throughout the hours-long startup, the catalogd log shows 
"Loading native functions for database..." and then 
"Loading Java functions for database..." for each database. Based on this, it appears that the introduction of persistent UDFs/UDAs is causing the slowdown.

Only one of the databases actually has any UDFs defined. num_metadata_loading_threads is set to 64. Background loading of metadata is disabled. 
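
For scale, the startup times above can be turned into a rough per-database cost. This is only a back-of-the-envelope sketch: it assumes exactly 200,000 databases (the report says "over 200,000", so the real per-database figures are somewhat lower) and that startup time is spread evenly across databases. The function name per_database_ms is made up for illustration.

```python
# Back-of-the-envelope estimate of startup cost per database.
# Assumption: exactly 200,000 databases; the report says "over 200,000",
# so these per-database figures are upper bounds.
DATABASE_COUNT = 200_000


def per_database_ms(total_minutes, database_count=DATABASE_COUNT):
    """Average startup time per database, in milliseconds."""
    return total_minutes * 60 * 1000 / database_count


before_ms = per_database_ms(20)           # about 20 minutes on Impala 2.1
after_low_ms = per_database_ms(8 * 60)    # about 8 hours on Impala 2.5
after_high_ms = per_database_ms(10 * 60)  # about 10 hours on Impala 2.5

print(f"before upgrade: {before_ms:.0f} ms per database")
print(f"after upgrade:  {after_low_ms:.0f} to {after_high_ms:.0f} ms per database")
print(f"slowdown:       {after_low_ms / before_ms:.0f}x to {after_high_ms / before_ms:.0f}x")
```

Under these assumptions the average cost goes from about 6 ms to about 144 to 180 ms per database, a 24x to 30x slowdown, even though only one database has any UDFs defined.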



