You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@sentry.apache.org by "Arjun Mishra (JIRA)" <ji...@apache.org> on 2018/09/13 00:07:00 UTC

[jira] [Created] (SENTRY-2401) FullUpdateInitializer should do a single fetch from HMS instead of 1 per task

Arjun Mishra created SENTRY-2401:
------------------------------------

             Summary: FullUpdateInitializer should do a single fetch from HMS instead of 1 per task
                 Key: SENTRY-2401
                 URL: https://issues.apache.org/jira/browse/SENTRY-2401
             Project: Sentry
          Issue Type: New Feature
          Components: Sentry
            Reporter: Arjun Mishra
            Assignee: Arjun Mishra


This is an optimization ticket. Currently the way we do fetch is 
# Get all database names from HMS. Create a Db Task for each database name. HMS Request = 1. 
# For each database task
## Get all database objects from HMS
## Get all table names, for each database
## Create a table task for "maxTablesPerCall" tables
# For each table task
##  For each table task get all table objects
## For each "maxTablesPerCall" tables fetched, get partition names
## For each ""maxTablesPerCall" tables fetched, created partition tasks for "maxPartitionsPerCall" partitions
# For each partition task
## Get all partition objects for,  database, table, and list of partition names

Instead of splitting connecting to HMS and getting bits of information, it is better to get them all in a single request



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)