You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tajo.apache.org by "JaeHwa Jung (JIRA)" <ji...@apache.org> on 2013/12/19 06:29:09 UTC

[jira] [Created] (TAJO-433) Improve integration with Hive

JaeHwa Jung created TAJO-433:
--------------------------------

             Summary: Improve integration with Hive
                 Key: TAJO-433
                 URL: https://issues.apache.org/jira/browse/TAJO-433
             Project: Tajo
          Issue Type: Sub-task
          Components: catalog
    Affects Versions: 0.8-incubating
            Reporter: JaeHwa Jung
            Assignee: JaeHwa Jung
             Fix For: 0.8-incubating


Hi, guys~ 

I wish to discuss about HCatalogStore. current Tajo uses HCatalog to integrate catalog with hive. So, users must run HiveMetaStoreServer because HCatalog connect HiveMetaStoreServer. 

But at this point, it needs to make improvements.
- First is HiveMetaStoreServer. if it can fall down or it performance bottlenecks, it will be a fatal problem for Tajo.
- Second is HCatalog API. It doesn't provide API for DDL. So, Tajo use HCatalog API and HiveMetaStoreClient API. 

So, I wish to remove HiveMetaStoreServer dependency and integrate catalog with Hive using HiveMetaStoreClient.  And I wish to use HiveMetaStoreClient API instead of HCatalog API.  I already checked HiveMetaStoreClient to connect HiveMetaStore without HiveMetaStoreServer.

if HiveMetaStoreClient can't connect HiveMetaStore, Tajo try to connect HiveMetaStoreServer as backup policy.

What do you think about this suggestion? :)



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)