Posted to dev@tajo.apache.org by "JaeHwa Jung (JIRA)" <ji...@apache.org> on 2013/11/11 05:49:17 UTC

[jira] [Updated] (TAJO-16) Enable Tajo catalog to access Hive metastore.

     [ https://issues.apache.org/jira/browse/TAJO-16?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

JaeHwa Jung updated TAJO-16:
----------------------------

    Attachment: TAJO-16.patch

I'm so glad to upload this patch.

From now on, Tajo can connect to HiveMetaStore and read the data stored on HDFS through it. The current version has the following limitations:

- Only the text file format is supported.
- Only uncompressed data is supported.

I'll implement the unsupported features soon.

If you want to use HiveMetaStore as the CatalogServer, you have to start the HiveMetaStore server as follows:
{code:xml}
$HIVE_HOME/bin/hive --service metastore
{code}
 
Or you can start the HiveMetaStore server through HCatalog as follows:
{code:xml}
$HCATALOG_HOME/sbin/hcat_server.sh start
{code}
When you start the HCatalog server, it actually starts the HiveMetaStore server for you.
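
Before pointing Tajo at the metastore, it can help to confirm that something is listening on the metastore port. The following is only a minimal sketch in Python, not part of the patch. The function name `metastore_reachable` is hypothetical, and `localhost:10001` simply mirrors the example address used in the catalog-site.xml snippet in this message. A successful TCP connection shows that the port is open, not that the service behind it is a working HiveMetaStore.

```python
import socket


def metastore_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    This only checks that the port accepts connections; it does not
    speak the thrift protocol or validate the metastore itself.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    # Assumed address: taken from the example 'tajo.catalog.uri' value.
    # Replace it with the host and port of your own metastore.
    print(metastore_reachable("localhost", 10001))
```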

And then, you have to update catalog-site.xml as follows:
{code:xml}
  <property>
    <name>tajo.catalog.store.class</name>
    <value>org.apache.tajo.catalog.store.HCatalogStore</value>
  </property>
  <property>
    <name>tajo.catalog.uri</name>
    <value>thrift://localhost:10001</value>
  </property>
{code}
'tajo.catalog.uri' is the URI of the HiveMetaStore server. You must set this property to the address of your own server.
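
As a quick sanity check of these two properties, here is a minimal Python sketch that reads them from a catalog-site.xml document and verifies that the URI has the `thrift://host:port` shape. It is not part of the patch. The helper names `read_properties` and `check_catalog_uri` are hypothetical, and the `<configuration>` root element is an assumption based on the usual Hadoop-style site file layout, since the snippet above shows only the `<property>` entries.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Assumption: the <property> entries live under a <configuration> root,
# as in Hadoop-style *-site.xml files.
SAMPLE = """<configuration>
  <property>
    <name>tajo.catalog.store.class</name>
    <value>org.apache.tajo.catalog.store.HCatalogStore</value>
  </property>
  <property>
    <name>tajo.catalog.uri</name>
    <value>thrift://localhost:10001</value>
  </property>
</configuration>"""


def read_properties(xml_text: str) -> dict:
    """Collect name/value pairs from every <property> element."""
    props = {}
    for prop in ET.fromstring(xml_text).iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


def check_catalog_uri(props: dict) -> tuple:
    """Return (host, port) from 'tajo.catalog.uri', or raise ValueError."""
    uri = urlparse(props.get("tajo.catalog.uri", ""))
    if uri.scheme != "thrift" or not uri.hostname or uri.port is None:
        raise ValueError("tajo.catalog.uri must look like thrift://host:port")
    return uri.hostname, uri.port


if __name__ == "__main__":
    props = read_properties(SAMPLE)
    print(props["tajo.catalog.store.class"])
    print(check_catalog_uri(props))
```

To check a real file, pass its contents to `read_properties` instead of `SAMPLE`.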

You can also build Tajo to match your Hive version. To build Tajo against Apache Hive 0.11.0, run:
{code:xml}
mvn clean install -Phive-0.11.0
{code}
Or, to build Tajo against hive-0.10.0 for cdh4.3.0, run:
{code:xml}
mvn clean install -Phive-0.10.0-cdh4.3.0
{code}
For other versions, you must modify the pom.xml file located at '$TAJO_HOME/tajo-catalog/tajo-catalog-server'.


> Enable Tajo catalog to access Hive metastore.
> ---------------------------------------------
>
>                 Key: TAJO-16
>                 URL: https://issues.apache.org/jira/browse/TAJO-16
>             Project: Tajo
>          Issue Type: Improvement
>            Reporter: JaeHwa Jung
>            Assignee: JaeHwa Jung
>             Fix For: 0.8-incubating
>
>         Attachments: TAJO-16.patch, TAJO-16.pdf
>
>
> Hive has been widely used in this area. Many users have maintained lots of big tables through Hive metastore. It would be great to enable Tajo catalog to access Hive metastore. Hive metastore already provides thrift API. We could employ it.



--
This message was sent by Atlassian JIRA
(v6.1#6144)