Posted to issues@kylin.apache.org by "fengYu (JIRA)" <ji...@apache.org> on 2016/08/11 03:00:26 UTC

[jira] [Commented] (KYLIN-1826) kylin support more than one hive based on different hadoop claster

    [ https://issues.apache.org/jira/browse/KYLIN-1826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15416436#comment-15416436 ] 

fengYu commented on KYLIN-1826:
-------------------------------

Sorry for the delay. First of all, the hive in ProjectInstance is used when loading hive tables, and the hive is then stored in TableDesc. The reason I use LocalThreadProject is that I want to keep the code change small: in my solution I need the project information to get the hive instance, and if I passed it as a function parameter I would have to modify a large number of functions. This is tricky, but it is the easiest way.
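
To illustrate the trade-off described above, here is a minimal sketch of the thread-local pattern. It is not the code from the attached patch: ProjectContext, withProject, currentProject and describeHiveLookup are hypothetical names, and the real LocalThreadProject may be shaped differently. The idea is that the entry point binds the project to the current thread once, and code deep in the call chain reads it without any signature change.

```java
import java.util.function.Supplier;

// Hypothetical holder for the "current project"; not the class from the patch.
final class ProjectContext {
    private static final ThreadLocal<String> CURRENT_PROJECT = new ThreadLocal<>();

    private ProjectContext() {}

    // Binds the project to the current thread while body runs,
    // then restores whatever was bound before.
    static <T> T withProject(String project, Supplier<T> body) {
        String previous = CURRENT_PROJECT.get();
        CURRENT_PROJECT.set(project);
        try {
            return body.get();
        } finally {
            if (previous == null) {
                CURRENT_PROJECT.remove();
            } else {
                CURRENT_PROJECT.set(previous);
            }
        }
    }

    // Returns the project bound to this thread, or null if none is bound.
    static String currentProject() {
        return CURRENT_PROJECT.get();
    }
}

final class ProjectContextDemo {
    // Stands in for a method deep in the call chain: it takes no project
    // parameter, yet can still find out which project it is working for.
    static String describeHiveLookup() {
        String project = ProjectContext.currentProject();
        return project == null ? "default hive" : "hive of project " + project;
    }

    public static void main(String[] args) {
        System.out.println(ProjectContext.withProject("sales", ProjectContextDemo::describeHiveLookup));
        System.out.println(describeHiveLookup());
    }
}
```

The cost of this shortcut is that the value is visible only on the thread that set it. Work handed to another thread, for example through an executor, does not see the project unless it is passed along explicitly.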

As for the metadata changes in ProjectInstance and TableDesc, I have considered compatibility: for an old project or TableDesc, the hive variable is set to null, which means the default hive is used (the hive-site.xml located in the Kylin classpath).
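
A small sketch of that compatibility rule, under stated assumptions: TableSourceInfo is a hypothetical stand-in and not Kylin's TableDesc, and the field and method names are invented for illustration. Metadata written before the change simply has no hive value, and readers treat the missing value as "use the default hive".

```java
// Hypothetical stand-in for table metadata; not Kylin's TableDesc.
final class TableSourceInfo {
    private final String tableName;
    // null for metadata saved before the hive field existed
    private final String hive;

    TableSourceInfo(String tableName, String hive) {
        this.tableName = tableName;
        this.hive = hive;
    }

    // Old metadata (hive == null) falls back to the default hive.
    boolean usesDefaultHive() {
        return hive == null;
    }

    // Human-readable description of where this table is loaded from.
    String describeSource() {
        String source = usesDefaultHive()
                ? "default hive (hive-site.xml on the classpath)"
                : "external hive '" + hive + "'";
        return tableName + " <- " + source;
    }
}

final class TableSourceInfoDemo {
    public static void main(String[] args) {
        System.out.println(new TableSourceInfo("fact_orders", null).describeSource());
        System.out.println(new TableSourceInfo("fact_clicks", "hive_cluster_b").describeSource());
    }
}
```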


> kylin support more than one hive based on different hadoop claster
> ------------------------------------------------------------------
>
>                 Key: KYLIN-1826
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1826
>             Project: Kylin
>          Issue Type: Improvement
>          Components: Environment 
>    Affects Versions: v1.5.2
>            Reporter: fengYu
>            Assignee: fengYu
>         Attachments: 0001-KYLIN-1826-support-more-hive-based-on-different-hado.patch
>
>
> Currently, Kylin supports only one hive, which is expected to be run through the 'hive' command. However, when the source data is located in more than one hive, we have to deploy more Kylin instances and more than one metastore, which is difficult to manage and may cause conflicts.
> I have been working on this recently. In our cluster there are several hive clients (with different metastores) based on different hadoop clusters. I added a new hive source type called 'external hive' in Kylin 1.5.x.
> Thanks to Kylin's plug-in architecture in 2.x, this work is easier. The main modifications are:
> 1. Add a hive root directory to the hive config file. The external hive clients live in this directory, and each hive is named after its directory.
> 2. Add the hive-site.xml file while loading hive tables.
> 3. Store the hive name in the project; one project can take only one hive as its source.
> 4. Change and add some jobs to support job building.
> I will upload my patch once I finish all my tests.
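
The directory convention in item 1 of the quoted description can be sketched as follows. This is an illustration only and not the code from the patch: ExternalHiveDirectory and its methods are hypothetical, and the conf/hive-site.xml location inside each client directory is an assumption based on the usual Hive layout. Each subdirectory of the hive root is treated as one external hive, named after the directory.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper; the patch may organize this differently.
final class ExternalHiveDirectory {
    private ExternalHiveDirectory() {}

    // Every subdirectory of hiveRoot counts as one external hive,
    // and the directory name is used as the hive name.
    static List<String> listHiveNames(Path hiveRoot) throws IOException {
        try (Stream<Path> children = Files.list(hiveRoot)) {
            return children
                    .filter(Files::isDirectory)
                    .map(p -> p.getFileName().toString())
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    // Assumed location of the client's config: <root>/<name>/conf/hive-site.xml.
    static Path hiveSiteFor(Path hiveRoot, String hiveName) {
        return hiveRoot.resolve(hiveName).resolve("conf").resolve("hive-site.xml");
    }
}
```

Because a project stores only the hive name (item 3), a lookup like hiveSiteFor is all that is needed to get from a project to the configuration of its hive.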



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)