Posted to dev@kylin.apache.org by "wangrupeng (Jira)" <ji...@apache.org> on 2019/10/15 04:01:00 UTC

[jira] [Created] (KYLIN-4194) Throw KylinConfigCannotInitException at STEP "Extract Fact Table Distinct Columns" with spark

wangrupeng created KYLIN-4194:
---------------------------------

             Summary: Throw KylinConfigCannotInitException at STEP "Extract Fact Table Distinct Columns" with spark
                 Key: KYLIN-4194
                 URL: https://issues.apache.org/jira/browse/KYLIN-4194
             Project: Kylin
          Issue Type: Bug
    Affects Versions: v2.6.4, v3.0.0-beta
            Reporter: wangrupeng
            Assignee: wangrupeng
         Attachments: kylin_spark.log

When I use Spark to extract the distinct column values of the fact table, Kylin throws an exception:

org.apache.kylin.common.KylinConfigCannotInitException: Didn't find KYLIN_CONF or KYLIN_HOME, please set one of them,

see more details in [^kylin_spark.log]. I tried CDH 5.7 and HDP 2.4; both platforms hit this problem.

It looks like an environment variable issue, but setting KYLIN_CONF or KYLIN_HOME on every YARN cluster node does not fix it.

As the log shows, the exception occurs when TrieDictionaryBuilder tries to build Dictionary<String> objects but cannot obtain a KylinConfig through the KylinConfig.getInstanceFromEnv() method.

My solution is to add code to the Spark job that resets KylinConfig.THREAD_ENV_INSTANCE from the HDFS metadata, since the Spark job's metadata is stored in HDFS. With this change the job works fine.
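
To illustrate the idea, here is a minimal, self-contained Java sketch of the pattern. It is not Kylin's actual code. The class ThreadLocalConfigSketch, the methods loadFromJobMetadata and withThreadConfig, the "source" property, and the HDFS URL are all hypothetical stand-ins, and the environment is passed in as a Map only to keep the example deterministic. It shows a lookup that fails when neither variable is set, and succeeds once a config loaded from the job metadata is installed for the current thread.

```java
import java.util.Collections;
import java.util.Map;
import java.util.Properties;
import java.util.function.Supplier;

final class ThreadLocalConfigSketch {
    // Hypothetical stand-in for a per-thread config slot such as KylinConfig.THREAD_ENV_INSTANCE.
    private static final ThreadLocal<Properties> THREAD_CONFIG = new ThreadLocal<>();

    // Simplified lookup: prefer the thread-local config, otherwise fall back to the environment.
    static Properties getInstanceFromEnv(Map<String, String> env) {
        Properties local = THREAD_CONFIG.get();
        if (local != null) {
            return local;
        }
        String home = env.containsKey("KYLIN_CONF") ? env.get("KYLIN_CONF") : env.get("KYLIN_HOME");
        if (home == null) {
            throw new IllegalStateException(
                    "Didn't find KYLIN_CONF or KYLIN_HOME, please set one of them");
        }
        Properties fromEnv = new Properties();
        fromEnv.setProperty("source", home);
        return fromEnv;
    }

    // Hypothetical loader: the real job would read the metadata stored in HDFS here.
    static Properties loadFromJobMetadata(String metaUrl) {
        Properties p = new Properties();
        p.setProperty("source", metaUrl);
        return p;
    }

    // Runs a task with the given config installed for the current thread, then clears it.
    static <T> T withThreadConfig(Properties config, Supplier<T> task) {
        THREAD_CONFIG.set(config);
        try {
            return task.get();
        } finally {
            THREAD_CONFIG.remove();
        }
    }

    public static void main(String[] args) {
        // Simulates an executor where neither KYLIN_CONF nor KYLIN_HOME is visible.
        Map<String, String> executorEnv = Collections.emptyMap();
        Properties cfg = loadFromJobMetadata("hdfs:///hypothetical/job/metadata");
        String source = withThreadConfig(cfg,
                () -> getInstanceFromEnv(executorEnv).getProperty("source"));
        System.out.println(source);
    }
}
```

Clearing the thread-local slot in a finally block keeps a reused executor thread from carrying one job's config into the next task.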



--
This message was sent by Atlassian Jira
(v8.3.4#803005)