You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kylin.apache.org by "wangrupeng (Jira)" <ji...@apache.org> on 2020/07/08 09:54:00 UTC

[jira] [Commented] (KYLIN-4625) Debug the code of Kylin on Parquet without hadoop environment

    [ https://issues.apache.org/jira/browse/KYLIN-4625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17153453#comment-17153453 ] 

wangrupeng commented on KYLIN-4625:
-----------------------------------

Now if you want to debug tomcat without hadoop environment, you can follow the follow steps:
* edit the properties of $KYLIN_SOURCE_DIR/examples/test_case_data/sandbox/kylin.properties to local
```log
kylin.metadata.url=$KYLIN_SOURCE_DIR/examples/test_case_data/parquet_test
kylin.env.zookeeper-is-local=true
kylin.env.hdfs-working-dir=file://$KYLIN_SOURCE_DIR/examples/test_case_data/parquet_test
kylin.engine.spark-conf.spark.master=local
kylin.engine.spark-conf.spark.eventLog.dir=/path/to/local/dir
kylin.env=LOCAL
```
* debug org.apache.kylin.rest.DebugTomcat with IDEA && add VM option "-Dspark.local=true"
  This is used for query engine
* start debug tomcat and we can use the models we already defined
 !screenshot-1.png! 

> Debug the code of Kylin on Parquet without hadoop environment
> -------------------------------------------------------------
>
>                 Key: KYLIN-4625
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4625
>             Project: Kylin
>          Issue Type: Improvement
>          Components: Spark Engine
>            Reporter: wangrupeng
>            Assignee: wangrupeng
>            Priority: Major
>         Attachments: image-2020-07-08-17-41-35-954.png, image-2020-07-08-17-42-09-603.png, screenshot-1.png
>
>
> Currently, Kylin on Parquet already supports debuging source code with local csv files, but it's a little bit complex. The steps are as follows:
> * edit the properties of $KYLIN_SOURCE_DIR/examples/test_case_data/sandbox/kylin.properties to local
>    ```log
>    kylin.metadata.url=$LOCAL_META_DIR
>    kylin.env.zookeeper-is-local=true
>    kylin.env.hdfs-working-dir=file:///path/to/local/dir
>    kylin.engine.spark-conf.spark.master=local
>    kylin.engine.spark-conf.spark.eventLog.dir=/path/to/local/dir
>    kylin.env=UT
>    ```
> * debug org.apache.kylin.rest.DebugTomcat with IDEA && add VM option "-Dspark.local=true" 
>     !image-2020-07-08-17-41-35-954.png! 
> * Load csv data source by pressing button "Data Source->Load CSV File as Table" on "Model" page, and set the schema for your table. Then press "submit" to save.
>      !image-2020-07-08-17-42-09-603.png! 
> Most time we debug just want to build and query cube easy. But current way is complex to load csv tables and create model and cube. So, I want to add a csv source  which using the model of kylin sample data directly when debug tomcat started.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)