Posted to dev@griffin.apache.org by "William Guo (JIRA)" <ji...@apache.org> on 2017/10/26 07:46:00 UTC

[jira] [Created] (GRIFFIN-67) Simplify data quality env and deployment

William Guo created GRIFFIN-67:
----------------------------------

             Summary: Simplify data quality env and deployment
                 Key: GRIFFIN-67
                 URL: https://issues.apache.org/jira/browse/GRIFFIN-67
             Project: Griffin (Incubating)
          Issue Type: Task
            Reporter: William Guo
            Assignee: William Guo
            Priority: Minor
             Fix For: 0.1.6-incubating


Hi guys,

I tried to run the Griffin measure in a local environment following the GitHub guide, but I hit several issues that took a long time to solve. I think the root cause is the number of dependencies across the whole project. By a rough count, startup involves Spark, YARN, Hadoop, Scala, ZooKeeper, and Kafka, and it is hard to describe all of the configuration details in one guide. In addition, the current Docker image is incomplete: it lacks the ZooKeeper and Kafka components, which makes it too difficult for new users to track down runtime problems. The image is also very large, about 2.7 GB, which is a heavy burden to download over the network.

So I propose the following:


1. Strip the complex dependencies and supply a basic measure sample image that users can try running.

2. Trim the user guide to make the first run easy.

Thanks,
Jin



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)