You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@griffin.apache.org by "William Guo (JIRA)" <ji...@apache.org> on 2017/10/26 07:46:00 UTC
[jira] [Created] (GRIFFIN-67) Simplify data quality env and
deployment
William Guo created GRIFFIN-67:
----------------------------------
Summary: Simplify data quality env and deployment
Key: GRIFFIN-67
URL: https://issues.apache.org/jira/browse/GRIFFIN-67
Project: Griffin (Incubating)
Issue Type: Task
Reporter: William Guo
Assignee: William Guo
Priority: Minor
Fix For: 0.1.6-incubating
Hi, Guys
I try to run griffin measure in local environment following GitHub guide, but I really hit some issues which I take much time to solve. I think basic cause is multiple dependencies in the whole project. Roughly counting, I see spark/yarn/Hadoop/Scala/zookeeper/Kafka involving startup, actually it’s hard to describe configuration details in one guide. In addition, current Docker image is not enough, lack of zookeeper/Kafka components, which is too difficult to find runtime problems for new users. Meanwhile, the image is so huge about 2.7G that is big burden for network downloading traffic.
So, I have proposal below,
1. should strip complex dependencies and supply a basic measure sample image to try running
2. trim user guide and make easy run experience
thanks
Jin
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)