
Posted to dev@zeppelin.apache.org by "Christian Tzolov (JIRA)" <ji...@apache.org> on 2015/09/02 01:03:45 UTC

[jira] [Created] (ZEPPELIN-275) Add SpringXD interpreter

Christian Tzolov created ZEPPELIN-275:
-----------------------------------------

             Summary: Add SpringXD interpreter
                 Key: ZEPPELIN-275
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-275
             Project: Zeppelin
          Issue Type: New Feature
          Components: Interpreters
    Affects Versions: 0.6.0
            Reporter: Christian Tzolov
            Assignee: Christian Tzolov


[Spring XD|http://projects.spring.io/spring-xd] is a unified, distributed, and extensible runtime platform for data ingestion, real-time analytics, batch processing, and data export. It simplifies the development of big data applications.

Spring XD provides an extensible [DSL|http://docs.spring.io/spring-xd/docs/2.0.0.BUILD-SNAPSHOT/reference/html/#dsl-guide] for defining streams and jobs using a pipes-and-filters abstraction. A simple linear stream consists of a sequence of modules: typically an input source, optional processing steps, and an output sink.

DSL example defining a stream that collects data from an HTTP source and writes it into an HDFS sink:
{code}
http --port 9000 | hdfs --fileName=<hdfs file name>
{code}

Or a Twitter search stream that stores the incoming tweets in an in-memory data grid such as Geode:
{code}
twittersearch --query=Zeppelin --outputType=application/json | gemfire-json-server --host=... --port=... --regionName=... --keyExpression=payload.getField('id_str')
{code}
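
To make the source / processing steps / sink structure concrete, here is a minimal Python sketch that splits a linear stream definition into its modules. It is only an illustration and not part of Spring XD or Zeppelin: the names Module, parse_stream, and roles are hypothetical, and the parsing is deliberately naive. It assumes that modules are separated by "|", that options are written as --key=value or --key value (both forms appear in the examples above), and that option values contain neither whitespace nor "|".

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Module:
    """One module of a stream definition (hypothetical helper type)."""
    name: str
    options: Dict[str, str] = field(default_factory=dict)


def parse_stream(definition: str) -> List[Module]:
    """Naively split a linear stream definition into modules.

    Assumption: '|' only separates modules and option values contain
    no whitespace, so plain string splitting is sufficient here.
    """
    modules = []
    for segment in definition.split("|"):
        tokens = segment.split()
        if not tokens:
            raise ValueError("empty module in stream definition")
        module = Module(name=tokens[0])
        i = 1
        while i < len(tokens):
            token = tokens[i]
            if not token.startswith("--"):
                raise ValueError(f"unexpected token: {token}")
            key, sep, value = token[2:].partition("=")
            if not sep:
                # "--key value" form: the value is the next token.
                i += 1
                if i >= len(tokens):
                    raise ValueError(f"missing value for option: {key}")
                value = tokens[i]
            module.options[key] = value
            i += 1
        modules.append(module)
    return modules


def roles(modules: List[Module]) -> Dict[str, object]:
    """Label the first module as source, the last as sink, the rest as processors."""
    if len(modules) < 2:
        raise ValueError("a stream needs at least a source and a sink")
    return {
        "source": modules[0].name,
        "processors": [m.name for m in modules[1:-1]],
        "sink": modules[-1].name,
    }


if __name__ == "__main__":
    stream = parse_stream(
        "twittersearch --query=Zeppelin --outputType=application/json"
        " | gemfire-json-server --host=... --port=... --regionName=..."
    )
    print(roles(stream))
```

A real interpreter would presumably hand the definition to Spring XD unchanged rather than parse it itself; the sketch only shows how little structure a notebook paragraph needs to express a complete pipeline.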

The Spring XD DSL is a good fit for Zeppelin notebooks because it lets users define ingestion, processing, and export pipelines declaratively and in a human-readable form.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)