Posted to dev@zeppelin.apache.org by "Christian Tzolov (JIRA)" <ji...@apache.org> on 2015/09/02 01:03:45 UTC
[jira] [Created] (ZEPPELIN-275) Add SpringXD interpreter
Christian Tzolov created ZEPPELIN-275:
-----------------------------------------
Summary: Add SpringXD interpreter
Key: ZEPPELIN-275
URL: https://issues.apache.org/jira/browse/ZEPPELIN-275
Project: Zeppelin
Issue Type: New Feature
Components: Interpreters
Affects Versions: 0.6.0
Reporter: Christian Tzolov
Assignee: Christian Tzolov
[SpringXD|http://projects.spring.io/spring-xd] is a unified, distributed, and extensible runtime platform for data ingestion, real-time analytics, batch processing, and data export. It simplifies the development of big data applications.
Spring XD provides an extensible [DSL|http://docs.spring.io/spring-xd/docs/2.0.0.BUILD-SNAPSHOT/reference/html/#dsl-guide] for defining streams and jobs using a pipes-and-filters abstraction. A simple linear stream consists of a sequence of modules: typically an Input Source, (optional) Processing Steps, and an Output Sink.
DSL example for defining a stream that collects data from an HTTP Source and writes it to an HDFS Sink:
{code}
http --port 9000 | hdfs --fileName=<hdfs file name>
{code}
Or a Twitter search stream that stores the incoming tweets in an in-memory data grid such as Geode:
{code}
twittersearch --query=Zeppelin --outputType=application/json | gemfire-json-server --host=... --port=... --regionName=... --keyExpression=payload.getField('id_str')
{code}
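To make the source/processor/sink structure of such definitions concrete, here is a rough Python sketch that splits a pipe-delimited definition into modules and their options. It is only an illustration: the function names {{parse_stream}} and {{label_roles}} are hypothetical, the splitting is naive (it does not handle a pipe character inside an option value), and it is neither Spring XD's actual parser nor a proposed Zeppelin interpreter API.

```python
def parse_stream(definition):
    """Split a pipe-delimited stream definition into (module, options) pairs.

    Naive illustration: splits on '|' and whitespace, so option values
    containing pipes or spaces are not supported.
    """
    modules = []
    for segment in definition.split("|"):
        tokens = segment.split()
        if not tokens:
            raise ValueError("empty module in stream definition")
        name, options = tokens[0], {}
        i = 1
        while i < len(tokens):
            token = tokens[i]
            if not token.startswith("--"):
                raise ValueError("expected an option, got: " + token)
            key, sep, value = token[2:].partition("=")
            if not sep:
                # "--port 9000" form: the value is the next token
                i += 1
                if i >= len(tokens):
                    raise ValueError("missing value for option: " + key)
                value = tokens[i]
            options[key] = value
            i += 1
        modules.append((name, options))
    return modules


def label_roles(modules):
    """Label the first module as source, the last as sink, the rest as processors."""
    if len(modules) < 2:
        raise ValueError("a linear stream needs at least a source and a sink")
    roles = ["source"] + ["processor"] * (len(modules) - 2) + ["sink"]
    return [(role, name) for role, (name, _) in zip(roles, modules)]


if __name__ == "__main__":
    # "events.txt" is a made-up file name used only for this example.
    stream = parse_stream("http --port 9000 | hdfs --fileName=events.txt")
    for role, name in label_roles(stream):
        print(role, name)
```

A notebook paragraph holding one of the definitions above would break down the same way: the first module is the source, the last is the sink, and anything in between is a processing step.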
The Spring XD DSL is a good fit for Zeppelin notebooks, as it allows users to define ingestion/processing/export pipelines declaratively and in a human-readable form.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)