Posted to issues@spark.apache.org by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2015/09/15 22:47:46 UTC
[jira] [Resolved] (SPARK-1363) Add streaming support for Spark SQL module
[ https://issues.apache.org/jira/browse/SPARK-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael Armbrust resolved SPARK-1363.
-------------------------------------
Resolution: Duplicate
> Add streaming support for Spark SQL module
> ------------------------------------------
>
> Key: SPARK-1363
> URL: https://issues.apache.org/jira/browse/SPARK-1363
> Project: Spark
> Issue Type: New Feature
> Components: SQL
> Reporter: Saisai Shao
> Assignee: Saisai Shao
> Attachments: StreamSQLDesignDoc.pdf
>
>
> Several projects, such as Pig On Storm and SQL on Storm (Squall, SQLstream), can already query streaming data, but Spark Streaming has no equivalent. Adding streaming SQL support to Spark SQL would be a valuable feature.
> From a semantic perspective, a DStream is much like an RDD: both provide join, filter, groupBy, and similar operators. A DStream is also backed by RDDs, so the existing Spark plan can be ported and reused.
> Catalyst also divides its steps cleanly, so we can fully reuse its parsing and logical plan analysis steps and change only the physical plan.
> We therefore propose adding streaming support to Catalyst.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)