You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2015/09/15 22:47:46 UTC

[jira] [Resolved] (SPARK-1363) Add streaming support for Spark SQL module

     [ https://issues.apache.org/jira/browse/SPARK-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael Armbrust resolved SPARK-1363.
-------------------------------------
    Resolution: Duplicate

> Add streaming support for Spark SQL module
> ------------------------------------------
>
>                 Key: SPARK-1363
>                 URL: https://issues.apache.org/jira/browse/SPARK-1363
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Saisai Shao
>            Assignee: Saisai Shao
>         Attachments: StreamSQLDesignDoc.pdf
>
>
> Currently there exists some projects like Pig On Storm, SQL on storm (Squall, SQLstream) that can query over streaming data, but for Spark Streaming, it is a blank area. It will be a good feature to add streaming supported SQL to Spark SQL.
> From semantic perspective, DStream is quite alike RDD, they both have join, filter, groupBy operators and so on, also DStream is backed by RDD, so it is transplant-able and reusable from existing spark plan.
> Also Catalyst has a clear division for each step, we can fully use its parse and logical plan analysis steps,  with only different physical plan.
> So here we propose to add streaming support in Catalyst.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org