Posted to user@spark.apache.org by Benjamin Kim <bb...@gmail.com> on 2016/10/06 23:27:19 UTC

RESTful Endpoint and Spark

Has anyone tried to integrate Spark with a server farm of RESTful API endpoints, or even plain HTTP web servers for that matter? I know this is typically done with a web farm as the presentation interface, with requests flowing through a firewall/router to a JDBC listener that will SELECT, INSERT, UPDATE and, at times, DELETE data in a database. Can the same be done using the Spark SQL Thrift Server on top of, say, HBase, Kudu, Parquet, etc.? Or can Kafka be used somewhere? Spark would be an ideal intermediary because it can talk to any data store underneath, so swapping out a technology at any time would be possible.
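To make the pattern concrete, here is a minimal sketch of the web-farm side using the JDK's built-in com.sun.net.httpserver; the port, the /users path, and the queryBackend helper are hypothetical stand-ins for whatever actually sits behind the firewall/router:

import com.sun.net.httpserver.HttpServer;
import java.io.OutputStream;
import java.net.InetSocketAddress;

public class RestFrontend {
    public static void main(String[] args) throws Exception {
        // One node of the web farm: a tiny HTTP endpoint on port 8080.
        HttpServer server = HttpServer.create(new InetSocketAddress(8080), 0);
        server.createContext("/users", exchange -> {
            // queryBackend is a hypothetical helper that would forward the
            // request over JDBC to the listener behind the firewall/router.
            byte[] body = queryBackend("SELECT * FROM users LIMIT 10").getBytes();
            exchange.sendResponseHeaders(200, body.length);
            try (OutputStream os = exchange.getResponseBody()) {
                os.write(body);
            }
        });
        server.start();
    }

    // Stub; in the Spark variant asked about above, this would run the SQL
    // against the Spark SQL Thrift Server over JDBC (see the replies below).
    static String queryBackend(String sql) {
        return "[]";
    }
}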

Just want some ideas.

Thanks,
Ben 


Re: RESTful Endpoint and Spark

Posted by Benjamin Kim <bb...@gmail.com>.
It would appear the simple answer is to use the JDBC Thrift Server in Spark.

Thanks,
Ben

> On Oct 6, 2016, at 9:38 PM, Matei Zaharia <ma...@gmail.com> wrote:
> 
> This is exactly what the Spark SQL Thrift server does, if you just want to access it using JDBC.
> 
> Matei


Re: RESTful Endpoint and Spark

Posted by Matei Zaharia <ma...@gmail.com>.
This is exactly what the Spark SQL Thrift server does, if you just want to access it using JDBC.
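For what it's worth, a minimal sketch of such a JDBC client, assuming the Thrift server is running with its defaults (it speaks the HiveServer2 protocol on localhost:10000) and querying a hypothetical events table:

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class ThriftServerClient {
    public static void main(String[] args) throws Exception {
        // The Thrift server speaks the HiveServer2 protocol, so the Hive
        // JDBC driver (org.apache.hive:hive-jdbc on the classpath) is used.
        String url = "jdbc:hive2://localhost:10000/default";
        try (Connection conn = DriverManager.getConnection(url, "spark", "");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SELECT COUNT(*) FROM events")) {
            while (rs.next()) {
                System.out.println("row count: " + rs.getLong(1));
            }
        }
    }
}

The same endpoint also works from beeline or any tool that speaks HiveServer2, which is part of what makes it attractive as a swappable intermediary.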

Matei


Re: RESTful Endpoint and Spark

Posted by Ofer Eliassaf <of...@gmail.com>.
There are two main projects for that: Livy (https://github.com/cloudera/livy)
and Spark Job Server (https://github.com/spark-jobserver/spark-jobserver).
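As a rough sketch of the Livy route (assuming Livy's default port 8998 on localhost, and hard-coding session id 0 for brevity), you create an interactive session over REST and then post statements to it:

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class LivyClient {
    public static void main(String[] args) throws Exception {
        String livy = "http://localhost:8998";
        HttpClient client = HttpClient.newHttpClient();

        // 1. Create an interactive Scala session.
        HttpRequest createSession = HttpRequest.newBuilder()
                .uri(URI.create(livy + "/sessions"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString("{\"kind\": \"spark\"}"))
                .build();
        System.out.println(
                client.send(createSession, HttpResponse.BodyHandlers.ofString()).body());

        // 2. Submit a statement. In practice the session id (0 here) should
        //    be parsed from the response above, and the session polled until
        //    it reaches the "idle" state before statements are sent.
        HttpRequest runStatement = HttpRequest.newBuilder()
                .uri(URI.create(livy + "/sessions/0/statements"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(
                        "{\"code\": \"sc.parallelize(1 to 100).count()\"}"))
                .build();
        System.out.println(
                client.send(runStatement, HttpResponse.BodyHandlers.ofString()).body());
    }
}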

-- 
Regards,
Ofer Eliassaf