You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@ctakes.apache.org by jay vyas <ja...@gmail.com> on 2014/12/11 01:26:36 UTC

SparkStreaming -> CTakes -> Cassandra ETL.

Hi folks..  Just an FYI for those interested in running CTakes in a BigData
context.

Ive been working on using CTakes inside Apache BigTop, so that you can do
big data stuff with the CTakes API.

I rewrote the CTakes spark streaming demo here:

https://github.com/jayunit100/SparkStreamingCassandraDemo/tree/master/src

It exemplifies how to stream data using spark, from twitter, and then
process it with CTakes, as well as how to Ultimately forward the results
into Cassandra as well.

Its a work in progress, but feel free to grab it as a template if looking
to integrate all these APIs.

ill brush of my SVN credentials and commit it to directly to CTakes as an
update to the streaming example in sandbox/ that is already there.

https://svn.apache.org/repos/asf/ctakes/sandbox/ctakes-spark-streaming-twitter/


-- 
jay vyas

Re: SparkStreaming -> CTakes -> Cassandra ETL.

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Guys have you sent this to dev@spark.apache.org? I’m sure they
would love to hear how you guys are using Spark!

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++






-----Original Message-----
From: Pei Chen <ch...@apache.org>
Reply-To: "user@ctakes.apache.org" <us...@ctakes.apache.org>
Date: Thursday, December 11, 2014 at 9:16 AM
To: "user@ctakes.apache.org" <us...@ctakes.apache.org>
Subject: Re: SparkStreaming -> CTakes -> Cassandra ETL.

>Jay,
>This is very cool.
>Let's plan on demo'ing this for the next ApacheCon...
>--Pei
>
>On Wed, Dec 10, 2014 at 7:26 PM, jay vyas
><ja...@gmail.com> wrote:
>
>Hi folks..  Just an FYI for those interested in running CTakes in a
>BigData context.
>
>
>Ive been working on using CTakes inside Apache BigTop, so that you can do
>big data stuff with the CTakes API.
>
>
>I rewrote the CTakes spark streaming demo here:
>
>
>https://github.com/jayunit100/SparkStreamingCassandraDemo/tree/master/src
>
>
>
>It exemplifies how to stream data using spark, from twitter, and then
>process it with CTakes, as well as how to Ultimately forward the results
>into Cassandra as well.
>
>
>Its a work in progress, but feel free to grab it as a template if looking
>to integrate all these APIs.
>
>
>ill brush of my SVN credentials and commit it to directly to CTakes as an
>update to the streaming example in sandbox/ that is already there.
>
>
>https://svn.apache.org/repos/asf/ctakes/sandbox/ctakes-spark-streaming-twi
>tter/
> 
>
>
>-- 
>jay vyas
>
>
>
>
>
>
>
>
>
>


Re: SparkStreaming -> CTakes -> Cassandra ETL.

Posted by Pei Chen <ch...@apache.org>.
Jay,
This is very cool.
Let's plan on demo'ing this for the next ApacheCon...
--Pei

On Wed, Dec 10, 2014 at 7:26 PM, jay vyas <ja...@gmail.com>
wrote:

> Hi folks..  Just an FYI for those interested in running CTakes in a
> BigData context.
>
> Ive been working on using CTakes inside Apache BigTop, so that you can do
> big data stuff with the CTakes API.
>
> I rewrote the CTakes spark streaming demo here:
>
> https://github.com/jayunit100/SparkStreamingCassandraDemo/tree/master/src
>
> It exemplifies how to stream data using spark, from twitter, and then
> process it with CTakes, as well as how to Ultimately forward the results
> into Cassandra as well.
>
> Its a work in progress, but feel free to grab it as a template if looking
> to integrate all these APIs.
>
> ill brush of my SVN credentials and commit it to directly to CTakes as an
> update to the streaming example in sandbox/ that is already there.
>
>
> https://svn.apache.org/repos/asf/ctakes/sandbox/ctakes-spark-streaming-twitter/
>
>
> --
> jay vyas
>