Posted to user@spark.apache.org by Ramprakash Ramamoorthy <yo...@gmail.com> on 2016/06/18 09:52:51 UTC

Making spark read from sources other than HDFS

Hi team,

I'm running Spark in cluster mode.

We have a custom file storage system in our organisation. Can I plug in data
from such custom (non-HDFS) sources?

Could you please shed some light on this? Where do I start, and would I
have to tweak the Spark source code? If so, where exactly should I look?

Thank you.

-- 
With Thanks and Regards,
Ramprakash Ramamoorthy,
Chennai, India

Re: Making spark read from sources other than HDFS

Posted by Mich Talebzadeh <mi...@gmail.com>.
Spark can read data from a variety of sources other than HDFS, including
ordinary RDBMS databases.

Reading from an RDBMS requires a JDBC connection to that source.

What sort of storage do you have in mind? Can you access it via JDBC, ODBC,
etc.?
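
For illustration, here is a minimal PySpark sketch of the JDBC route, assuming the storage exposes a JDBC endpoint. The host, database, table, credentials and driver class are hypothetical placeholders, and the helper names jdbc_options and read_table are made up for this example. Only the option keys (url, dbtable, user, password, driver) and the read.format("jdbc") call are part of Spark's JDBC data source.

```python
def jdbc_options(url, table, user, password, driver):
    """Collect the options that Spark's JDBC data source reads."""
    return {
        "url": url,            # JDBC connection string
        "dbtable": table,      # table (or parenthesised subquery) to load
        "user": user,
        "password": password,
        "driver": driver,      # fully qualified JDBC driver class name
    }


def read_table(spark, options):
    """Load one table as a DataFrame through the JDBC data source.

    `spark` is an existing SparkSession; on Spark 1.x a SQLContext
    can be passed instead, since it exposes the same `.read` attribute.
    """
    return spark.read.format("jdbc").options(**options).load()


# Hypothetical connection details, for illustration only.
opts = jdbc_options(
    url="jdbc:postgresql://dbhost:5432/sales",
    table="public.orders",
    user="spark_reader",
    password="change-me",
    driver="org.postgresql.Driver",
)
```

With a live session this would be called as df = read_table(spark, opts). The JDBC driver jar has to be on the classpath of the driver and executors, for example by passing it to spark-submit with --jars.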

HTH

Dr Mich Talebzadeh



LinkedIn: https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw



http://talebzadehmich.wordpress.com


