You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@spark.apache.org by gen tang <ge...@gmail.com> on 2015/01/07 16:14:10 UTC

Spark on teradata?

Hi,

I have a stupid question:
Is it possible to use spark on Teradata data warehouse, please? I read some
news on internet which say yes. However, I didn't find any example about
this issue

Thanks in advance.

Cheers
Gen

Re: Spark on teradata?

Posted by "Evan R. Sparks" <ev...@gmail.com>.

Have you taken a look at the TeradataDBInputFormat? Spark is compatible
with arbitrary hadoop input formats - so this might work for you:
http://developer.teradata.com/extensibility/articles/hadoop-mapreduce-connector-to-teradata-edw

On Thu, Jan 8, 2015 at 10:53 AM, gen tang <ge...@gmail.com> wrote:

> Thanks a lot for your reply.
> In fact, I need to work on almost all the data in teradata (~100T). So, I
> don't think that jdbcRDD is a good choice.
>
> Cheers
> Gen
>
>
> On Thu, Jan 8, 2015 at 7:39 PM, Reynold Xin <rx...@databricks.com> wrote:
>
>> Depending on your use cases. If the use case is to extract small amount
>> of data out of teradata, then you can use the JdbcRDD and soon a jdbc input
>> source based on the new Spark SQL external data source API.
>>
>>
>>
>> On Wed, Jan 7, 2015 at 7:14 AM, gen tang <ge...@gmail.com> wrote:
>>
>>> Hi,
>>>
>>> I have a stupid question:
>>> Is it possible to use spark on Teradata data warehouse, please? I read
>>> some news on internet which say yes. However, I didn't find any example
>>> about this issue
>>>
>>> Thanks in advance.
>>>
>>> Cheers
>>> Gen
>>>
>>>
>>
>

Re: Spark on teradata?

Posted by gen tang <ge...@gmail.com>.

Thanks a lot for your reply.
In fact, I need to work on almost all the data in teradata (~100T). So, I
don't think that jdbcRDD is a good choice.

Cheers
Gen


On Thu, Jan 8, 2015 at 7:39 PM, Reynold Xin <rx...@databricks.com> wrote:

> Depending on your use cases. If the use case is to extract small amount of
> data out of teradata, then you can use the JdbcRDD and soon a jdbc input
> source based on the new Spark SQL external data source API.
>
>
>
> On Wed, Jan 7, 2015 at 7:14 AM, gen tang <ge...@gmail.com> wrote:
>
>> Hi,
>>
>> I have a stupid question:
>> Is it possible to use spark on Teradata data warehouse, please? I read
>> some news on internet which say yes. However, I didn't find any example
>> about this issue
>>
>> Thanks in advance.
>>
>> Cheers
>> Gen
>>
>>
>

Re: Spark on teradata?

Posted by Reynold Xin <rx...@databricks.com>.

Depending on your use cases. If the use case is to extract small amount of
data out of teradata, then you can use the JdbcRDD and soon a jdbc input
source based on the new Spark SQL external data source API.

On Wed, Jan 7, 2015 at 7:14 AM, gen tang <ge...@gmail.com> wrote:

> Hi,
>
> I have a stupid question:
> Is it possible to use spark on Teradata data warehouse, please? I read
> some news on internet which say yes. However, I didn't find any example
> about this issue
>
> Thanks in advance.
>
> Cheers
> Gen
>
>

Re: Spark on teradata?

Posted by Reynold Xin <rx...@databricks.com>.

Depending on your use cases. If the use case is to extract small amount of
data out of teradata, then you can use the JdbcRDD and soon a jdbc input
source based on the new Spark SQL external data source API.

On Wed, Jan 7, 2015 at 7:14 AM, gen tang <ge...@gmail.com> wrote:

> Hi,
>
> I have a stupid question:
> Is it possible to use spark on Teradata data warehouse, please? I read
> some news on internet which say yes. However, I didn't find any example
> about this issue
>
> Thanks in advance.
>
> Cheers
> Gen
>
>

Re: Spark on teradata?

Posted by xhudik <xh...@gmail.com>.

I don't think this makes sense. TD database is standard RDBMS (even parallel)
while Spark is used for non-relational issues. 
What could make sense is to deploy Spark on Teradata Aster. Aster is a
database cluster that might call external programs via STREAM operator. 
That said Spark/Scala app can be can be called and process some data. The
deployment itself should be easy the potential benefit - hard to say...


hope this helps, Tomas



--
View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/Spark-on-teradata-tp10025p10042.html
Sent from the Apache Spark Developers List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org
For additional commands, e-mail: dev-help@spark.apache.org