Posted to user@spark.apache.org by satish chandra j <js...@gmail.com> on 2015/10/14 13:15:25 UTC

Fwd: Partition Column in JDBCRDD or Datasource API

Hi All,
Please give me some input on the *Partition Column* to be used in the
DataSource API or JDBCRDD. It is needed to define the lower bound and upper
bound values which, together with the number of partitions, determine how the
table is split. The issue is that my source table does not have a numeric
column that is sequential and unique, so proper partitioning cannot take
place.
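
One possible way around a missing numeric key is to skip the bounds entirely
and give the JDBC data source one WHERE-clause predicate per partition. The
sketch below only builds such predicates from a hash of a non-numeric column.
It is an illustration under assumptions, not a tested recipe: HASH_FN is a
hypothetical placeholder for whatever hash function the source database
offers, cust_code is a made-up column name, and MOD/ABS are assumed to exist
in the database's SQL dialect.

```python
def hash_partition_predicates(column, num_partitions,
                              hash_template="HASH_FN({col})"):
    """Build one SQL predicate per partition by bucketing a hash of `column`.

    `hash_template` is a hypothetical placeholder; replace HASH_FN with a
    hash function that the source database really provides.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be >= 1")
    hashed = hash_template.format(col=column)
    predicates = []
    for i in range(num_partitions):
        pred = "MOD(ABS({h}), {n}) = {i}".format(h=hashed, n=num_partitions, i=i)
        if i == 0:
            # Rows with a NULL key match no bucket, so keep them in bucket 0.
            pred = "({p} OR {c} IS NULL)".format(p=pred, c=column)
        predicates.append(pred)
    return predicates


if __name__ == "__main__":
    for p in hash_partition_predicates("cust_code", 4):
        print(p)
```

The resulting list could then be passed as the predicates argument of the
DataFrame reader's jdbc method, for example
sqlContext.read.jdbc(url, table, predicates=preds, properties=props), where
url, table and props are whatever the job already uses. Each predicate becomes
one partition, so the predicates should be mutually exclusive and together
cover every row.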

Also, what if we do not give any values for the lower bound, upper bound and
number of partitions? How would Spark then execute the read in
distributed/parallel mode, and is there a default way to run in parallel
without the inputs mentioned above?
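
To make the role of these three inputs concrete, here is a simplified model of
how a partition column, lower bound, upper bound and number of partitions can
be turned into per-partition WHERE clauses. It is a sketch of the general idea
and not Spark's actual source: the function name is made up, the bounds are
assumed to be non-negative integers, and NULL handling is left out. As far as
I understand, when none of these inputs is given the JDBC data source issues a
single query and the table arrives in one partition, which the sketch models
as a single clause of None.

```python
def range_partition_clauses(column, lower_bound, upper_bound, num_partitions):
    """Return one WHERE clause per partition (None means no filter at all).

    Simplified model: the bounds only set the stride between partitions.
    The first and last partitions are left open-ended, so rows outside
    [lower_bound, upper_bound) are still read.
    """
    if (column is None or lower_bound is None or upper_bound is None
            or num_partitions is None or num_partitions <= 1):
        # No usable partitioning information: one unfiltered partition.
        return [None]

    stride = upper_bound // num_partitions - lower_bound // num_partitions
    clauses = []
    current = lower_bound
    for i in range(num_partitions):
        low = "{c} >= {v}".format(c=column, v=current) if i != 0 else None
        current += stride
        is_last = i == num_partitions - 1
        high = "{c} < {v}".format(c=column, v=current) if not is_last else None
        if low is None:
            clauses.append(high)
        elif high is None:
            clauses.append(low)
        else:
            clauses.append(low + " AND " + high)
    return clauses


if __name__ == "__main__":
    print(range_partition_clauses("id", 0, 100, 4))
    print(range_partition_clauses(None, None, None, None))
```

Under this model a skewed or non-sequential column still produces valid
partitions, just unevenly sized ones, which is why a sequential and unique
numeric column is usually preferred.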

Regards,
Satish Chandra Jummula
---------- Forwarded message ----------
From: satish chandra j <js...@gmail.com>
Date: Wed, Sep 30, 2015 at 2:10 PM
Subject: Partition Column in JDBCRDD or Datasource API
To: user <us...@spark.apache.org>


Hi All,
Please provide your input on the partition column to be used in the
DataSource API or JDBCRDD in a scenario where the source table does not have
a numeric column that is sequential and unique, such that proper partitioning
can take place in Spark.

Regards,
Satish