You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Pallapothu Jyothi Swaroop (JIRA)" <ji...@apache.org> on 2016/07/15 09:13:21 UTC

[jira] [Updated] (SPARK-16567) how to increase performance of rdbms dataframe.

     [ https://issues.apache.org/jira/browse/SPARK-16567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pallapothu Jyothi Swaroop updated SPARK-16567:
----------------------------------------------
    Summary: how to increase performance of rdbms dataframe.  (was: How to palatalize RDBMS dataframe and perform group by.)

> how to increase performance of rdbms dataframe.
> -----------------------------------------------
>
>                 Key: SPARK-16567
>                 URL: https://issues.apache.org/jira/browse/SPARK-16567
>             Project: Spark
>          Issue Type: Question
>            Reporter: Pallapothu Jyothi Swaroop
>            Priority: Critical
>
> Hello,
> how to increase performance of rdbms dataframe.
> I need to perform group by on fetched data.
> I performed like this.
> DataFrame jdbcDF = this.SQLCONTEXT.read().format("jdbc").options(options).load();
> Options is map contains db configuration
> DataFrame groupedDataFrame = 
> jdbcDF.groupBy("UNQ_STR").count();
> How i tune this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org