Posted to issues@beam.apache.org by "Ismaël Mejía (JIRA)" <ji...@apache.org> on 2019/04/06 13:53:00 UTC

[jira] [Resolved] (BEAM-6670) Add `withOutputParallelization` option to disable reparallelization of JdbcIO.Read

     [ https://issues.apache.org/jira/browse/BEAM-6670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ismaël Mejía resolved BEAM-6670.
--------------------------------
       Resolution: Fixed
    Fix Version/s: 2.13.0

> Add `withOutputParallelization` option to disable reparallelization of JdbcIO.Read
> ----------------------------------------------------------------------------------
>
>                 Key: BEAM-6670
>                 URL: https://issues.apache.org/jira/browse/BEAM-6670
>             Project: Beam
>          Issue Type: Wish
>          Components: io-java-jdbc
>            Reporter: Mike Pedersen
>            Assignee: Mike Pedersen
>            Priority: Minor
>             Fix For: 2.13.0
>
>          Time Spent: 2h 50m
>  Remaining Estimate: 0h
>
> I run roughly 20 JDBC queries against a database and then join the results in a group-by. Every one of these reads performs a reshuffle, which is redundant here because the output is fed into a CoGroupByKey immediately afterwards.
> Reshuffling by default seems sensible by the principle of least surprise, but it would be nice to be able to disable it when it is not needed, for example through a "withReshuffle(boolean)" method.
> This should be an easy addition, and I am willing to implement it if it sounds reasonable.
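
The shape of the requested option can be sketched in plain Java. This is a minimal illustration only, not Beam's JdbcIO implementation: the class `ReadSketch`, its `of` and `steps` methods, and the step names are hypothetical stand-ins. Only the option name `withOutputParallelization` is taken from the issue title. The sketch assumes the behaviour described above: the reshuffle is on by default and is dropped only when the caller opts out.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for a JDBC read transform; this is not Beam's JdbcIO code.
final class ReadSketch {
    private final String query;
    private final boolean outputParallelization;

    private ReadSketch(String query, boolean outputParallelization) {
        this.query = query;
        this.outputParallelization = outputParallelization;
    }

    // The reshuffle stays on by default, following the principle of least surprise.
    static ReadSketch of(String query) {
        return new ReadSketch(query, true);
    }

    // Returns a modified copy, so a shared base configuration is never mutated.
    ReadSketch withOutputParallelization(boolean enabled) {
        return new ReadSketch(query, enabled);
    }

    // Lists the steps this read would expand into, in order.
    List<String> steps() {
        List<String> steps = new ArrayList<>();
        steps.add("Query(" + query + ")");
        if (outputParallelization) {
            steps.add("Reshuffle");
        }
        return Collections.unmodifiableList(steps);
    }

    public static void main(String[] args) {
        ReadSketch base = ReadSketch.of("SELECT id, name FROM customers");
        // Default configuration: the query step followed by a reshuffle.
        System.out.println(base.steps());
        // Opt-out, e.g. when a CoGroupByKey follows immediately.
        System.out.println(base.withOutputParallelization(false).steps());
    }
}
```

Keeping the flag `true` by default preserves existing behaviour for callers who never set it, and a read that feeds straight into a grouping step can skip the extra shuffle with a single call.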



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)