You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Eugene Kirpichov (JIRA)" <ji...@apache.org> on 2018/01/30 00:52:00 UTC

[jira] [Closed] (BEAM-3506) JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows

     [ https://issues.apache.org/jira/browse/BEAM-3506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eugene Kirpichov closed BEAM-3506.
----------------------------------
       Resolution: Won't Fix
    Fix Version/s: Not applicable

> JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows
> ----------------------------------------------------------------------------------------
>
>                 Key: BEAM-3506
>                 URL: https://issues.apache.org/jira/browse/BEAM-3506
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-java-extensions
>    Affects Versions: 2.3.0
>            Reporter: Knut Olav Loite
>            Assignee: Jean-Baptiste Onofré
>            Priority: Minor
>              Labels: JdbcIO, jdbc
>             Fix For: Not applicable
>
>   Original Estimate: 4h
>  Remaining Estimate: 4h
>
> The current JdbcIO write interface expects a PCollection<T> where T is the row to be written. Each instance of T is then added to a batch and written to the database. The user has little control over how many rows will be added to one batch. If JdbcIO would also support writing a PCollection<Iterable<T>> the user would have more control over the number of rows in one batch. Especially when writing to cloud databases, such as Google Cloud Spanner, the batching of multiple rows together is important for performance.
> I already have a solution locally and I will submit a pull request.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)