You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Knut Olav Loite (JIRA)" <ji...@apache.org> on 2018/01/21 05:34:00 UTC

[jira] [Created] (BEAM-3506) JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows

Knut Olav Loite created BEAM-3506:
-------------------------------------

             Summary: JdbcIO: Support writing iterables (i.e. collections) of rows instead of only single rows
                 Key: BEAM-3506
                 URL: https://issues.apache.org/jira/browse/BEAM-3506
             Project: Beam
          Issue Type: Improvement
          Components: sdk-java-extensions
    Affects Versions: 2.3.0
            Reporter: Knut Olav Loite
            Assignee: Reuven Lax


The current JdbcIO write interface expects a PCollection<T> where T is the row to be written. Each instance of T is then added to a batch and written to the database. The user has little control over how many rows will be added to one batch. If JdbcIO would also support writing a PCollection<Iterable<T>> the user would have more control over the number of rows in one batch. Especially when writing to cloud databases, such as Google Cloud Spanner, the batching of multiple rows together is important for performance.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)