You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kenneth Jung (Jira)" <ji...@apache.org> on 2020/11/10 07:20:00 UTC

[jira] [Created] (BEAM-11215) Support more granular splitting in BigQueryStorageStreamSource

Kenneth Jung created BEAM-11215:
-----------------------------------

             Summary: Support more granular splitting in BigQueryStorageStreamSource
                 Key: BEAM-11215
                 URL: https://issues.apache.org/jira/browse/BEAM-11215
             Project: Beam
          Issue Type: Improvement
          Components: io-java-gcp
    Affects Versions: 2.25.0
            Reporter: Kenneth Jung


The BigQuery storage API supports partitioning of streams in the API. However, the partitioning granularity is not sufficient for some pipelines, for which it is desirable to be able to dynamically rebalance down to the level of individual rows. This can be implemented by storing both the stream ID and an offset range as components of a stream source, as opposed to a stream ID only.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)