Posted to issues@beam.apache.org by "Kenneth Jung (Jira)" <ji...@apache.org> on 2020/11/10 07:20:00 UTC
[jira] [Created] (BEAM-11215) Support more granular splitting in BigQueryStorageStreamSource
Kenneth Jung created BEAM-11215:
-----------------------------------
Summary: Support more granular splitting in BigQueryStorageStreamSource
Key: BEAM-11215
URL: https://issues.apache.org/jira/browse/BEAM-11215
Project: Beam
Issue Type: Improvement
Components: io-java-gcp
Affects Versions: 2.25.0
Reporter: Kenneth Jung
The BigQuery Storage API supports partitioning of streams. However, that partitioning granularity is not sufficient for some pipelines, which need to be able to dynamically rebalance work down to the level of individual rows. This can be implemented by storing both the stream ID and an offset range as components of a stream source, rather than a stream ID only.
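
A minimal sketch of the idea, not the actual Beam implementation: the class StreamRange and its splitAtFraction method are hypothetical names introduced here for illustration. It assumes that the end offset of a range is known and that currentOffset is the offset of the next unread row. The sketch shows how a source identified by a stream ID plus an offset range could be split at an arbitrary row boundary.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; this is not the Beam
// BigQueryStorageStreamSource API.
final class StreamRange {
  final String streamId;   // identifies the read stream
  final long startOffset;  // first row offset in this range (inclusive)
  final long endOffset;    // last row offset in this range (exclusive)

  StreamRange(String streamId, long startOffset, long endOffset) {
    if (startOffset < 0 || endOffset < startOffset) {
      throw new IllegalArgumentException("invalid offset range");
    }
    this.streamId = streamId;
    this.startOffset = startOffset;
    this.endOffset = endOffset;
  }

  // Number of rows covered by this range.
  long size() {
    return endOffset - startOffset;
  }

  // Attempts to split the unread part of this range. currentOffset is
  // assumed to be the offset of the next unread row; fraction is the share
  // of the unread rows that the primary range keeps. Returns
  // [primary, residual] on success, or an empty list when no row boundary
  // strictly between currentOffset and endOffset matches the fraction.
  List<StreamRange> splitAtFraction(long currentOffset, double fraction) {
    List<StreamRange> result = new ArrayList<>();
    if (fraction <= 0.0 || fraction >= 1.0) {
      return result;
    }
    if (currentOffset < startOffset || currentOffset >= endOffset) {
      return result;
    }
    long remaining = endOffset - currentOffset;
    long splitOffset = currentOffset + (long) (remaining * fraction);
    if (splitOffset <= currentOffset || splitOffset >= endOffset) {
      return result;
    }
    // Both halves keep the same stream ID; only the offsets differ.
    result.add(new StreamRange(streamId, startOffset, splitOffset));
    result.add(new StreamRange(streamId, splitOffset, endOffset));
    return result;
  }
}
```

For example, under these assumptions a range over offsets [0, 100) with 20 rows already consumed, split at fraction 0.5, yields a primary range [0, 60) and a residual range [60, 100) on the same stream. A range with a single unread row cannot be split further.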
--
This message was sent by Atlassian Jira
(v8.3.4#803005)