Posted to issues@beam.apache.org by "Nahuel Lofeudo (Jira)" <ji...@apache.org> on 2019/09/12 23:43:00 UTC
[jira] [Commented] (BEAM-8222) Consider making insertId optional in BigQuery.insertAll
[ https://issues.apache.org/jira/browse/BEAM-8222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16928946#comment-16928946 ]
Nahuel Lofeudo commented on BEAM-8222:
--------------------------------------
The request is to not populate the insertId field when calling insertAll(), in order to use BigQuery's Streaming API V2 as described here:
[https://cloud.google.com/bigquery/quotas#streaming_inserts]
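
To make the request concrete, here is a minimal sketch in plain Java of what "not populating insertId" could look like when rows are assembled for an insertAll() call. It does not use Beam or the BigQuery client library: the Row class, the buildRows helper, and the skipInsertId flag are all hypothetical names chosen for illustration, and the random UUID only stands in for whatever unique ID the pipeline attaches today.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.UUID;

public class InsertIdSketch {

    /** Hypothetical stand-in for one row of an insertAll request; insertId may be null. */
    static final class Row {
        final String insertId;
        final Map<String, Object> json;

        Row(String insertId, Map<String, Object> json) {
            this.insertId = insertId;
            this.json = json;
        }
    }

    /**
     * Builds the rows for one request. When skipInsertId (a hypothetical option)
     * is true, insertId is left unset (null) instead of being filled with a unique ID.
     */
    static List<Row> buildRows(List<Map<String, Object>> elements, boolean skipInsertId) {
        List<Row> rows = new ArrayList<>();
        for (Map<String, Object> element : elements) {
            // Assumption: a random UUID stands in for the unique ID used today.
            String id = skipInsertId ? null : UUID.randomUUID().toString();
            rows.add(new Row(id, element));
        }
        return rows;
    }

    public static void main(String[] args) {
        Map<String, Object> element = new HashMap<>();
        element.put("name", "example");
        List<Map<String, Object>> elements = new ArrayList<>();
        elements.add(element);

        // Current behaviour: every row carries an insertId.
        System.out.println(buildRows(elements, false).get(0).insertId != null);
        // Requested behaviour: insertId is not populated.
        System.out.println(buildRows(elements, true).get(0).insertId == null);
    }
}
```

One thing to keep in mind with a switch like this: BigQuery uses insertId for best-effort de-duplication of streamed rows, so rows sent without it would presumably not get that de-duplication on retries.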
> Consider making insertId optional in BigQuery.insertAll
> -------------------------------------------------------
>
> Key: BEAM-8222
> URL: https://issues.apache.org/jira/browse/BEAM-8222
> Project: Beam
> Issue Type: New Feature
> Components: io-java-gcp
> Reporter: Boyuan Zhang
> Priority: Major
>
> The current implementation of StreamingWriteFn (https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/StreamingWriteFn.java#L102) sets insertId from the input element, which is tagged with a unique ID by TagWithUniqueIds (https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/TagWithUniqueIds.java#L53). Users report that writes become much faster when insertId is left empty. Can we add a bqOption such as nonInsertId and emit an empty ID when that option is set?
--
This message was sent by Atlassian Jira
(v8.3.2#803003)