You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by "bvolpato (via GitHub)" <gi...@apache.org> on 2023/01/25 14:07:49 UTC

[GitHub] [beam] bvolpato opened a new issue, #25166: [Bug]: BigQueryIO with Storage Write API may fail with NullPointerException

bvolpato opened a new issue, #25166:
URL: https://github.com/apache/beam/issues/25166

   ### What happened?
   
   I'm still trying to get to a shorter version and more specific causes to reproduce the scenario, but when using BigQueryIO with the pipeline [KafkaToBigQuery](https://github.com/GoogleCloudPlatform/DataflowTemplates/blob/main/v2/kafka-to-bigquery/src/main/java/com/google/cloud/teleport/v2/templates/KafkaToBigQuery.java), using the options:
   
   - useStorageWriteApi: true
   - numStorageWriteApiStreams: 3
   - storageWriteApiTriggeringFrequencySec: 3
   
   It may lead to a situation where the following happens (reproduced on Dataflow):
   
   ```
   java.lang.RuntimeException: java.lang.NullPointerException
   	at org.apache.beam.sdk.io.gcp.bigquery.StorageApiWritesShardedRecords$WriteRecordsDoFn.lambda$process$4(StorageApiWritesShardedRecords.java:427)
   	at org.apache.beam.sdk.io.gcp.bigquery.StorageApiWritesShardedRecords$WriteRecordsDoFn.process(StorageApiWritesShardedRecords.java:616)
   Caused by: java.lang.NullPointerException
   	at org.apache.beam.sdk.io.gcp.bigquery.StorageApiWritesShardedRecords$WriteRecordsDoFn.lambda$process$4(StorageApiWritesShardedRecords.java:422)
   	at org.apache.beam.sdk.io.gcp.bigquery.StorageApiWritesShardedRecords$WriteRecordsDoFn.process(StorageApiWritesShardedRecords.java:616)
   	at org.apache.beam.sdk.io.gcp.bigquery.StorageApiWritesShardedRecords$WriteRecordsDoFn$DoFnInvoker.invokeProcessElement(Unknown Source)
   	at org.apache.beam.runners.dataflow.worker.repackaged.org.apache.beam.runners.core.SimpleDoFnRunner.invokeProcessElement(SimpleDoFnRunner.java:211)
   ```
   
   This started happening on Beam 2.44.0.
   
   
   
   ### Issue Priority
   
   Priority: 1 (data loss / total loss of function)
   
   ### Issue Components
   
   - [ ] Component: Python SDK
   - [X] Component: Java SDK
   - [ ] Component: Go SDK
   - [ ] Component: Typescript SDK
   - [X] Component: IO connector
   - [ ] Component: Beam examples
   - [ ] Component: Beam playground
   - [ ] Component: Beam katas
   - [ ] Component: Website
   - [ ] Component: Spark Runner
   - [ ] Component: Flink Runner
   - [ ] Component: Samza Runner
   - [ ] Component: Twister2 Runner
   - [ ] Component: Hazelcast Jet Runner
   - [ ] Component: Google Cloud Dataflow Runner


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [beam] johnjcasey closed issue #25166: [Bug]: BigQueryIO with Storage Write API may fail with NullPointerException

Posted by "johnjcasey (via GitHub)" <gi...@apache.org>.
johnjcasey closed issue #25166: [Bug]: BigQueryIO with Storage Write API may fail with NullPointerException
URL: https://github.com/apache/beam/issues/25166


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org