You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by "dmateusp (via GitHub)" <gi...@apache.org> on 2023/05/04 15:44:03 UTC
[GitHub] [beam] dmateusp opened a new issue, #24169: [Feature Request]: BigQuery Storage API read throttle
dmateusp opened a new issue, #24169:
URL: https://github.com/apache/beam/issues/24169
### What would you like to happen?
Beam version: `2.36.0`
Jobs reading from the BigQuery Storage API can hit Quota limits and eventually fail with `The worker lost contact with the service.` (Dataflow runner)
The reason for the failure can be diagnosed on the API Monitoring (GCP) for BigQuery Storage API calls, where a spike in `429`s can be seen for `ReadRows` when the throughput of the job starts going downhill.
On the GCP side we've tried to resolve it through Quota increases, but it would be helpful to de-risk jobs crashing
related to:
* https://github.com/apache/beam/pull/15445 <- Shouldn't this PR have pervented this kind of issue?
### Issue Priority
Priority: 2
### Issue Component
Component: io-java-gcp
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] kennknowles commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by GitBox <gi...@apache.org>.
kennknowles commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1335573386
Sounds good. Thanks for replying!
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] dmateusp commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by GitBox <gi...@apache.org>.
dmateusp commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1335413716
hey thanks for having a look, it seems like the version of Beam we were using did not include https://github.com/apache/beam/pull/15445
We use Beam through SCIO and we don't have a release that includes Beam `2.43.0` yet, so I wasn't able to test the fix
We could close this issue for now and I could re-open it if we encounter it in 2.43.0 (which I don't think we will, given the fix above)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] kennknowles commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by GitBox <gi...@apache.org>.
kennknowles commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1335389811
@johnjcasey @reuvenlax @chamikaramj any thoughts?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] chamikaramj commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by "chamikaramj (via GitHub)" <gi...@apache.org>.
chamikaramj commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1535072535
Did you mean https://github.com/apache/beam/pull/15445 (different author) ?
I don't think that changes the retry behavior. Seems like it bumps up the metric used by Dataflow autoscaler to keep track of time the worker is throttled so that job does not unnecessarily scale up in that scenario.
Also note that this is BQ read API (different from BQ Storage Write API) in case that is what you are trying to track here.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] liferoad commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by "liferoad (via GitHub)" <gi...@apache.org>.
liferoad commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1535007491
.reopen-issue
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] liferoad commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by "liferoad (via GitHub)" <gi...@apache.org>.
liferoad commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1535010025
@chamikaramj Is your PR supposed to write the appropriate error messages?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] stfines-clgx commented on issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by "stfines-clgx (via GitHub)" <gi...@apache.org>.
stfines-clgx commented on issue #24169:
URL: https://github.com/apache/beam/issues/24169#issuecomment-1522434454
as the actual GCP customer with the issue - it continues to occur in 2.46.0
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [beam] kennknowles closed issue #24169: [Feature Request]: BigQuery Storage API read throttle
Posted by GitBox <gi...@apache.org>.
kennknowles closed issue #24169: [Feature Request]: BigQuery Storage API read throttle
URL: https://github.com/apache/beam/issues/24169
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: github-unsubscribe@beam.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org