You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Sergei Lilichenko (Jira)" <ji...@apache.org> on 2020/08/12 20:35:00 UTC

[jira] [Created] (BEAM-10693) Improve error handling in org.apache.beam.sdk.extensions.ml.CloudVision

Sergei Lilichenko created BEAM-10693:
----------------------------------------

             Summary: Improve error handling in org.apache.beam.sdk.extensions.ml.CloudVision
                 Key: BEAM-10693
                 URL: https://issues.apache.org/jira/browse/BEAM-10693
             Project: Beam
          Issue Type: New Feature
          Components: extensions-java-gcp
    Affects Versions: 2.22.0
            Reporter: Sergei Lilichenko


CloudVision invokes batchAnnotateImages API ((at AnnotateImages class). Any exception thrown will propagate to the pipeline. Because of the Vision API's quota processing large number of images with a large number of workers will often result in ResourceExhaustedException and subsequent reprocessing of the complete bundle and pipeline errors.

By catching that exception in the transform and providing an exponential backoff reprocessing and excessive error reporting can be significantly minimized.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)