You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 17:17:24 UTC

[GitHub] [beam] damccorm opened a new issue, #20418: Change org.apache.beam.sdk.extensions.ml.CloudVision to associate the AnnotateImageResponses with the image data used for the annotation

damccorm opened a new issue, #20418:
URL: https://github.com/apache/beam/issues/20418

   There is a problem with the design of that transform. It takes a PCollection<String\> (in case of GCS URIs) in and outputs PCollection<List<AnnotateImageResponse\>\>. There is no way to associate the responses with the original file URIs. [ImageAnnotationContext](https://cloud.google.com/vision/docs/reference/rest/v1/AnnotateImageResponse#ImageAnnotationContext) is returned as part of the response, but the "uri" is empty for the majority of annotations (looks like it's only populated for file annotations and not for image annotations).
   
   One approach is to return KV<String, List<AnnotateImageResponse\>\> for images where the key is the GCS URI and for bytes to pass an id of any type and do KV<IDTYPE, List<AnnotateImageResponse\>\>.
   
   Imported from Jira [BEAM-10692](https://issues.apache.org/jira/browse/BEAM-10692). Original Jira may contain additional context.
   Reported by: slilichenko.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org