Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 23:31:45 UTC

[GitHub] [beam] damccorm opened a new issue, #21444: Add flag to drop example from PredictionResult

damccorm opened a new issue, #21444:
URL: https://github.com/apache/beam/issues/21444

   RunInference currently returns both the example/input and prediction. 
   ```
   PredictionResult:
     example: torch(...)
     inference: torch(...)
   ```
   
   Users may want the ability to return only the inference, to minimize potential memory/serialization issues later in the pipeline. With a flag to opt in, the return value would look like
   ```
   PredictionResult:
     example: None
     inference: torch(...)
   ```
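   
   A hypothetical usage sketch of the proposed flag (the `drop_example` name is illustrative, not a released API; `examples` and `model_handler` are placeholders for a real input and a configured handler such as `PytorchModelHandlerTensor`):
   
   ```
   import apache_beam as beam
   from apache_beam.ml.inference.base import RunInference
   
   with beam.Pipeline() as pipeline:
       _ = (
           pipeline
           | beam.Create(examples)  # placeholder input PCollection
           # Hypothetical flag: return PredictionResult with example=None.
           | RunInference(model_handler, drop_example=True))
   ```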
   
   
   Imported from Jira [BEAM-14362](https://issues.apache.org/jira/browse/BEAM-14362). Original Jira may contain additional context.
   Reported by: yeandy.
   Subtask of issue #21435




[GitHub] [beam] AnandInguva commented on issue #21444: Add flag to drop example from PredictionResult

AnandInguva commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1248735550

   `drop_example` can be passed via the `ModelHandler` or via `beam.RunInference`. For now, I added it to the `RunInference` PTransform, since that is what outputs the `PredictionResult`. Note, though, that in the `process` method of the transform, it calls `self._model_handler.run_inference()`, which is what actually returns the `PredictionResult`.
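   
   Roughly, the shape is something like this (a simplified sketch of the transform internals, not the actual Beam source; the DoFn below is an illustrative stand-in):
   
   ```
   import apache_beam as beam
   
   class _RunInferenceDoFn(beam.DoFn):  # stand-in, not Beam's real class
       def __init__(self, model_handler, drop_example=False):
           self._model_handler = model_handler
           self._drop_example = drop_example  # option A: flag on the transform
           self._model = None
   
       def setup(self):
           self._model = self._model_handler.load_model()
   
       def process(self, batch):
           # Option B would instead thread the flag into run_inference() itself.
           results = self._model_handler.run_inference(batch, self._model)
           if self._drop_example:
               # PredictionResult is a NamedTuple, so _replace returns a copy.
               results = [r._replace(example=None) for r in results]
           yield from results
   ```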




[GitHub] [beam] AnandInguva commented on issue #21444: Add flag to drop example from PredictionResult

AnandInguva commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1246913694

   What would be the plan of action on this one? From the comments, it looks like `PredictionResult` will remain the same. Are we adding an optional flag to exclude the input from the `PredictionResult`?
   
   I guess if someone wants to exclude the outputs, they can do it using `beam.Map()` or a PostProcessor. 
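   
   For reference, that workaround (excluding the *inputs*, per the correction further down) is a one-liner; `predictions` below stands for the output of RunInference:
   
   ```
   import apache_beam as beam
   
   # Keep only the inference, dropping the input example entirely.
   inferences = predictions | beam.Map(lambda result: result.inference)
   
   # Or keep the PredictionResult shape but blank out the example.
   slimmed = predictions | beam.Map(lambda r: r._replace(example=None))
   ```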




[GitHub] [beam] AnandInguva commented on issue #21444: Add flag to drop example from PredictionResult

AnandInguva commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1246985499

   .take-issue




[GitHub] [beam] AnandInguva commented on issue #21444: Add flag to drop example from PredictionResult

AnandInguva commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1248491698

   I ran a few experiments on the Dataflow runner:
   
   1. default `PredictionResult` vs. `PredictionResult` with the input (`example`) set to `None`.
   
   I didn't notice a significant change in resource usage. @damccorm what do you think?
   
   <img width="1259" alt="Screen Shot 2022-09-15 at 3 02 39 PM" src="https://user-images.githubusercontent.com/34158215/190487881-b7423a56-84a1-4e4a-9c50-8af38a7ef5cd.png">
   
   <img width="1285" alt="Screen Shot 2022-09-15 at 3 02 50 PM" src="https://user-images.githubusercontent.com/34158215/190487903-16580e45-6cc3-4c79-8b7e-90e8bdbff9a5.png">
   




[GitHub] [beam] TheNeuralBit commented on issue #21444: Add flag to drop example from PredictionResult

TheNeuralBit commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1179358609

   Should we make returning *just* the inference the default and handle this with a composable `ModelHandler` that produces `PredictionResult` instances, similar to `KeyedModelHandler`?
   
   Unfortunately that couldn't be done in a backwards compatible way. 
   
   CC: @robertwb 
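   
   Such a wrapper might look roughly like this (a hypothetical sketch, not an existing Beam class; only `KeyedModelHandler` is the real analogue):
   
   ```
   from apache_beam.ml.inference.base import ModelHandler, PredictionResult
   
   class PredictionResultModelHandler(ModelHandler):
       """Hypothetical wrapper: pairs each bare inference with its example."""
   
       def __init__(self, wrapped):
           # `wrapped` is a handler whose run_inference returns bare inferences.
           self._wrapped = wrapped
   
       def load_model(self):
           return self._wrapped.load_model()
   
       def run_inference(self, batch, model, inference_args=None):
           inferences = self._wrapped.run_inference(batch, model, inference_args)
           return [
               PredictionResult(example=example, inference=inference)
               for example, inference in zip(batch, inferences)
           ]
   ```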




[GitHub] [beam] yeandy commented on issue #21444: Add flag to drop example from PredictionResult

yeandy commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1185756756

   Subtask of https://github.com/apache/beam/issues/22117




[GitHub] [beam] damccorm closed issue #21444: Add flag to drop example from PredictionResult

damccorm closed issue #21444: Add flag to drop example from PredictionResult
URL: https://github.com/apache/beam/issues/21444




[GitHub] [beam] damccorm commented on issue #21444: Add flag to drop example from PredictionResult

damccorm commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1238586362

   > Should we make returning _just_ the inference the default and handle this with a composable `ModelHandler` that produces `PredictionResult` instances, similar to `KeyedModelHandler`?
   > 
   > Unfortunately that couldn't be done in a backwards compatible way.
   
   My instinct is that the large majority of users are going to want both passed through, and we should prioritize that need (and not making a breaking change) over making things a little cleaner for the inference-only case.
   
   The other advantage of always returning a `PredictionResult` is that it gives users more flexibility if, for example, they initially only need the inferences but later want to do something with the input itself. If we return just the inferences, then they need to insert an intermediate mapping step to make their existing logic work; if we always return a `PredictionResult`, then they can flip the flag, grab the initial input wherever they need it, and not touch anything else.
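   
   Concretely (a sketch; `drop_example` is still the hypothetical flag, and `inputs`/`model_handler` are placeholders):
   
   ```
   import apache_beam as beam
   from apache_beam.ml.inference.base import RunInference
   
   # Version 1: only the inferences are needed downstream.
   results = inputs | RunInference(model_handler, drop_example=True)
   sink = results | beam.Map(lambda r: r.inference)
   
   # Version 2 (later): the input is wanted too. Flip the flag; existing
   # access to r.inference is untouched, and r.example becomes available.
   results = inputs | RunInference(model_handler, drop_example=False)
   pairs = results | beam.Map(lambda r: (r.example, r.inference))
   ```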
   
   ----------
   
   There's also a larger philosophical question of when we should introduce a new model handler vs. parameterize a new behavior option. I'd argue that because, in this case, omitting the input is essentially a subset of the current output (with no changes to the input) and we're not changing the graph structure, we're better off treating this as an option (a parameter to the `RunInference` call) than introducing a new handler.




[GitHub] [beam] damccorm commented on issue #21444: Add flag to drop example from PredictionResult

damccorm commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1246957101

   > Are we adding an optional flag to exclude the input from the PredictionResult?
   
   Yeah, I think that's the best option here
   
   > I guess if someone wants to exclude the outputs, they can do it using beam.Map() or a PostProcessor.
   
   I don't quite follow this - why would someone want to exclude the outputs? Do you just mean unnesting the outputs from the `PredictionResult`?




[GitHub] [beam] AnandInguva commented on issue #21444: Add flag to drop example from PredictionResult

AnandInguva commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1246983903

   > > Are we adding an optional flag to exclude the input from the PredictionResult?
   > 
   > Yeah, I think that's the best option here
   > 
   > > I guess if someone wants to exclude the outputs, they can do it using beam.Map() or a PostProcessor.
   > 
   > I don't quite follow this - why would someone want to exclude the outputs? Do you just mean unnesting the outputs from the `PredictionResult`?
   
   I am sorry. It's `inputs`, not `outputs`.




[GitHub] [beam] damccorm commented on issue #21444: Add flag to drop example from PredictionResult

damccorm commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1259587397

   @AnandInguva I'm reopening this one since we're trying https://github.com/apache/beam/pull/23356 instead.




[GitHub] [beam] damccorm commented on issue #21444: Add flag to drop example from PredictionResult

damccorm commented on issue #21444:
URL: https://github.com/apache/beam/issues/21444#issuecomment-1248537960

   I think the difference will largely depend on the workload. For our examples it is probably not significant, because we're pretty much just writing the inference results immediately, but I can imagine that if (a) your input size is significantly larger, or (b) you're doing more than simple maps post-inference (especially something that breaks pipeline fusion and causes work to be distributed to new workers), then the costs would be higher.
   
   I'm generally still in favor of doing this as a small convenience for users, since it's cheap.
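   
   For example, any fusion-breaking step after inference, such as a `Reshuffle`, forces each full `PredictionResult`, inputs included, to be serialized and shipped between workers (a sketch; `inputs`, `model_handler`, and `post_process` are placeholders):
   
   ```
   import apache_beam as beam
   from apache_beam.ml.inference.base import RunInference
   
   results = inputs | RunInference(model_handler)
   # Reshuffle breaks fusion: every PredictionResult crosses the shuffle
   # boundary, so carrying large input examples here costs real I/O.
   redistributed = results | beam.Reshuffle()
   processed = redistributed | beam.Map(post_process)
   ```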

