You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 22:43:51 UTC

[GitHub] [beam] damccorm opened a new issue, #21345: Use runtime information to improve CoGroupByKey caching

damccorm opened a new issue, #21345:
URL: https://github.com/apache/beam/issues/21345

   Currently, CoGroupByKey creates UnionTables that are Flattened. The Flattened output is processed by a GroupByKey to produce a CoGbkResult (via ConstructCoGbkResultFn). 
   
    
   
   Given the performance of CoGBK is greatly impacted based on the which elements are cached in the (finitely sized) in-memory results, it would be useful if CoGbkResult can use runtime information to prioritize which elements are stored in-memory.
   
   Imported from Jira [BEAM-13541](https://issues.apache.org/jira/browse/BEAM-13541). Original Jira may contain additional context.
   Reported by: pedapudi.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org