You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/11/15 05:42:52 UTC

[GitHub] [beam] jcalvarezj commented on issue #24148: [Feature Request]: Support relational Joins in Python SDK

jcalvarezj commented on issue #24148:
URL: https://github.com/apache/beam/issues/24148#issuecomment-1314803183

   > I believe the main way you can do this in Python is directly using [`CoGroupByKey`](https://beam.apache.org/releases/pydoc/2.42.0/apache_beam.transforms.util.html?highlight=cogroupbykey#apache_beam.transforms.util.CoGroupByKey)
   > 
   > Your request is to have a friendly wrapper for an equijoin on a particular schema field of two schema-aware PCollections? This seems like a possible project as a first contribution. A bit advanced, but doable.
   > 
   > CC @tvalentyn
   
   That's right, I've worked with CoGroupByKey successfully with a couple workarounds, but I feel that schema-aware PCollections Join operations would save a lot of repetitive work on many use cases (as it's a more direct and friendly way as far as I've seen in Java implementations)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org