Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/13 18:37:10 UTC

[GitHub] [beam] TheNeuralBit opened a new issue, #21827: [Task]: grouping on categorical columns should not require Singleton partitioning

TheNeuralBit opened a new issue, #21827:
URL: https://github.com/apache/beam/issues/21827

   ### What needs to happen?
   
   Currently, groupby on a categorical column with observed=False (the default) requires aggregating in the Singleton partition, because each partition would otherwise produce a result containing every index value, including categories that the partition never observed.
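
The behaviour described above can be sketched in plain pandas, without Beam. This is only an illustration under stated assumptions: the column names, the data, and the two hand-made "partitions" (slices of one frame) are hypothetical and are not how Beam actually partitions a DataFrame.

```python
import pandas as pd

# Hypothetical data: "color" is categorical with three declared categories,
# one of which ("green") never occurs in the rows.
df = pd.DataFrame({
    "color": pd.Categorical(
        ["red", "red", "blue", "blue"],
        categories=["red", "green", "blue"],
    ),
    "n": [1, 2, 3, 4],
})

# Stand-in for arbitrary partitioning: two slices of the same frame.
partitions = [df.iloc[:2], df.iloc[2:]]

# observed=False: every partition emits a row for every category,
# even for categories it never saw.
unobserved = [
    p.groupby("color", observed=False)["n"].sum() for p in partitions
]

# observed=True: each partition emits only the categories it contains.
observed = [
    p.groupby("color", observed=True)["n"].sum() for p in partitions
]

# Aggregating the whole frame at once, which is what a single
# (Singleton) partition amounts to in this sketch.
full = df.groupby("color", observed=False)["n"].sum()

for i, (u, o) in enumerate(zip(unobserved, observed)):
    print(f"partition {i}: observed=False -> {len(u)} rows, "
          f"observed=True -> {len(o)} rows")
print(f"whole frame: observed=False -> {len(full)} rows")
```

In this sketch, concatenating the per-partition observed=False results would yield one row per category per partition, so a further combine step would be needed to collapse them into the single row per category that the whole-frame aggregation produces.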
   
   Originally reported in BEAM-11190, which was erroneously closed at the time of the migration to GitHub Issues.
   
   ### Issue Priority
   
   Priority: 2
   
   ### Issue Component
   
   Component: dsl-dataframe


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org