You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by GitBox <gi...@apache.org> on 2023/01/02 23:09:12 UTC

[GitHub] [arrow] ava6969 opened a new issue, #15162: Usage of Hash Aggregation

ava6969 opened a new issue, #15162:
URL: https://github.com/apache/arrow/issues/15162

   ### Describe the usage question you have. Please include as many useful details as  possible.
   
   
   I am still confused on the right way to use the hash aggregations. I found 3 different approaches in the hash_aggregation_test.cc which included a naive grouper/exec_plan and internal group_by. Is anyone particularly recommended as GroupBy is used in the python api but the doc said its not for public use.
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] ava6969 closed issue #15162: [C++] Usage of Hash Aggregation

Posted by GitBox <gi...@apache.org>.
ava6969 closed issue #15162: [C++] Usage of Hash Aggregation
URL: https://github.com/apache/arrow/issues/15162


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] westonpace commented on issue #15162: [C++] Usage of Hash Aggregation

Posted by GitBox <gi...@apache.org>.
westonpace commented on issue #15162:
URL: https://github.com/apache/arrow/issues/15162#issuecomment-1371325296

   My opinion is that the correct approach should be to use an exec plan.  Pyarrow perhaps uses `internal::GroupBy` because it finds the API more convenient but it was never intended to be a long term thing.
   
   I have a PR (#14867) to adapt the `internal::GroupBy` API to using an exec plan under the hood for those that find that API more convenient.  However, this PR is not yet merged.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] ava6969 commented on issue #15162: [C++] Usage of Hash Aggregation

Posted by GitBox <gi...@apache.org>.
ava6969 commented on issue #15162:
URL: https://github.com/apache/arrow/issues/15162#issuecomment-1371539833

   Thanks for your clarification i'll check it out.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org