You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/11/11 19:34:47 UTC

[GitHub] [arrow-datafusion] alamb opened a new issue, #4179: Use arrow row format in GroupByHash rather than datafusion row format

alamb opened a new issue, #4179:
URL: https://github.com/apache/arrow-datafusion/issues/4179

   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   Arrow now has a row format https://docs.rs/arrow/26.0.0/arrow/row/index.html that is both well documented and likely to get additional functionality (as it will be used for sorting, etc) 
   
   It would be good to use that in DataFusion to benefit from upstream efforts
   
   
   
   **Describe the solution you'd like**
   It would be great to use the Arrow row format instead of `Compact` layout: https://github.com/apache/arrow-datafusion/blob/f2f846512ab032845de5dcee768a8a69ddf17eac/datafusion/row/src/layout.rs#L68
   
   **Describe alternatives you've considered**
   A clear and concise description of any alternative solutions or features you've considered.
   
   **Additional context**
   As part of https://github.com/apache/arrow-datafusion/issues/3941 I believe @crepererum  is working on this feature
   
   cc @yjshen 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] alamb commented on issue #4179: Use arrow row format in GroupByHash rather than datafusion row format

Posted by GitBox <gi...@apache.org>.
alamb commented on issue #4179:
URL: https://github.com/apache/arrow-datafusion/issues/4179#issuecomment-1386167949

   I think we should probably file a follow on ticket for the group state (aka the aggregates) as I am not sure the existing row format is actually the right thing. I'll try and sort this out tomorrow when filing tickets


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] tustvold commented on issue #4179: Use arrow row format in GroupByHash rather than datafusion row format

Posted by GitBox <gi...@apache.org>.
tustvold commented on issue #4179:
URL: https://github.com/apache/arrow-datafusion/issues/4179#issuecomment-1385698008

   I believe this was closed by https://github.com/apache/arrow-datafusion/pull/4830


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] crepererum commented on issue #4179: Use arrow row format in GroupByHash rather than datafusion row format

Posted by GitBox <gi...@apache.org>.
crepererum commented on issue #4179:
URL: https://github.com/apache/arrow-datafusion/issues/4179#issuecomment-1385731259

   Only for the group key, not for the group state.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] tustvold closed issue #4179: Use arrow row format in GroupByHash rather than datafusion row format

Posted by GitBox <gi...@apache.org>.
tustvold closed issue #4179: Use arrow row format in GroupByHash rather than datafusion row format
URL: https://github.com/apache/arrow-datafusion/issues/4179


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org