Posted to github@arrow.apache.org by "jychen7 (via GitHub)" <gi...@apache.org> on 2023/04/14 01:51:25 UTC

[GitHub] [arrow-rs] jychen7 opened a new issue, #4087: lexsort_to_indices may output larger size than input

jychen7 opened a new issue, #4087:
URL: https://github.com/apache/arrow-rs/issues/4087

   **Describe the bug**
   Not sure whether this is an arrow-rs bug; this issue is created to track it.
   We found the problem in https://github.com/apache/arrow-datafusion/issues/3747#issuecomment-1271514648
   
   **To Reproduce**
   TBD
   
   **Expected behavior**
   The sort output size should be <= the input size (a smaller output is expected for a partial sort, i.e. top K).
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-rs] jychen7 commented on issue #4087: lexsort_to_indices may output larger size than input

Posted by "jychen7 (via GitHub)" <gi...@apache.org>.
jychen7 commented on issue #4087:
URL: https://github.com/apache/arrow-rs/issues/4087#issuecomment-1557862960

   > one could argue that we should correctly estimate the output buffers to not be larger than the inputs, this is not guaranteed.
   
   Thanks, that makes sense




[GitHub] [arrow-rs] tustvold closed issue #4087: lexsort_to_indices may output larger size than input

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold closed issue #4087: lexsort_to_indices may output larger size than input
URL: https://github.com/apache/arrow-rs/issues/4087




[GitHub] [arrow-rs] tustvold commented on issue #4087: lexsort_to_indices may output larger size than input

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold commented on issue #4087:
URL: https://github.com/apache/arrow-rs/issues/4087#issuecomment-1508061463

   I don't think the size of the underlying buffers is something we guarantee. Whilst one could argue that we should correctly estimate the output buffers to not be larger than the inputs, this is not guaranteed.
   
   Perhaps you might explain why this is a problem?
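
The distinction between the logical output length and the size of the underlying buffers can be illustrated with a minimal Rust sketch that uses only the standard library. It is not the arrow-rs implementation: `lexsort_indices_sketch`, its `limit` parameter, and the deliberately doubled allocation are hypothetical, chosen only to show that the number of returned indices stays <= the number of input rows while the backing allocation may still be larger. All columns are assumed to have the same length.

```rust
use std::cmp::Ordering;

/// Hypothetical helper (not the arrow-rs API): returns the row indices that
/// would sort `columns` lexicographically, keeping at most `limit` indices.
/// Assumes every column has the same number of rows.
fn lexsort_indices_sketch(columns: &[Vec<i32>], limit: Option<usize>) -> Vec<u32> {
    let num_rows = columns.first().map_or(0, |c| c.len());

    // Deliberately over-allocate to mimic a loose size estimate (assumption).
    let mut indices: Vec<u32> = Vec::with_capacity(num_rows * 2);
    indices.extend(0..num_rows as u32);

    // Compare rows column by column; the first non-equal column decides.
    indices.sort_by(|&a, &b| {
        for col in columns {
            match col[a as usize].cmp(&col[b as usize]) {
                Ordering::Equal => continue,
                other => return other,
            }
        }
        Ordering::Equal
    });

    // A partial (top K) sort shortens the logical length only;
    // `truncate` leaves the allocated capacity untouched.
    if let Some(k) = limit {
        indices.truncate(k);
    }
    indices
}

fn main() {
    let columns = vec![vec![2, 1, 2, 1], vec![9, 8, 7, 6]];
    let all = lexsort_indices_sketch(&columns, None);
    let top2 = lexsort_indices_sketch(&columns, Some(2));
    println!("all  = {:?} (len {}, capacity {})", all, all.len(), all.capacity());
    println!("top2 = {:?} (len {}, capacity {})", top2, top2.len(), top2.capacity());
}
```

In this sketch `len()` corresponds to the expected behavior described in the issue, while `capacity()` plays the role of the underlying buffer size, which carries no such guarantee.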




[GitHub] [arrow-rs] tustvold commented on issue #4087: lexsort_to_indices may output larger size than input

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold commented on issue #4087:
URL: https://github.com/apache/arrow-rs/issues/4087#issuecomment-1557080026

   Closing this as I've not heard back; feel free to reopen if this is still an issue.

