You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/07/07 13:55:50 UTC

[GitHub] [arrow-datafusion] NGA-TRAN commented on issue #396: DataFusion benchmarks should show executed plan with metrics after query completes

NGA-TRAN commented on issue #396:
URL: https://github.com/apache/arrow-datafusion/issues/396#issuecomment-875625584


   Regarding metrics, I think the following items will be useful:
     1. Sort algorithm (quick sort, ...)
     2. If possible, how many partitions (input streams) get sorted.
     3. Depends on sort algorithm, we may be able to know what kind of input encoding (e.g. RLE) and number of distinct values. I think these metrics will also help us to evaluate the effectiveness of the sort algorithm we choose.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org