Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/04/13 05:39:33 UTC
[GitHub] [arrow] cyb70289 commented on pull request #10009: ARROW-11568: [C++][Compute] Rewrite mode kernel
cyb70289 commented on pull request #10009:
URL: https://github.com/apache/arrow/pull/10009#issuecomment-818452704
Benchmark results on Skylake, built with clang-9:
```
-----------------------------------------------------------------------------------------------
Non-regressions: (40)
-----------------------------------------------------------------------------------------------
benchmark baseline contender change % counters
ModeKernelWide<Int32Type>/1048576/0 20.644 MiB/sec 66.187 MiB/sec 220.613 {}
ModeKernelWide<Int32Type>/1048576/100 20.768 MiB/sec 65.772 MiB/sec 216.699 {}
ModeKernelWide<Int32Type>/1048576/10000 20.626 MiB/sec 65.291 MiB/sec 216.546 {}
ModeKernelWide<Int32Type>/1048576/10 22.390 MiB/sec 69.904 MiB/sec 212.209 {}
ModeKernelWide<FloatType>/1048576/10000 18.174 MiB/sec 54.517 MiB/sec 199.975 {}
ModeKernelWide<FloatType>/1048576/0 18.423 MiB/sec 54.869 MiB/sec 197.826 {}
ModeKernelWide<FloatType>/1048576/10 20.009 MiB/sec 59.346 MiB/sec 196.601 {}
ModeKernelWide<FloatType>/1048576/100 18.486 MiB/sec 54.568 MiB/sec 195.186 {}
ModeKernelWide<Int64Type>/1048576/100 46.593 MiB/sec 136.786 MiB/sec 193.578 {}
ModeKernelWide<Int64Type>/1048576/0 46.434 MiB/sec 134.296 MiB/sec 189.221 {}
ModeKernelWide<Int64Type>/1048576/10000 46.435 MiB/sec 134.149 MiB/sec 188.894 {}
ModeKernelNarrow<Int64Type>/1048576/0 893.469 MiB/sec 2.515 GiB/sec 188.197 {}
ModeKernelNarrow<Int64Type>/1048576/10000 886.403 MiB/sec 2.484 GiB/sec 186.933 {}
ModeKernelWide<Int64Type>/1048576/10 50.182 MiB/sec 143.034 MiB/sec 185.029 {}
ModeKernelWide<DoubleType>/1048576/10000 40.525 MiB/sec 115.477 MiB/sec 184.953 {}
ModeKernelWide<DoubleType>/1048576/100 41.114 MiB/sec 116.862 MiB/sec 184.241 {}
ModeKernelWide<DoubleType>/1048576/10 44.555 MiB/sec 126.095 MiB/sec 183.008 {}
ModeKernelWide<DoubleType>/1048576/0 41.083 MiB/sec 113.499 MiB/sec 176.263 {}
ModeKernelWide<FloatType>/1048576/2 39.704 MiB/sec 106.142 MiB/sec 167.335 {}
ModeKernelNarrow<Int64Type>/1048576/100 853.217 MiB/sec 2.223 GiB/sec 166.836 {}
ModeKernelWide<Int32Type>/1048576/2 43.329 MiB/sec 115.209 MiB/sec 165.893 {}
ModeKernelWide<DoubleType>/1048576/2 88.485 MiB/sec 223.284 MiB/sec 152.340 {}
ModeKernelWide<Int64Type>/1048576/2 97.572 MiB/sec 236.831 MiB/sec 142.723 {}
ModeKernelNarrow<Int32Type>/1048576/0 715.706 MiB/sec 1.522 GiB/sec 117.703 {}
ModeKernelNarrow<Int32Type>/1048576/10000 714.237 MiB/sec 1.497 GiB/sec 114.577 {}
ModeKernelNarrow<Int64Type>/1048576/10 701.986 MiB/sec 1.399 GiB/sec 104.094 {}
ModeKernelNarrow<Int32Type>/1048576/100 666.834 MiB/sec 1.291 GiB/sec 98.234 {}
ModeKernelNarrow<Int64Type>/1048576/2 611.412 MiB/sec 1.057 GiB/sec 77.008 {}
ModeKernelNarrow<Int32Type>/1048576/10 499.852 MiB/sec 780.344 MiB/sec 56.115 {}
ModeKernelNarrow<Int32Type>/1048576/2 402.986 MiB/sec 562.758 MiB/sec 39.647 {}
ModeKernelNarrow<Int8Type>/1048576/1 499.705 GiB/sec 639.311 GiB/sec 27.938 {}
ModeKernelWide<DoubleType>/1048576/1 599.969 GiB/sec 738.218 GiB/sec 23.043 {}
ModeKernelNarrow<Int64Type>/1048576/1 610.213 GiB/sec 743.338 GiB/sec 21.816 {}
ModeKernelWide<Int64Type>/1048576/1 608.551 GiB/sec 735.741 GiB/sec 20.901 {}
ModeKernelWide<Int32Type>/1048576/1 611.261 GiB/sec 727.163 GiB/sec 18.961 {}
ModeKernelNarrow<Int32Type>/1048576/1 601.834 GiB/sec 714.017 GiB/sec 18.640 {}
ModeKernelWide<FloatType>/1048576/1 612.628 GiB/sec 716.591 GiB/sec 16.970 {}
ModeKernelNarrow<BooleanType>/1048576/1 628.851 GiB/sec 691.936 GiB/sec 10.032 {}
ModeKernelNarrow<BooleanType>/1048576/0 21.709 GiB/sec 22.397 GiB/sec 3.166 {}
ModeKernelNarrow<Int8Type>/1048576/2 293.420 MiB/sec 286.770 MiB/sec -2.266 {}
-------------------------------------------------------------------------------------------------
Regressions: (8)
-------------------------------------------------------------------------------------------------
benchmark baseline contender change % counters
ModeKernelNarrow<BooleanType>/1048576/10000 1.959 GiB/sec 1.856 GiB/sec -5.274 {}
ModeKernelNarrow<BooleanType>/1048576/2 1.957 GiB/sec 1.854 GiB/sec -5.293 {}
ModeKernelNarrow<BooleanType>/1048576/100 1.972 GiB/sec 1.866 GiB/sec -5.374 {}
ModeKernelNarrow<BooleanType>/1048576/10 1.974 GiB/sec 1.855 GiB/sec -6.027 {}
ModeKernelNarrow<Int8Type>/1048576/10 364.472 MiB/sec 342.415 MiB/sec -6.052 {}
ModeKernelNarrow<Int8Type>/1048576/0 484.491 MiB/sec 440.347 MiB/sec -9.111 {}
ModeKernelNarrow<Int8Type>/1048576/100 458.333 MiB/sec 416.045 MiB/sec -9.226 {}
ModeKernelNarrow<Int8Type>/1048576/10000 486.679 MiB/sec 439.186 MiB/sec -9.759 {}
```
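For anyone reading the table: the `change %` column appears to be the relative throughput difference, `(contender - baseline) / baseline * 100`, taken after converting both values to the same unit. Below is a minimal sketch of that arithmetic, not the benchmark tool's actual code. The helper names are hypothetical, and the 5% regression cutoff is an assumption inferred from where the table splits (-2.266 is listed as a non-regression, -5.274 as a regression). Recomputing from the rounded values shown only matches the reported percentages approximately.

```python
# Hypothetical helpers illustrating how the "change %" column can be reproduced.
# Assumption: throughput units are binary (1 GiB = 1024 MiB).
UNIT_TO_MIB = {"MiB/sec": 1.0, "GiB/sec": 1024.0}


def to_mib_per_sec(value, unit):
    """Convert a throughput reading to MiB/sec so rows with mixed units compare."""
    return value * UNIT_TO_MIB[unit]


def change_percent(baseline, contender):
    """Relative change of contender over baseline, in percent (higher is better)."""
    return (contender - baseline) / baseline * 100.0


def is_regression(change_pct, threshold_pct=5.0):
    """Assumed rule: a drop larger than the threshold counts as a regression."""
    return change_pct < -threshold_pct


# ModeKernelWide<Int32Type>/1048576/0: both columns in MiB/sec.
wide_int32 = change_percent(20.644, 66.187)

# ModeKernelNarrow<Int64Type>/1048576/0: baseline in MiB/sec, contender in GiB/sec.
narrow_int64 = change_percent(
    to_mib_per_sec(893.469, "MiB/sec"),
    to_mib_per_sec(2.515, "GiB/sec"),
)

# ModeKernelNarrow<Int8Type>/1048576/10000: the largest slowdown in the table.
narrow_int8 = change_percent(486.679, 439.186)

print(f"{wide_int32:.1f} {narrow_int64:.1f} {narrow_int8:.1f}")
print(is_regression(narrow_int8), is_regression(-2.266))
```

Read this way, the Wide kernels for non-trivial inputs are roughly 2.4x to 3.2x faster, while the Int8 and Boolean Narrow cases slow down by about 5% to 10%.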