You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/04/13 05:39:33 UTC

[GitHub] [arrow] cyb70289 commented on pull request #10009: ARROW-11568: [C++][Compute] Rewrite mode kernel

cyb70289 commented on pull request #10009:
URL: https://github.com/apache/arrow/pull/10009#issuecomment-818452704


   Benchmark on skylake, clang-9.
   
   ```
   -----------------------------------------------------------------------------------------------
   Non-regressions: (40)
   -----------------------------------------------------------------------------------------------
                                    benchmark         baseline        contender  change % counters
          ModeKernelWide<Int32Type>/1048576/0   20.644 MiB/sec   66.187 MiB/sec   220.613       {}
        ModeKernelWide<Int32Type>/1048576/100   20.768 MiB/sec   65.772 MiB/sec   216.699       {}
      ModeKernelWide<Int32Type>/1048576/10000   20.626 MiB/sec   65.291 MiB/sec   216.546       {}
         ModeKernelWide<Int32Type>/1048576/10   22.390 MiB/sec   69.904 MiB/sec   212.209       {}
      ModeKernelWide<FloatType>/1048576/10000   18.174 MiB/sec   54.517 MiB/sec   199.975       {}
          ModeKernelWide<FloatType>/1048576/0   18.423 MiB/sec   54.869 MiB/sec   197.826       {}
         ModeKernelWide<FloatType>/1048576/10   20.009 MiB/sec   59.346 MiB/sec   196.601       {}
        ModeKernelWide<FloatType>/1048576/100   18.486 MiB/sec   54.568 MiB/sec   195.186       {}
        ModeKernelWide<Int64Type>/1048576/100   46.593 MiB/sec  136.786 MiB/sec   193.578       {}
          ModeKernelWide<Int64Type>/1048576/0   46.434 MiB/sec  134.296 MiB/sec   189.221       {}
      ModeKernelWide<Int64Type>/1048576/10000   46.435 MiB/sec  134.149 MiB/sec   188.894       {}
        ModeKernelNarrow<Int64Type>/1048576/0  893.469 MiB/sec    2.515 GiB/sec   188.197       {}
    ModeKernelNarrow<Int64Type>/1048576/10000  886.403 MiB/sec    2.484 GiB/sec   186.933       {}
         ModeKernelWide<Int64Type>/1048576/10   50.182 MiB/sec  143.034 MiB/sec   185.029       {}
     ModeKernelWide<DoubleType>/1048576/10000   40.525 MiB/sec  115.477 MiB/sec   184.953       {}
       ModeKernelWide<DoubleType>/1048576/100   41.114 MiB/sec  116.862 MiB/sec   184.241       {}
        ModeKernelWide<DoubleType>/1048576/10   44.555 MiB/sec  126.095 MiB/sec   183.008       {}
         ModeKernelWide<DoubleType>/1048576/0   41.083 MiB/sec  113.499 MiB/sec   176.263       {}
          ModeKernelWide<FloatType>/1048576/2   39.704 MiB/sec  106.142 MiB/sec   167.335       {}
      ModeKernelNarrow<Int64Type>/1048576/100  853.217 MiB/sec    2.223 GiB/sec   166.836       {}
          ModeKernelWide<Int32Type>/1048576/2   43.329 MiB/sec  115.209 MiB/sec   165.893       {}
         ModeKernelWide<DoubleType>/1048576/2   88.485 MiB/sec  223.284 MiB/sec   152.340       {}
          ModeKernelWide<Int64Type>/1048576/2   97.572 MiB/sec  236.831 MiB/sec   142.723       {}
        ModeKernelNarrow<Int32Type>/1048576/0  715.706 MiB/sec    1.522 GiB/sec   117.703       {}
    ModeKernelNarrow<Int32Type>/1048576/10000  714.237 MiB/sec    1.497 GiB/sec   114.577       {}
       ModeKernelNarrow<Int64Type>/1048576/10  701.986 MiB/sec    1.399 GiB/sec   104.094       {}
      ModeKernelNarrow<Int32Type>/1048576/100  666.834 MiB/sec    1.291 GiB/sec    98.234       {}
        ModeKernelNarrow<Int64Type>/1048576/2  611.412 MiB/sec    1.057 GiB/sec    77.008       {}
       ModeKernelNarrow<Int32Type>/1048576/10  499.852 MiB/sec  780.344 MiB/sec    56.115       {}
        ModeKernelNarrow<Int32Type>/1048576/2  402.986 MiB/sec  562.758 MiB/sec    39.647       {}
         ModeKernelNarrow<Int8Type>/1048576/1  499.705 GiB/sec  639.311 GiB/sec    27.938       {}
         ModeKernelWide<DoubleType>/1048576/1  599.969 GiB/sec  738.218 GiB/sec    23.043       {}
        ModeKernelNarrow<Int64Type>/1048576/1  610.213 GiB/sec  743.338 GiB/sec    21.816       {}
          ModeKernelWide<Int64Type>/1048576/1  608.551 GiB/sec  735.741 GiB/sec    20.901       {}
          ModeKernelWide<Int32Type>/1048576/1  611.261 GiB/sec  727.163 GiB/sec    18.961       {}
        ModeKernelNarrow<Int32Type>/1048576/1  601.834 GiB/sec  714.017 GiB/sec    18.640       {}
          ModeKernelWide<FloatType>/1048576/1  612.628 GiB/sec  716.591 GiB/sec    16.970       {}
      ModeKernelNarrow<BooleanType>/1048576/1  628.851 GiB/sec  691.936 GiB/sec    10.032       {}
      ModeKernelNarrow<BooleanType>/1048576/0   21.709 GiB/sec   22.397 GiB/sec     3.166       {}
         ModeKernelNarrow<Int8Type>/1048576/2  293.420 MiB/sec  286.770 MiB/sec    -2.266       {}
   
   -------------------------------------------------------------------------------------------------
   Regressions: (8)
   -------------------------------------------------------------------------------------------------
                                      benchmark         baseline        contender  change % counters
    ModeKernelNarrow<BooleanType>/1048576/10000    1.959 GiB/sec    1.856 GiB/sec    -5.274       {}
        ModeKernelNarrow<BooleanType>/1048576/2    1.957 GiB/sec    1.854 GiB/sec    -5.293       {}
      ModeKernelNarrow<BooleanType>/1048576/100    1.972 GiB/sec    1.866 GiB/sec    -5.374       {}
       ModeKernelNarrow<BooleanType>/1048576/10    1.974 GiB/sec    1.855 GiB/sec    -6.027       {}
          ModeKernelNarrow<Int8Type>/1048576/10  364.472 MiB/sec  342.415 MiB/sec    -6.052       {}
           ModeKernelNarrow<Int8Type>/1048576/0  484.491 MiB/sec  440.347 MiB/sec    -9.111       {}
         ModeKernelNarrow<Int8Type>/1048576/100  458.333 MiB/sec  416.045 MiB/sec    -9.226       {}
       ModeKernelNarrow<Int8Type>/1048576/10000  486.679 MiB/sec  439.186 MiB/sec    -9.759       {}
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org