You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@kudu.apache.org by "Attila Bukor (Code Review)" <ge...@cloudera.org> on 2019/11/11 19:40:52 UTC
[kudu-CR] KUDU-1938 Make UTF-8 truncation faster pt 2
Hello Kudu Jenkins, Adar Dembo, Grant Henke,
I'd like you to reexamine a change. Please visit
http://gerrit.cloudera.org:8080/14354
to look at the new patch set (#10).
Change subject: KUDU-1938 Make UTF-8 truncation faster pt 2
......................................................................
KUDU-1938 Make UTF-8 truncation faster pt 2
Adds Intel Intrinsics (up to SSE4.2) to speed up the processing of UTF8
character counting in the case of ASCII-only chunks (fast path) by
doubling the chunk size in a single pass from 64 to 128 bits.
Before:
[ RUN ] CharUtilTest.StressTestUtf8
[ OK ] CharUtilTest.StressTestUtf8 (7746 ms)
[ RUN ] CharUtilTest.StressTestAscii
[ OK ] CharUtilTest.StressTestAscii (1028 ms)
After:
[ RUN ] CharUtilTest.StressTestUtf8
[ OK ] CharUtilTest.StressTestUtf8 (10599 ms)
[ RUN ] CharUtilTest.StressTestAscii
[ OK ] CharUtilTest.StressTestAscii (676 ms)
Change-Id: I9a491157dd5c8b4815030bbda921a0afc0bafd28
---
M src/kudu/util/char_util-test.cc
M src/kudu/util/char_util.cc
2 files changed, 28 insertions(+), 6 deletions(-)
git pull ssh://gerrit.cloudera.org:29418/kudu refs/changes/54/14354/10
--
To view, visit http://gerrit.cloudera.org:8080/14354
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings
Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9a491157dd5c8b4815030bbda921a0afc0bafd28
Gerrit-Change-Number: 14354
Gerrit-PatchSet: 10
Gerrit-Owner: Attila Bukor <ab...@apache.org>
Gerrit-Reviewer: Adar Dembo <ad...@cloudera.com>
Gerrit-Reviewer: Attila Bukor <ab...@apache.org>
Gerrit-Reviewer: Grant Henke <gr...@apache.org>
Gerrit-Reviewer: Kudu Jenkins (120)