You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Padma Penumarthy (JIRA)" <ji...@apache.org> on 2017/10/23 21:25:00 UTC
[jira] [Created] (DRILL-5899) No need to do isAscii check for
simple pattern matcher
Padma Penumarthy created DRILL-5899:
---------------------------------------
Summary: No need to do isAscii check for simple pattern matcher
Key: DRILL-5899
URL: https://issues.apache.org/jira/browse/DRILL-5899
Project: Apache Drill
Issue Type: Improvement
Components: Execution - Flow
Reporter: Padma Penumarthy
Assignee: Padma Penumarthy
Priority: Critical
For simple pattern matcher, we do not have to do isAscii check.
UTF-8 encoding ensures that no UTF-8 character is a prefix of any other valid character. So, for the 4 simple patterns we have i.e. startsWith, endsWith, contains and constant, we can get rid of this check. This will help improve performance.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)