You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Padma Penumarthy (JIRA)" <ji...@apache.org> on 2017/10/23 21:25:00 UTC

[jira] [Created] (DRILL-5899) No need to do isAscii check for simple pattern matcher

Padma Penumarthy created DRILL-5899:
---------------------------------------

             Summary: No need to do isAscii check for simple pattern matcher
                 Key: DRILL-5899
                 URL: https://issues.apache.org/jira/browse/DRILL-5899
             Project: Apache Drill
          Issue Type: Improvement
          Components: Execution - Flow
            Reporter: Padma Penumarthy
            Assignee: Padma Penumarthy
            Priority: Critical


For simple pattern matcher, we do not have to do isAscii check. 
UTF-8 encoding ensures that no UTF-8 character is a prefix of any other valid character. So, for the 4 simple patterns we have i.e. startsWith, endsWith, contains and constant, we can get rid of this check. This will help improve performance. 




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)