Posted to issues@drill.apache.org by "Padma Penumarthy (JIRA)" <ji...@apache.org> on 2017/08/25 21:36:01 UTC

[jira] [Updated] (DRILL-5697) Improve performance of filter operator for pattern matching

     [ https://issues.apache.org/jira/browse/DRILL-5697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Padma Penumarthy updated DRILL-5697:
------------------------------------
    Labels: ready-to-commit  (was: )

> Improve performance of filter operator for pattern matching
> -----------------------------------------------------------
>
>                 Key: DRILL-5697
>                 URL: https://issues.apache.org/jira/browse/DRILL-5697
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Execution - Flow
>    Affects Versions: 1.11.0
>            Reporter: Padma Penumarthy
>            Assignee: Padma Penumarthy
>              Labels: ready-to-commit
>
> Queries that filter with the SQL LIKE operator use the Java regex library for pattern matching. However, for simple patterns such as %abc (ends with abc), abc% (starts with abc), and %abc% (contains abc), implementing the match with simple special-case code instead of the regex library was observed to give a good performance boost (4-6x). The idea is to use special-case code for these simple, common cases and fall back to the Java regex library for more complicated ones. This should provide a good performance benefit for the most common cases.
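
The idea described above can be sketched roughly as follows. This is only an illustration, not the code from the actual patch: the class name SimpleLikeMatcher, the Kind enum, and the toRegex helper are hypothetical, the sketch works on java.lang.String for simplicity, and it assumes no ESCAPE clause is in use.

```java
import java.util.regex.Pattern;

// Hypothetical sketch: classify a LIKE pattern once, then match each value
// with plain String operations where possible and a regex otherwise.
final class SimpleLikeMatcher {
    enum Kind { CONSTANT, STARTS_WITH, ENDS_WITH, CONTAINS, COMPLEX }

    final Kind kind;
    private final String literal; // text to look for in the simple cases
    private final Pattern regex;  // compiled only for the COMPLEX case

    SimpleLikeMatcher(String likePattern) {
        boolean lead = likePattern.startsWith("%");
        boolean trail = likePattern.length() > 1 && likePattern.endsWith("%");
        String core = likePattern.substring(lead ? 1 : 0,
                likePattern.length() - (trail ? 1 : 0));

        if (core.indexOf('%') >= 0 || core.indexOf('_') >= 0) {
            // Wildcards remain inside the pattern: fall back to the regex library.
            kind = Kind.COMPLEX;
            literal = null;
            regex = Pattern.compile(toRegex(likePattern), Pattern.DOTALL);
        } else {
            literal = core;
            regex = null;
            kind = lead ? (trail ? Kind.CONTAINS : Kind.ENDS_WITH)
                        : (trail ? Kind.STARTS_WITH : Kind.CONSTANT);
        }
    }

    // Translate LIKE wildcards to regex; every other character is quoted.
    // Assumption: no ESCAPE character handling.
    private static String toRegex(String likePattern) {
        StringBuilder sb = new StringBuilder();
        for (char c : likePattern.toCharArray()) {
            if (c == '%') {
                sb.append(".*");
            } else if (c == '_') {
                sb.append('.');
            } else {
                sb.append(Pattern.quote(String.valueOf(c)));
            }
        }
        return sb.toString();
    }

    boolean matches(String input) {
        switch (kind) {
            case CONSTANT:    return input.equals(literal);
            case STARTS_WITH: return input.startsWith(literal);
            case ENDS_WITH:   return input.endsWith(literal);
            case CONTAINS:    return input.contains(literal);
            default:          return regex.matcher(input).matches();
        }
    }
}
```

For example, new SimpleLikeMatcher("%abc") is classified as ENDS_WITH and never touches the regex library, while a pattern such as a%b_c still goes through java.util.regex. The pattern is classified once at construction so that the per-row cost in the simple cases is a single String call.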


