You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "gaoxiaoqing (Jira)" <ji...@apache.org> on 2021/10/13 06:47:00 UTC
[jira] [Updated] (IMPALA-10968) optimize regular expression
matching
[ https://issues.apache.org/jira/browse/IMPALA-10968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
gaoxiaoqing updated IMPALA-10968:
---------------------------------
Description:
eg:
{code:java}
select
case when p__latest_referrer rlike '.*ssid=0.*'
and city = '北京' then 1 when p__latest_referrer rlike '.*ssid=0.*'
and city = '上海' then 2 end as x
from
event_ros_p7
where
p__latest_referrer;
{code}
the above sql need match pattern '.*ssid=0.*' two times on each line,if we store the match result on each line, it only need match one time on each line.
the key is property p__latest_referrer and pattern ".*ssid", the value is true or false means whether match or not. after one line is calculated, the kv store will be released
was:
eg:
{code:java}
select
case when p__latest_referrer rlike '.*ssid=0.*'
and city = '北京' then 1 when p__latest_referrer rlike '.*ssid=0.*'
and city = '上海' then 2 end as x
from
event_ros_p7
where
p__latest_referrer;
{code}
the above sql need match pattern '.*ssid=0.*' two times on each line,if we store the match result on each line, it only need match one time on each line.
key: property p__latest_referrer and pattern ".*ssid"
value: true or false
> optimize regular expression matching
> ------------------------------------
>
> Key: IMPALA-10968
> URL: https://issues.apache.org/jira/browse/IMPALA-10968
> Project: IMPALA
> Issue Type: Improvement
> Reporter: gaoxiaoqing
> Assignee: gaoxiaoqing
> Priority: Major
>
> eg:
> {code:java}
> select
> case when p__latest_referrer rlike '.*ssid=0.*'
> and city = '北京' then 1 when p__latest_referrer rlike '.*ssid=0.*'
> and city = '上海' then 2 end as x
> from
> event_ros_p7
> where
> p__latest_referrer;
> {code}
> the above sql need match pattern '.*ssid=0.*' two times on each line,if we store the match result on each line, it only need match one time on each line.
> the key is property p__latest_referrer and pattern ".*ssid", the value is true or false means whether match or not. after one line is calculated, the kv store will be released
>
>
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org