You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "László Bodor (Jira)" <ji...@apache.org> on 2021/03/11 13:48:00 UTC
[jira] [Updated] (HIVE-24873) TPCDS query51 doesn't vectorize:
Only PTF directly under reduce-shuffle is supported
[ https://issues.apache.org/jira/browse/HIVE-24873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
László Bodor updated HIVE-24873:
--------------------------------
Summary: TPCDS query51 doesn't vectorize: Only PTF directly under reduce-shuffle is supported (was: TPCDS query51 doesn't vectorize: reduce-shuffle is supported)
> TPCDS query51 doesn't vectorize: Only PTF directly under reduce-shuffle is supported
> --------------------------------------------------------------------------------------
>
> Key: HIVE-24873
> URL: https://issues.apache.org/jira/browse/HIVE-24873
> Project: Hive
> Issue Type: Sub-task
> Reporter: László Bodor
> Priority: Major
>
> {code}
> EXPLAIN VECTORIZATION DETAIL WITH web_v1 as (
> select
> ws_item_sk item_sk, d_date,
> sum(sum(ws_sales_price))
> over (partition by ws_item_sk order by d_date rows between unbounded preceding and current row) cume_sales
> from web_sales
> ,date_dim
> where ws_sold_date_sk=d_date_sk
> and d_month_seq between 1214 and 1214+11
> and ws_item_sk is not NULL
> group by ws_item_sk, d_date),
> store_v1 as (
> select
> ss_item_sk item_sk, d_date,
> sum(sum(ss_sales_price))
> over (partition by ss_item_sk order by d_date rows between unbounded preceding and current row) cume_sales
> from store_sales
> ,date_dim
> where ss_sold_date_sk=d_date_sk
> and d_month_seq between 1214 and 1214+11
> and ss_item_sk is not NULL
> group by ss_item_sk, d_date)
> select *
> from (select item_sk
> ,d_date
> ,web_sales
> ,store_sales
> ,max(web_sales)
> over (partition by item_sk order by d_date rows between unbounded preceding and current row) web_cumulative
> ,max(store_sales)
> over (partition by item_sk order by d_date rows between unbounded preceding and current row) store_cumulative
> from (select case when web.item_sk is not null then web.item_sk else store.item_sk end item_sk
> ,case when web.d_date is not null then web.d_date else store.d_date end d_date
> ,web.cume_sales web_sales
> ,store.cume_sales store_sales
> from web_v1 web full outer join store_v1 store on (web.item_sk = store.item_sk
> and web.d_date = store.d_date)
> )x )y
> where web_cumulative > store_cumulative
> order by item_sk
> ,d_date
> limit 100;
> {code}
> {code}
> Reducer 2
> notVectorizedReason: PTF operator: Only PTF directly under reduce-shuffle is supported
> window functions:
> window function: GenericUDAFSumHiveDecimal
> window frame: ROWS PRECEDING(MAX)~CURRENT
> ...
> Reducer 8
> notVectorizedReason: PTF operator: Only PTF directly under reduce-shuffle is supported
> window functions:
> window function: GenericUDAFSumHiveDecimal
> window frame: ROWS PRECEDING(MAX)~CURRENT |
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)