You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Nita Dembla (JIRA)" <ji...@apache.org> on 2018/05/24 14:11:00 UTC
[jira] [Created] (HIVE-19694) Create Materialized View statement
should check for MV name conflicts before running MV's SQL statement.
Nita Dembla created HIVE-19694:
----------------------------------
Summary: Create Materialized View statement should check for MV name conflicts before running MV's SQL statement.
Key: HIVE-19694
URL: https://issues.apache.org/jira/browse/HIVE-19694
Project: Hive
Issue Type: Bug
Components: Hive
Affects Versions: 3.0.0
Reporter: Nita Dembla
Fix For: 3.0.1
If the CREATE MATERIALIZE VIEW statement refers to a mv name that already exists, the statement runs the SQL on cluster and Move task returns an error at the very end.
This unnecessarily uses up cluster resources and user time.
{code:java}
0: jdbc:hive2://localhost:10007/tpcds_bin_par> CREATE MATERIALIZED VIEW mv_store_sales_item_store
. . . . . . . . . . . . . . . . . . . . . . .> ENABLE REWRITE AS (
. . . . . . . . . . . . . . . . . . . . . . .> select ss_item_sk,
. . . . . . . . . . . . . . . . . . . . . . .> ss_store_sk,
. . . . . . . . . . . . . . . . . . . . . . .> sum(ss_quantity) as ss_quantity,
. . . . . . . . . . . . . . . . . . . . . . .> sum(ss_ext_wholesale_cost) as ss_ext_wholesale_cost,
. . . . . . . . . . . . . . . . . . . . . . .> sum(ss_net_paid) as ss_net_paid,
. . . . . . . . . . . . . . . . . . . . . . .> sum(ss_net_profit) as ss_net_profit
. . . . . . . . . . . . . . . . . . . . . . .> from store_sales
. . . . . . . . . . . . . . . . . . . . . . .> group by ss_item_sk,ss_store_sk
. . . . . . . . . . . . . . . . . . . . . . .> );
INFO : Compiling command(queryId=root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037): CREATE MATERIALIZED VIEW mv_store_sales_item_store
ENABLE REWRITE AS (
select ss_item_sk,
| `ss_store_sk` bigint, |
| `ss_quantity` bigint, |
| `ss_ext_wholesale_cost` double, |
| `ss_net_paid` double, |
| `ss_net_profit` double) |
. . . . . . . . . . . . . . . . . . . . . . .> from store_sales
. . . . . . . . . . . . . . . . . . . . . . .> group by ss_item_sk,ss_store_sk
. . . . . . . . . . . . . . . . . . . . . . .> );
INFO : Compiling command(queryId=root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037): CREATE MATERIALIZED VIEW mv_store_sales_item_store
ENABLE REWRITE AS (
select ss_item_sk,
ss_store_sk,
sum(ss_quantity) as ss_quantity,
sum(ss_ext_wholesale_cost) as ss_ext_wholesale_cost,
sum(ss_net_paid) as ss_net_paid,
sum(ss_net_profit) as ss_net_profit
from store_sales
group by ss_item_sk,ss_store_sk
)
INFO : Semantic Analysis Completed
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:ss_item_sk, type:bigint, comment:null), FieldSchema(name:ss_store_sk, type:bigint, comment:null), FieldSchema(name:ss_quantity, type:bigint, comment:null), FieldSchema(name:ss_ext_wholesale_cost, type:double, comment:null), FieldSchema(name:ss_net_paid, type:double, comment:null), FieldSchema(name:ss_net_profit, type:double, comment:null)], properties:null)
INFO : Completed compiling command(queryId=root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037); Time taken: 3.652 seconds
INFO : Executing command(queryId=root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037): CREATE MATERIALIZED VIEW mv_store_sales_item_store
ENABLE REWRITE AS (
select ss_item_sk,
ss_store_sk,
sum(ss_quantity) as ss_quantity,
sum(ss_ext_wholesale_cost) as ss_ext_wholesale_cost,
sum(ss_net_paid) as ss_net_paid,
sum(ss_net_profit) as ss_net_profit
from store_sales
group by ss_item_sk,ss_store_sk
)
INFO : Query ID = root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037
INFO : Total jobs = 1
INFO : Launching Job 1 out of 1
INFO : Starting task [Stage-1:MAPRED] in serial mode
INFO : Subscribed to counters: [] for queryId: root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037
INFO : Session is already open
INFO : Dag name: CREATE MATERIALIZED V...tem_sk,ss_store_sk
) (Stage-1)
INFO : Status: Running (Executing on YARN cluster with App id application_1525123931791_0151)
----------------------------------------------------------------------------------------------
VERTICES MODE STATUS TOTAL COMPLETED RUNNING PENDING FAILED KILLED
----------------------------------------------------------------------------------------------
Map 1 .......... llap SUCCEEDED 1682 1682 0 0 0 0
Reducer 2 ...... llap SUCCEEDED 1009 1009 0 0 0 7
----------------------------------------------------------------------------------------------
VERTICES: 02/02 [==========================>>] 100% ELAPSED TIME: 1734.00 s
----------------------------------------------------------------------------------------------
INFO : Status: DAG finished successfully in 1731.89 seconds
INFO :
INFO : Query Execution Summary
INFO : ----------------------------------------------------------------------------------------------
INFO : OPERATION DURATION
INFO : ----------------------------------------------------------------------------------------------
INFO : Compile Query 3.65s
INFO : Prepare Plan 0.45s
INFO : Get Query Coordinator (AM) 0.00s
INFO : Submit Plan 0.17s
INFO : Start DAG 0.60s
INFO : Run DAG 1731.89s
INFO : ----------------------------------------------------------------------------------------------
INFO :
INFO : Task Execution Summary
INFO : ----------------------------------------------------------------------------------------------
INFO : VERTICES DURATION(ms) CPU_TIME(ms) GC_TIME(ms) INPUT_RECORDS OUTPUT_RECORDS
INFO : ----------------------------------------------------------------------------------------------
INFO : Map 1 928170.00 0 0 28,800,426,268 28,760,206,232
INFO : Reducer 2 1099992.00 0 0 28,760,206,232 0
INFO : ----------------------------------------------------------------------------------------------
INFO :
INFO : LLAP IO Summary
INFO : ----------------------------------------------------------------------------------------------
INFO : VERTICES ROWGROUPS META_HIT META_MISS DATA_HIT DATA_MISS ALLOCATION USED TOTAL_IO
INFO : ----------------------------------------------------------------------------------------------
INFO : Map 1 2890797 33830 1888 40.57GB 401.64GB 832.29GB 777.79GB 117756.65s
INFO : ----------------------------------------------------------------------------------------------
INFO :
INFO : FileSystem Counters Summary
INFO :
INFO : Scheme: HDFS
INFO : ----------------------------------------------------------------------------------------------
INFO : VERTICES BYTES_READ READ_OPS LARGE_READ_OPS BYTES_WRITTEN WRITE_OPS
INFO : ----------------------------------------------------------------------------------------------
INFO : Map 1 401.65GB 29867 0 0B 0
INFO : Reducer 2 0B 4036 0 5.95GB 3027
INFO : ----------------------------------------------------------------------------------------------
INFO :
INFO : Scheme: FILE
INFO : ----------------------------------------------------------------------------------------------
INFO : VERTICES BYTES_READ READ_OPS LARGE_READ_OPS BYTES_WRITTEN WRITE_OPS
INFO : ----------------------------------------------------------------------------------------------
INFO : Map 1 0B 0 0 691.51GB 0
INFO : Reducer 2 490.46GB 0 0 0B 0
INFO : ----------------------------------------------------------------------------------------------
INFO :
INFO : org.apache.tez.common.counters.DAGCounter:
INFO : NUM_KILLED_TASKS: 7
INFO : NUM_SUCCEEDED_TASKS: 2691
INFO : TOTAL_LAUNCHED_TASKS: 2698
INFO : AM_CPU_MILLISECONDS: 884400
INFO : AM_GC_TIME_MILLIS: 1052
INFO : File System Counters:
INFO : FILE_BYTES_READ: 490462333002
INFO : FILE_BYTES_WRITTEN: 691512269321
INFO : FILE_READ_OPS: 0
INFO : FILE_LARGE_READ_OPS: 0
INFO : FILE_WRITE_OPS: 0
INFO : HDFS_BYTES_READ: 401649861525
INFO : HDFS_BYTES_WRITTEN: 5953929405
INFO : HDFS_READ_OPS: 33903
INFO : HDFS_LARGE_READ_OPS: 0
INFO : HDFS_WRITE_OPS: 3027
INFO : org.apache.tez.common.counters.TaskCounter:
INFO : REDUCE_INPUT_GROUPS: 301902000
INFO : REDUCE_INPUT_RECORDS: 28760206232
INFO : COMBINE_INPUT_RECORDS: 0
INFO : SPILLED_RECORDS: 57520412464
INFO : NUM_SHUFFLED_INPUTS: 1697138
INFO : NUM_SKIPPED_INPUTS: 0
INFO : NUM_FAILED_SHUFFLE_INPUTS: 0
INFO : MERGED_MAP_OUTPUTS: 1697138
INFO : INPUT_RECORDS_PROCESSED: 28802064
INFO : INPUT_SPLIT_LENGTH_BYTES: 1423169426915
INFO : OUTPUT_RECORDS: 28760206232
INFO : OUTPUT_LARGE_RECORDS: 0
INFO : OUTPUT_BYTES: 1251477312279
INFO : OUTPUT_BYTES_WITH_OVERHEAD: 1249819249865
INFO : OUTPUT_BYTES_PHYSICAL: 691471524553
INFO : ADDITIONAL_SPILLS_BYTES_WRITTEN: 490687645770
INFO : ADDITIONAL_SPILLS_BYTES_READ: 490687645770
INFO : ADDITIONAL_SPILL_COUNT: 0
INFO : SHUFFLE_CHUNK_COUNT: 1682
INFO : SHUFFLE_BYTES: 691471524553
INFO : SHUFFLE_BYTES_DECOMPRESSED: 1249819249865
INFO : SHUFFLE_BYTES_TO_MEM: 691471524553
INFO : SHUFFLE_BYTES_TO_DISK: 0
INFO : SHUFFLE_BYTES_DISK_DIRECT: 0
INFO : NUM_MEM_TO_DISK_MERGES: 0
INFO : NUM_DISK_TO_DISK_MERGES: 0
INFO : SHUFFLE_PHASE_TIME: 208623266
INFO : MERGE_PHASE_TIME: 248665618
INFO : FIRST_EVENT_RECEIVED: 6992
INFO : LAST_EVENT_RECEIVED: 66502682
INFO : HIVE:
INFO : CREATED_FILES: 1009
INFO : DESERIALIZE_ERRORS: 0
INFO : RECORDS_IN_Map_1: 28800426268
INFO : RECORDS_OUT_1_tpcds_bin_partitioned_orc_10000.mv_store_sales_it: 301902000
INFO : RECORDS_OUT_INTERMEDIATE_Map_1: 28760206232
INFO : RECORDS_OUT_INTERMEDIATE_Reducer_2: 0
INFO : RECORDS_OUT_OPERATOR_FS_11: 301902000
INFO : RECORDS_OUT_OPERATOR_GBY_10: 301902000
INFO : RECORDS_OUT_OPERATOR_GBY_8: 28760206232
INFO : RECORDS_OUT_OPERATOR_MAP_0: 0
INFO : RECORDS_OUT_OPERATOR_RS_9: 28760206232
INFO : RECORDS_OUT_OPERATOR_SEL_7: 28800426268
INFO : RECORDS_OUT_OPERATOR_TS_0: 28800426268
INFO : Shuffle Errors:
INFO : BAD_ID: 0
INFO : CONNECTION: 0
INFO : IO_ERROR: 0
INFO : WRONG_LENGTH: 0
INFO : WRONG_MAP: 0
INFO : WRONG_REDUCE: 0
INFO : Shuffle Errors_Reducer_2_INPUT_Map_1:
INFO : BAD_ID: 0
INFO : CONNECTION: 0
INFO : IO_ERROR: 0
INFO : WRONG_LENGTH: 0
INFO : WRONG_MAP: 0
INFO : WRONG_REDUCE: 0
INFO : TaskCounter_Map_1_INPUT_store_sales:
INFO : INPUT_RECORDS_PROCESSED: 28802064
INFO : INPUT_SPLIT_LENGTH_BYTES: 1423169426915
INFO : TaskCounter_Map_1_OUTPUT_Reducer_2:
INFO : ADDITIONAL_SPILLS_BYTES_READ: 0
INFO : ADDITIONAL_SPILLS_BYTES_WRITTEN: 0
INFO : ADDITIONAL_SPILL_COUNT: 0
INFO : OUTPUT_BYTES: 1251477312279
INFO : OUTPUT_BYTES_PHYSICAL: 691471524553
INFO : OUTPUT_BYTES_WITH_OVERHEAD: 1249819249865
INFO : OUTPUT_LARGE_RECORDS: 0
INFO : OUTPUT_RECORDS: 28760206232
INFO : SHUFFLE_CHUNK_COUNT: 1682
INFO : SPILLED_RECORDS: 28760206232
INFO : TaskCounter_Reducer_2_INPUT_Map_1:
INFO : ADDITIONAL_SPILLS_BYTES_READ: 490687645770
INFO : ADDITIONAL_SPILLS_BYTES_WRITTEN: 490687645770
INFO : COMBINE_INPUT_RECORDS: 0
INFO : FIRST_EVENT_RECEIVED: 6992
INFO : LAST_EVENT_RECEIVED: 66502682
INFO : MERGED_MAP_OUTPUTS: 1697138
INFO : MERGE_PHASE_TIME: 248665618
INFO : NUM_DISK_TO_DISK_MERGES: 0
INFO : NUM_FAILED_SHUFFLE_INPUTS: 0
INFO : NUM_MEM_TO_DISK_MERGES: 0
INFO : NUM_SHUFFLED_INPUTS: 1697138
INFO : NUM_SKIPPED_INPUTS: 0
INFO : REDUCE_INPUT_GROUPS: 301902000
INFO : REDUCE_INPUT_RECORDS: 28760206232
INFO : SHUFFLE_BYTES: 691471524553
INFO : SHUFFLE_BYTES_DECOMPRESSED: 1249819249865
INFO : SHUFFLE_BYTES_DISK_DIRECT: 0
INFO : SHUFFLE_BYTES_TO_DISK: 0
INFO : SHUFFLE_BYTES_TO_MEM: 691471524553
INFO : SHUFFLE_PHASE_TIME: 208623266
INFO : SPILLED_RECORDS: 28760206232
INFO : TaskCounter_Reducer_2_OUTPUT_out_Reducer_2:
INFO : OUTPUT_RECORDS: 0
INFO : org.apache.hadoop.hive.llap.counters.LlapIOCounters:
INFO : ALLOCATED_BYTES: 832288587776
INFO : ALLOCATED_USED_BYTES: 777785688450
INFO : CACHE_HIT_BYTES: 40573598684
INFO : CACHE_MISS_BYTES: 401640241544
INFO : CONSUMER_TIME_NS: 103780723121700
INFO : DECODE_TIME_NS: 67854956903872
INFO : HDFS_TIME_NS: 40407374232025
INFO : METADATA_CACHE_HIT: 33830
INFO : METADATA_CACHE_MISS: 1888
INFO : NUM_DECODED_BATCHES: 2890797
INFO : NUM_VECTOR_BATCHES: 28802069
INFO : ROWS_EMITTED: 28800426268
INFO : SELECTED_ROWGROUPS: 2890797
INFO : TOTAL_IO_TIME_NS: 117756651175367
INFO : org.apache.hadoop.hive.llap.counters.LlapWmCounters:
INFO : GUARANTEED_QUEUED_NS: 0
INFO : GUARANTEED_RUNNING_NS: 0
INFO : SPECULATIVE_QUEUED_NS: 127227283691687
INFO : SPECULATIVE_RUNNING_NS: 475493970727392
INFO : org.apache.hadoop.hive.ql.exec.tez.HiveInputCounters:
INFO : GROUPED_INPUT_SPLITS_Map_1: 1682
INFO : INPUT_DIRECTORIES_Map_1: 1824
INFO : INPUT_FILES_Map_1: 4323
INFO : RAW_INPUT_SPLITS_Map_1: 8292
INFO : Starting task [Stage-2:DEPENDENCY_COLLECTION] in serial mode
INFO : Starting task [Stage-0:MOVE] in serial mode
INFO : Moving data to directory hdfs://ctr-e138-1518143905142-92974-01-000002.hwx.site:8020/apps/hive/warehouse/tpcds_bin_partitioned_orc_10000.db/mv_store_sales_item_store from hdfs://ctr-e138-1518143905142-92974-01-000002.hwx.site:8020/apps/hive/warehouse/tpcds_bin_partitioned_orc_10000.db/.hive-staging_hive_2018-05-24_03-43-30_960_8962535148229486995-408/-ext-10002
INFO : Starting task [Stage-4:DDL] in serial mode
ERROR : FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask. Table already exists: tpcds_bin_partitioned_orc_10000.mv_store_sales_item_store
INFO : Completed executing command(queryId=root_20180524034330_21fca7f6-ed5a-492c-88e9-913d4120b037); Time taken: 1734.952 seconds
Error: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask. Table already exists: tpcds_bin_partitioned_orc_10000.mv_store_sales_item_store (state=08S01,code=1{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)