You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Sai Hemanth Gantasala (Jira)" <ji...@apache.org> on 2021/08/27 02:37:00 UTC

[jira] [Updated] (HIVE-25303) CTAS hive.create.as.external.legacy tries to place data files in managed WH path

     [ https://issues.apache.org/jira/browse/HIVE-25303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sai Hemanth Gantasala updated HIVE-25303:
-----------------------------------------
    Description: 
Under legacy table creation mode (hive.create.as.external.legacy=true), when a database has been created in a specific LOCATION, in a session where that database is Used, tables are created using the following command:
{code:java}
CREATE TABLE <tablename> AS SELECT <select statement>{code}
should inherit the HDFS path from the database's location. Instead, Hive is trying to write the table data into /warehouse/tablespace/managed/hive/<database_directory_name>/<table_name>

+Design+: 
In the CTAS query, first data is written in the target directory (which happens in HS2) and then the table is created(This happens in HMS). So here two decisions are being made i) target directory location ii) how the table should be created (table type, sd e.t.c).
When HS2 needs a target location that needs to be set, it'll make create table dry run call to HMS (where table translation happens) and i) and ii) decisions are made within HMS and returns table object. Then HS2 will use this location set by HMS for placing the data.

  was:
Under legacy table creation mode (hive.create.as.external.legacy=true), when a database has been created in a specific LOCATION, in a session where that database is USEd, tables created using

CREATE TABLE <tablename> AS SELECT <select statement>

should inherit the HDFS path from the database's location.

Instead, Hive is trying to write the table data into /warehouse/tablespace/managed/hive/<database_directory_name>/<table_name>


> CTAS hive.create.as.external.legacy tries to place data files in managed WH path
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-25303
>                 URL: https://issues.apache.org/jira/browse/HIVE-25303
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2, Standalone Metastore
>            Reporter: Sai Hemanth Gantasala
>            Assignee: Sai Hemanth Gantasala
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Under legacy table creation mode (hive.create.as.external.legacy=true), when a database has been created in a specific LOCATION, in a session where that database is Used, tables are created using the following command:
> {code:java}
> CREATE TABLE <tablename> AS SELECT <select statement>{code}
> should inherit the HDFS path from the database's location. Instead, Hive is trying to write the table data into /warehouse/tablespace/managed/hive/<database_directory_name>/<table_name>
> +Design+: 
> In the CTAS query, first data is written in the target directory (which happens in HS2) and then the table is created(This happens in HMS). So here two decisions are being made i) target directory location ii) how the table should be created (table type, sd e.t.c).
> When HS2 needs a target location that needs to be set, it'll make create table dry run call to HMS (where table translation happens) and i) and ii) decisions are made within HMS and returns table object. Then HS2 will use this location set by HMS for placing the data.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)