Posted to dev@hive.apache.org by "Deepak Jaiswal (JIRA)" <ji...@apache.org> on 2018/01/06 23:22:00 UTC

[jira] [Created] (HIVE-18392) load data should rename files consistent with insert statements (non bucketed tables only)

Deepak Jaiswal created HIVE-18392:
-------------------------------------

             Summary: load data should rename files consistent with insert statements (non bucketed tables only)
                 Key: HIVE-18392
                 URL: https://issues.apache.org/jira/browse/HIVE-18392
             Project: Hive
          Issue Type: Sub-task
            Reporter: Deepak Jaiswal
            Assignee: Deepak Jaiswal


INSERT statements create files whose names end with 0000_0, 0001_0, etc., whereas LOAD DATA keeps the input file names. The resulting inconsistent naming convention makes SMB (sort-merge-bucket) joins difficult in some scenarios and may cause trouble for other types of queries in the future.
We need a consistent naming convention.
For non-bucketed tables, Hive should rename all loaded files, regardless of how the user named them.
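
To illustrate the intended renaming, here is a minimal sketch in Python. It is not Hive's implementation. The function name insert_style_names, the six-digit zero padding, the "_0" suffix, and the sorted ordering of inputs are all assumptions chosen only so that the generated names end with 0000_0, 0001_0, etc., as described above.

```python
def insert_style_names(input_names, width=6, attempt=0):
    """Map user-supplied file names to insert-style names (hypothetical helper).

    Assumptions for illustration only: names are a zero-padded sequence
    number of `width` digits followed by "_<attempt>", and inputs are
    numbered in sorted order so the mapping is deterministic.
    """
    return {
        name: f"{index:0{width}d}_{attempt}"
        for index, name in enumerate(sorted(input_names))
    }


# The original names are discarded no matter how the user chose them.
mapping = insert_style_names(["sales.csv", "part-a.txt", "data.orc"])
for old_name, new_name in mapping.items():
    print(f"{old_name} -> {new_name}")
```

With this scheme, every file in a non-bucketed table follows one pattern, whether it arrived through an insert or through load data.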



