You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/21 00:13:00 UTC
[jira] [Work logged] (HIVE-25401) Insert overwrite a table which location is on other cluster fail in kerberos cluster
[ https://issues.apache.org/jira/browse/HIVE-25401?focusedWorklogId=684270&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-684270 ]
ASF GitHub Bot logged work on HIVE-25401:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 21/Nov/21 00:12
Start Date: 21/Nov/21 00:12
Worklog Time Spent: 10m
Work Description: github-actions[bot] commented on pull request #2544:
URL: https://github.com/apache/hive/pull/2544#issuecomment-974730527
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.
Feel free to reach out on the dev@hive.apache.org list if the patch is in need of reviews.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: gitbox-unsubscribe@hive.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
Issue Time Tracking
-------------------
Worklog Id: (was: 684270)
Time Spent: 50m (was: 40m)
> Insert overwrite a table which location is on other cluster fail in kerberos cluster
> --------------------------------------------------------------------------------------
>
> Key: HIVE-25401
> URL: https://issues.apache.org/jira/browse/HIVE-25401
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Affects Versions: 2.3.0, 3.1.2
> Environment: hive 2.3
> hadoop3 cluster with kerberos
> Reporter: Max Xie
> Assignee: Max Xie
> Priority: Minor
> Labels: pull-request-available
> Attachments: HIVE-25401.patch, image-2021-07-29-14-25-23-418.png
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> we have tow hdfs clusters with kerberos security, it means that mapreduce task need delegation tokens to authenticate namenode when hive on mapreduce run.
> Insert overwrite a table which location is on other cluster fail in kerberos cluster. For example,
> # yarn cluster's default fs is hdfs://cluster1
> # tb1's location is hdfs://cluster1/tb1
> # tb2's location is hdfs://cluster2/tb2
> # sql `INSERT OVERWRITE TABLE tb2 SELECT * from tb1` run on yarn cluster will fail
>
> reduce task error log:
> !image-2021-07-29-14-25-23-418.png!
> How to fix:
> After dig it, web found mapreduce job just obtain delegation tokens for input files in FileInputFormat. But Hive context get extendal scratchDir base on table's location, If the table 's location is on other cluster, the delegation token will not be obtained.
> So we need to obtaine delegation tokens for hive scratchDirs before hive submit mapreduce job.
>
> How to test:
> no test
>
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)