Posted to mapreduce-issues@hadoop.apache.org by "Tarek Abouzeid (Jira)" <ji...@apache.org> on 2020/07/23 13:42:00 UTC

[jira] [Commented] (MAPREDUCE-7033) Map outputs implicitly rely on permissive umask for shuffle

    [ https://issues.apache.org/jira/browse/MAPREDUCE-7033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17163588#comment-17163588 ] 

Tarek Abouzeid commented on MAPREDUCE-7033:
-------------------------------------------

Hi,

An update to this ticket: in Hortonworks HDP, the umask setting for Tez was being fetched from the HDFS service umask setting, which was 077. Changing it to 022 fixed the problem.

Best Regards, 

> Map outputs implicitly rely on permissive umask for shuffle
> -----------------------------------------------------------
>
>                 Key: MAPREDUCE-7033
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7033
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>            Reporter: Jason Darrell Lowe
>            Assignee: Jason Darrell Lowe
>            Priority: Critical
>             Fix For: 3.1.0, 3.0.1
>
>         Attachments: MAPREDUCE-7033.001.patch, MAPREDUCE-7033.002.patch
>
>
> Map tasks do not explicitly set the permissions of their output files for shuffle.  In a secure cluster the shuffle service runs as a different user than the map task, so the output files must be group-readable for the shuffle service to serve the data during the shuffle phase.  If the user's UNIX umask is too restrictive (e.g., 077), the map task's file.out and file.out.index permissions can be too restrictive for the shuffle handler to access them.
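
For illustration only, here is a minimal sketch of the umask effect described above. It is written in Python on a POSIX system and is not the Hadoop patch. The helper names create_with_umask and make_group_readable are hypothetical, and the file names are borrowed from the description purely as labels. It shows that a file requested with mode 0666 loses group read access under umask 077 but keeps it under 022, and that setting the group-read bit explicitly removes the dependence on the umask.

```python
import os
import stat
import tempfile


def create_with_umask(path, umask, requested_mode=0o666):
    """Create an empty file under the given umask; return its permission bits."""
    old = os.umask(umask)  # os.umask returns the previous mask
    try:
        fd = os.open(path, os.O_CREAT | os.O_WRONLY | os.O_EXCL, requested_mode)
        os.close(fd)
    finally:
        os.umask(old)  # always restore the process-wide umask
    return stat.S_IMODE(os.stat(path).st_mode)


def make_group_readable(path):
    """Explicitly add group read permission, independent of the umask."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    os.chmod(path, mode | stat.S_IRGRP)
    return stat.S_IMODE(os.stat(path).st_mode)


with tempfile.TemporaryDirectory() as d:
    restrictive = os.path.join(d, "file.out")
    permissive = os.path.join(d, "file.out.index")

    mode_077 = create_with_umask(restrictive, 0o077)  # group/other bits masked off
    mode_022 = create_with_umask(permissive, 0o022)   # group/other read bits survive
    fixed = make_group_readable(restrictive)          # explicit fix for the 077 case

    print(oct(mode_077), oct(mode_022), oct(fixed))
```

Under these assumptions, a reader in the file's group (standing in for the shuffle service) cannot open the first file until its group-read bit is set explicitly, which is the failure mode the description reports.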



--
This message was sent by Atlassian Jira
(v8.3.4#803005)