You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/02/07 10:45:00 UTC

[jira] [Work logged] (HADOOP-17386) fs.s3a.buffer.dir to be under Yarn container path on yarn applications

     [ https://issues.apache.org/jira/browse/HADOOP-17386?focusedWorklogId=721835&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-721835 ]

ASF GitHub Bot logged work on HADOOP-17386:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 07/Feb/22 10:44
            Start Date: 07/Feb/22 10:44
    Worklog Time Spent: 10m 
      Work Description: monthonk commented on pull request #3908:
URL: https://github.com/apache/hadoop/pull/3908#issuecomment-1031325713


   Rebased and tested in `eu-west-1` by running
   ```
   mvn -Dparallel-tests -DtestsThreadCount=32 clean verify
   ```
   
   Test result
   ```
   [INFO] Results:
   [INFO] 
   [WARNING] Tests run: 1063, Failures: 0, Errors: 0, Skipped: 185
   [INFO] Results:
   [INFO] 
   [WARNING] Tests run: 108, Failures: 0, Errors: 0, Skipped: 68
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 721835)
    Time Spent: 50m  (was: 40m)

> fs.s3a.buffer.dir to be under Yarn container path on yarn applications
> ----------------------------------------------------------------------
>
>                 Key: HADOOP-17386
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17386
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 3.3.0
>            Reporter: Steve Loughran
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> # fs.s3a.buffer.dir defaults to hadoop.tmp.dir which is /tmp or similar
> # we use this for storing file blocks during upload
> # staging committers use it for all files in a task, which can be a lot more
> # a lot of systems don't clean up /tmp until reboot -and if they stay up for a long time then they accrue files written through s3a staging committer from spark containers which fail
> Fix: use ${env.LOCAL_DIRS:-${hadoop.tmp.dir}}/s3a as the option so that if env.LOCAL_DIRS is set is used over hadoop.tmp.dir. YARN-deployed apps will use that for the buffer dir. When the app container is destroyed, so is the directory.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org