You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Steve Loughran (Jira)" <ji...@apache.org> on 2021/11/10 15:59:00 UTC

[jira] [Commented] (HADOOP-17979) Interface EtagSource to allow FileStatus subclasses to provide etags

    [ https://issues.apache.org/jira/browse/HADOOP-17979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17441808#comment-17441808 ] 

Steve Loughran commented on HADOOP-17979:
-----------------------------------------

once this is in MAPREDUCE-7341 will use it to recover from failures, with the recovery helper of  HADOOP-17981

> Interface EtagSource to allow FileStatus subclasses to provide etags
> --------------------------------------------------------------------
>
>                 Key: HADOOP-17979
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17979
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: fs, fs/azure, fs/s3
>    Affects Versions: 3.3.1
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Various objects stores provide etags in their FileStatus implementations
> Make these values accessible
> * new interface {{EtagFromFileStatus}} to be implemented when provided
> * filesystem.md to declare requirements of etags (constant between LIST and HEAD)...
> * path capabilities for (a) etag and (b) etags consistent across rename
> Add implementation for abfs, later s3a (and google gcs)
> This is initially to handle recovery from certain failures in job commit against abfs, but it would allow a cloud-ready version of distcp to track etags of uploaded files, so diff properly.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org