Posted to common-issues@hadoop.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2023/02/06 09:30:00 UTC
[jira] [Commented] (HADOOP-18544) S3A: add option to disable probe for dir marker recreation on delete/rename.
[ https://issues.apache.org/jira/browse/HADOOP-18544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17684537#comment-17684537 ]
ASF GitHub Bot commented on HADOOP-18544:
-----------------------------------------
HarshitGupta11 opened a new pull request, #5354:
URL: https://github.com/apache/hadoop/pull/5354
### Description of PR
In applications that perform many single-file deletions in the same directory, a lot of time is wasted in `maybeCreateFakeParentDirectory()`.
Proposed: add an option to disable the probe, for use by applications that can tolerate parent directories sometimes disappearing after a cleanup.
File-by-file deletion is still woefully inefficient because of the HEAD request on every file, but there is no need to amplify the damage.
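The cost being discussed can be illustrated with a toy, in-memory model. This is a sketch only, not S3A code. The class `MarkerProbeSketch`, the flag `probeParentAfterDelete`, and the request accounting (one HEAD and one DELETE per file, one LIST-style probe per parent check, one PUT per recreated marker) are all assumptions made for illustration. The real option name and the real request pattern are defined by the patch itself.

```java
import java.util.TreeSet;

/** Toy model of an object store; illustrative only, not the S3A implementation. */
public class MarkerProbeSketch {
    // Sorted set of object keys; a key ending in "/" stands in for a directory marker.
    private final TreeSet<String> keys = new TreeSet<>();
    // Hypothetical switch standing in for the option proposed in this PR.
    private final boolean probeParentAfterDelete;
    // Count of simulated remote requests (assumed accounting, see lead-in).
    int requests = 0;

    MarkerProbeSketch(boolean probeParentAfterDelete) {
        this.probeParentAfterDelete = probeParentAfterDelete;
    }

    /** Test setup helper; not counted as a request. */
    void put(String key) {
        keys.add(key);
    }

    /** True if any key lies strictly under the given directory key (which ends in "/"). */
    boolean hasChildren(String dirKey) {
        String next = keys.higher(dirKey);
        return next != null && next.startsWith(dirKey);
    }

    /** A directory "exists" if it has a marker object or at least one child. */
    boolean dirExists(String dirKey) {
        return keys.contains(dirKey) || hasChildren(dirKey);
    }

    /** Delete one file, optionally probing for and recreating the parent marker. */
    void delete(String fileKey) {
        requests++;                       // HEAD on the file
        if (!keys.contains(fileKey)) {
            return;
        }
        requests++;                       // DELETE of the file
        keys.remove(fileKey);
        if (!probeParentAfterDelete) {
            return;                       // probe disabled: parent may vanish
        }
        String parent = fileKey.substring(0, fileKey.lastIndexOf('/') + 1);
        if (parent.isEmpty()) {
            return;                       // file was at the root, nothing to recreate
        }
        requests++;                       // probe: does the parent still exist?
        if (!dirExists(parent)) {
            requests++;                   // PUT of a fake directory marker
            keys.add(parent);
        }
    }

    /** Create `files` objects under "data/" and delete them one at a time. */
    static MarkerProbeSketch run(boolean probe, int files) {
        MarkerProbeSketch store = new MarkerProbeSketch(probe);
        for (int i = 0; i < files; i++) {
            store.put("data/part-" + i);
        }
        for (int i = 0; i < files; i++) {
            store.delete("data/part-" + i);
        }
        return store;
    }

    public static void main(String[] args) {
        for (boolean probe : new boolean[] {true, false}) {
            MarkerProbeSketch s = run(probe, 100);
            System.out.println("probe=" + probe
                + " requests=" + s.requests
                + " parentExists=" + s.dirExists("data/"));
        }
    }
}
```

In this model, deleting 100 files costs 301 simulated requests with the probe enabled and 200 with it disabled. With the probe disabled, the `data/` directory no longer exists once the last file is gone, which is the trade-off described above.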
### How was this patch tested?
The patch was tested against an S3 bucket in us-west-2.
### For code changes:
## Caveats:
Parent directories might disappear after a delete or a rename.
## What breaks:
The FileContext rename tests fail, because S3AFileSystem and FileContext use different probes and apply different rules.
- [ ] Does the title of this PR start with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')?
- [ ] Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation?
- [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)?
- [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, `NOTICE-binary` files?
> S3A: add option to disable probe for dir marker recreation on delete/rename.
> ----------------------------------------------------------------------------
>
> Key: HADOOP-18544
> URL: https://issues.apache.org/jira/browse/HADOOP-18544
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/s3
> Affects Versions: 3.3.4
> Reporter: Steve Loughran
> Assignee: Harshit Gupta
> Priority: Major
>
> In applications which do many single-file deletions on the same dir, a lot of time is wasted in {{maybeCreateFakeParentDirectory()}}.
> Proposed: add an option to disable the probe, for use by applications which are happy for parent dirs to sometimes disappear after a cleanup.
> file by file delete is still woefully inefficient because of the HEAD request on every file, but there's no need to amplify the damage.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)