You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "jenu9417 (via GitHub)" <gi...@apache.org> on 2023/02/19 08:12:36 UTC

[GitHub] [hudi] jenu9417 commented on issue #7991: Higher number of S3 HEAD requests, while writing data to S3.

jenu9417 commented on issue #7991:
URL: https://github.com/apache/hudi/issues/7991#issuecomment-1435921620

   Also, Just noticed while analysing S3 access logs,
   the number BatchDeleteObject API calls is also way higher. Even more higher than HEAD requests.
   ````
   BATCH.DELETE.OBJECT	data/
   BATCH.DELETE.OBJECT	data/testfolder/
   BATCH.DELETE.OBJECT	data/testfolder/.hoodie/
   BATCH.DELETE.OBJECT	data/testfolder/
   ````
   Such batch delete object requests for the same path has been continuously invoked repeatedly, roughly around 500 times for the ingestion of 1 record from Kafka to S3.
   Can you please help us understand why these HEAD and BATCH DELETE OBJECT requests are way higher even for 1 record ingestion.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org