Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/03 16:35:02 UTC

[GitHub] [beam] kennknowles opened a new issue, #18209: java DataflowRunner should warn if the stagingLocation has a TTL

kennknowles opened a new issue, #18209:
URL: https://github.com/apache/beam/issues/18209

   We have seen a few customers hit a hard-to-track-down bug: the staging bucket has a TTL, and staged files are TTL-deleted while jobs still need them.
   
   This can happen in at least two ways:
   
   1. Long-lived batch jobs and streaming jobs can reference staged files arbitrarily long after submission, and fail in bad ways if those files have been deleted.
   2. Some customers have even hit a race condition in which the "check file already exists" step succeeds when a job is submitted, but the file is TTL-deleted before the job actually starts. (This sounds crazy, but it can happen if, for example, the TTL is 7 days and jobs run every 7 days.)
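
To make the race in item 2 concrete, here is a small illustrative timeline. It is not Beam or Dataflow code: the class `StagingTtlRace`, the method `existsAt`, and the specific timestamps are made up for the example, and it simplifies TTL deletion to happen exactly when the object reaches the configured age.

```java
import java.time.Duration;
import java.time.Instant;

/** Illustrative timeline only; a hypothetical model, not Beam or Dataflow code. */
final class StagingTtlRace {

    /** True if an object created at {@code created} is still present at {@code when} under an age-based TTL. */
    static boolean existsAt(Instant created, Duration ttl, Instant when) {
        return when.isBefore(created.plus(ttl));
    }

    public static void main(String[] args) {
        // File staged by the first weekly run (assumed timestamp).
        Instant staged = Instant.parse("2022-06-01T00:00:00Z");
        Duration ttl = Duration.ofDays(7);

        // The next weekly run is submitted just under 7 days later...
        Instant nextSubmit = staged.plus(Duration.ofDays(7)).minusSeconds(30);
        // ...and the job needs the file a few minutes after submission.
        Instant workerFetch = nextSubmit.plus(Duration.ofMinutes(5));

        System.out.println("exists at submit: " + existsAt(staged, ttl, nextSubmit));  // true
        System.out.println("exists at fetch:  " + existsAt(staged, ttl, workerFetch)); // false
    }
}
```

The existence check at submission passes, so the file is not re-uploaded, yet it is gone by the time the job needs it.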
   
   I'm hoping it's not hard to check whether the staging location has a TTL and warn if so.
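
A minimal sketch of what such a check could look like, assuming the runner can already obtain the staging bucket's lifecycle rules. Everything here is hypothetical: `StagingTtlCheck`, `LifecycleRule`, and `warningFor` are illustrative names, not existing Beam or Dataflow APIs, and the rule model is reduced to an action type plus an optional age condition in days (the shape GCS uses for an age-based `Delete` rule). Delete rules with other conditions are ignored in this sketch.

```java
import java.util.List;
import java.util.Optional;

/** Hypothetical helper; not part of Beam. */
final class StagingTtlCheck {

    /** Simplified, hypothetical view of one bucket lifecycle rule. */
    static final class LifecycleRule {
        final String action;   // e.g. "Delete" or "SetStorageClass"
        final Integer ageDays; // null when the rule has no age condition

        LifecycleRule(String action, Integer ageDays) {
            this.action = action;
            this.ageDays = ageDays;
        }
    }

    /** Returns a warning message if any age-based Delete rule applies, otherwise empty. */
    static Optional<String> warningFor(String stagingLocation, List<LifecycleRule> rules) {
        Integer shortest = null;
        for (LifecycleRule rule : rules) {
            boolean ageBasedDelete = "Delete".equals(rule.action) && rule.ageDays != null;
            if (ageBasedDelete && (shortest == null || rule.ageDays < shortest)) {
                shortest = rule.ageDays; // keep the most aggressive TTL
            }
        }
        if (shortest == null) {
            return Optional.empty();
        }
        return Optional.of(String.format(
            "stagingLocation %s deletes objects after %d day(s); "
                + "staged files may disappear while jobs still need them.",
            stagingLocation, shortest));
    }
}
```

The runner could log the returned message at job submission. Fetching the rules from the bucket is left out on purpose, since that part depends on the storage client in use.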
   
   Imported from Jira [BEAM-1507](https://issues.apache.org/jira/browse/BEAM-1507). Original Jira may contain additional context.
   Reported by: dhalperi.

