You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Siddharth Seth (JIRA)" <ji...@apache.org> on 2014/11/03 01:55:33 UTC
[jira] [Created] (TEZ-1731) OnDiskMerger can end up clobbering
files across tasks with LocalDiskFetch
Siddharth Seth created TEZ-1731:
-----------------------------------
Summary: OnDiskMerger can end up clobbering files across tasks with LocalDiskFetch
Key: TEZ-1731
URL: https://issues.apache.org/jira/browse/TEZ-1731
Project: Apache Tez
Issue Type: Bug
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Critical
When an on disk fetch starts with LOCAL files (optimize.local.fetch), the filename used by the merger is based on the source file name. This name can be the same for all tasks reading the same input on the node - and can result in files being overwritten between tasks, depending on the order in which events are processed, and the dir allocated by the local dir-allocator.
Leads to ChecksumExceptions, and FileNotFoundExceptions during the merge.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)