Posted to issues@tez.apache.org by "Hitesh Shah (JIRA)" <ji...@apache.org> on 2015/05/14 22:29:00 UTC

[jira] [Updated] (TEZ-1894) No checkOutputSpecs for MROutput

     [ https://issues.apache.org/jira/browse/TEZ-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hitesh Shah updated TEZ-1894:
-----------------------------
    Target Version/s: 0.8.0  (was: 0.6.0)

> No checkOutputSpecs for MROutput
> --------------------------------
>
>                 Key: TEZ-1894
>                 URL: https://issues.apache.org/jira/browse/TEZ-1894
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Jeff Zhang
>            Assignee: Jeff Zhang
>            Priority: Critical
>         Attachments: TEZ-1894-1.patch
>
>
> MROutput does not check whether the destination folder already exists, which can produce confusing results.
> E.g. if I run the Tez WordCount example with 5 partitions, it generates 5 part files. If I then run the same WordCount example with 1 partition, it overwrites only one part file, so the results of the 2 DAGs coexist in the same folder.
> {code}
> Found 6 items
> -rw-r--r--   1 jzhang supergroup          0 2014-12-28 14:38 output/_SUCCESS
> -rw-r--r--   1 jzhang supergroup         15 2014-12-28 14:38 output/part-v001-o000-00000
> -rw-r--r--   1 jzhang supergroup       8198 2014-12-28 14:37 output/part-v001-o000-00001
> -rw-r--r--   1 jzhang supergroup       7372 2014-12-28 14:37 output/part-v001-o000-00002
> -rw-r--r--   1 jzhang supergroup       8575 2014-12-28 14:37 output/part-v001-o000-00003
> -rw-r--r--   1 jzhang supergroup       6755 2014-12-28 14:37 output/part-v001-o000-00004
> {code}
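
The behaviour described above can be reproduced with a minimal sketch, assuming a local directory stands in for the HDFS output folder. It uses only java.nio, not the Tez or Hadoop APIs, and it is not the attached patch. The class OutputSpecCheck and the helpers writeParts and countParts are hypothetical names chosen for illustration; checkOutputSpecs borrows its name from the issue title and mirrors the kind of check Hadoop's FileOutputFormat performs for MapReduce jobs, which fails when the output directory already exists.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;

public class OutputSpecCheck {

    // Hypothetical check: refuse to write into an output directory that already exists.
    static void checkOutputSpecs(Path outputDir) throws IOException {
        if (Files.exists(outputDir)) {
            throw new FileAlreadyExistsException(
                "Output directory " + outputDir + " already exists");
        }
    }

    // Simulates one run that writes `partitions` part files into outputDir.
    // With check == false, existing files with the same name are silently overwritten
    // and files left by an earlier run with more partitions stay in place.
    static void writeParts(Path outputDir, int partitions, boolean check) throws IOException {
        if (check) {
            checkOutputSpecs(outputDir);
        }
        Files.createDirectories(outputDir);
        for (int i = 0; i < partitions; i++) {
            Path part = outputDir.resolve(String.format("part-v001-o000-%05d", i));
            byte[] data = ("run with " + partitions + " partitions\n")
                .getBytes(StandardCharsets.UTF_8);
            Files.write(part, data); // default options create or truncate the file
        }
    }

    // Counts the part files currently present in outputDir.
    static long countParts(Path outputDir) throws IOException {
        try (Stream<Path> files = Files.list(outputDir)) {
            return files
                .filter(p -> p.getFileName().toString().startsWith("part-"))
                .count();
        }
    }

    public static void main(String[] args) throws IOException {
        Path output = Files.createTempDirectory("tez-1894-demo").resolve("output");

        writeParts(output, 5, false); // first run: 5 part files
        writeParts(output, 1, false); // second run: overwrites only part 00000
        System.out.println("part files without check: " + countParts(output));

        try {
            writeParts(output, 1, true); // with the check, the rerun is rejected up front
        } catch (FileAlreadyExistsException e) {
            System.out.println("rejected: existing output directory detected");
        }
    }
}
```

Without the check, the second run leaves four stale part files from the first run next to its own single part file, which matches the listing above. Failing early on an existing directory is one possible design; whether MROutput should fail or clean the directory first is a separate decision for the fix.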


