Posted to dev@pig.apache.org by "Thejas M Nair (JIRA)" <ji...@apache.org> on 2011/09/21 00:02:09 UTC

[jira] [Updated] (PIG-1883) Pig's progress estimation should account for parallel job executions

     [ https://issues.apache.org/jira/browse/PIG-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thejas M Nair updated PIG-1883:
-------------------------------

    Attachment: PIG-1883.4.patch

I have made some fixes to the way progress logging is handled when both the old and new timers are enabled.
I have also removed the -s command-line option that controlled this behavior and replaced it with a new property, because the -s option will be deprecated in the future.

> Pig's progress estimation should account for parallel job executions
> --------------------------------------------------------------------
>
>                 Key: PIG-1883
>                 URL: https://issues.apache.org/jira/browse/PIG-1883
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Laukik Chitnis
>            Assignee: Laukik Chitnis
>         Attachments: PIG-1883-2.patch, PIG-1883-3.patch, PIG-1883.4.patch
>
>
> Currently, Pig's progress estimate is the percentage of completed jobs out of the total number of MR jobs. However, because the MR operators are arranged in a DAG (so more than one job may be submitted to run in parallel), the estimate can be improved by considering the number of jobs on the critical path rather than the total number of jobs.
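
To illustrate the difference between the two estimates described in the issue, here is a minimal sketch in Python. It is not the code in the attached patches. The job names, the `deps` mapping, and all function names are hypothetical, and it assumes the critical path is simply the longest dependency chain, counted in jobs.

```python
def longest_chain(deps, completed=frozenset()):
    """Return the number of unfinished jobs on the longest dependency chain.

    deps maps each job name to the list of jobs it depends on (hypothetical
    representation of the MR plan). Jobs in `completed` contribute nothing.
    """
    memo = {}

    def depth(job):
        if job in completed:
            return 0
        if job not in memo:
            # A job adds 1 to the longest chain among its dependencies.
            memo[job] = 1 + max((depth(d) for d in deps[job]), default=0)
        return memo[job]

    return max((depth(job) for job in deps), default=0)


def progress_by_count(deps, completed):
    """Estimate described as current: completed jobs over total jobs."""
    return len(completed) / len(deps) if deps else 1.0


def progress_by_critical_path(deps, completed):
    """Estimate described as proposed: share of the critical path finished."""
    total = longest_chain(deps)
    if total == 0:
        return 1.0
    remaining = longest_chain(deps, frozenset(completed))
    return 1 - remaining / total


# Hypothetical plan: three independent load jobs feeding a single join job.
deps = {
    "load_a": [],
    "load_b": [],
    "load_c": [],
    "join": ["load_a", "load_b", "load_c"],
}
done = {"load_a", "load_b", "load_c"}

count_estimate = progress_by_count(deps, done)
critical_estimate = progress_by_critical_path(deps, done)
```

In this hypothetical plan the three load jobs can run in parallel, so once they finish the job-count estimate reports 75% while the critical-path estimate reports 50%, since one of the two sequential stages is still pending.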

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira