You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "Gao Zhong Liang (JIRA)" <ji...@apache.org> on 2015/03/24 18:23:53 UTC

[jira] [Updated] (OOZIE-547) build workflow progress information in Oozie

     [ https://issues.apache.org/jira/browse/OOZIE-547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gao Zhong Liang updated OOZIE-547:
----------------------------------
    Attachment: oozie-547-1.patch

The attached patch(oozie-547-1.patch) contains the support of crontab format job scheduling  for coordinator jobs.

> build workflow progress information in Oozie
> --------------------------------------------
>
>                 Key: OOZIE-547
>                 URL: https://issues.apache.org/jira/browse/OOZIE-547
>             Project: Oozie
>          Issue Type: New Feature
>            Reporter: Hadoop QA
>            Assignee: zhu jin wei
>         Attachments: oozie-547-1.patch, oozie-547.patch
>
>
> For a user, knowing progress of her workflow is always desirable. This ticket is to introduce this support to Oozie.
> I know it's a hard problem. For my initial effort, I plan to start with simple workflows that do not contain decision nodes or fork/join nodes, i.e., chain type workflows. I plan to use percentage of finished actions as the overall wf progress estimate.
> Going forward we can improve the estimation by:
> 1) handle general workflows that contain decision, fork/join nodes;
> 2) incorporate the action level progress into wf level progress estimation to make the estimate better. To be more specific:
> In the case of "opaque" actions like pig/hive/jaql where the status can only be 0% or 100% (or failure) we plug that value into the overall DAG status of 0-100%. If a DAG had say 4 opaque actions, the progress would move in discrete steps 0, 25, 50, 75, 100%.  For the m/r actions where the JobTracker
> gives values between 0-100% for an action then the overall progress will be smoother. We can do same thing for pig/hive/jaql actions as well if they expose their own progress info.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)