You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Russell Jurney (JIRA)" <ji...@apache.org> on 2010/06/04 06:40:58 UTC

[jira] Created: (PIG-1436) Print number of records outputted at each step of a Pig script

Print number of records outputted at each step of a Pig script
--------------------------------------------------------------

                 Key: PIG-1436
                 URL: https://issues.apache.org/jira/browse/PIG-1436
             Project: Pig
          Issue Type: New Feature
          Components: grunt
    Affects Versions: 0.7.0
            Reporter: Russell Jurney
            Priority: Minor
             Fix For: 0.8.0


I often run a script multiple times, or have to go and look through Hadoop task logs, to figure out where I broke a long script in such a way that I get 0 records out of it.  I think this is a common problem.

If someone can point me in the right direction, I can make a pass at this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PIG-1436) Print number of records outputted at each step of a Pig script

Posted by "Richard Ding (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12887446#action_12887446 ] 

Richard Ding commented on PIG-1436:
-----------------------------------

Russell,

PIG-1478 implemented a callback mechanism that allows users to retrieve stats after each job. Will this meet your needs? 

> Print number of records outputted at each step of a Pig script
> --------------------------------------------------------------
>
>                 Key: PIG-1436
>                 URL: https://issues.apache.org/jira/browse/PIG-1436
>             Project: Pig
>          Issue Type: New Feature
>          Components: grunt
>    Affects Versions: 0.7.0
>            Reporter: Russell Jurney
>            Assignee: Richard Ding
>            Priority: Minor
>             Fix For: 0.8.0
>
>
> I often run a script multiple times, or have to go and look through Hadoop task logs, to figure out where I broke a long script in such a way that I get 0 records out of it.  I think this is a common problem.
> If someone can point me in the right direction, I can make a pass at this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (PIG-1436) Print number of records outputted at each step of a Pig script

Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olga Natkovich resolved PIG-1436.
---------------------------------

    Resolution: Duplicate

This looks like duplicate of PIG-1478. Please, re-open if this is not the case

> Print number of records outputted at each step of a Pig script
> --------------------------------------------------------------
>
>                 Key: PIG-1436
>                 URL: https://issues.apache.org/jira/browse/PIG-1436
>             Project: Pig
>          Issue Type: New Feature
>          Components: grunt
>    Affects Versions: 0.7.0
>            Reporter: Russell Jurney
>            Assignee: Richard Ding
>            Priority: Minor
>             Fix For: 0.8.0
>
>
> I often run a script multiple times, or have to go and look through Hadoop task logs, to figure out where I broke a long script in such a way that I get 0 records out of it.  I think this is a common problem.
> If someone can point me in the right direction, I can make a pass at this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PIG-1436) Print number of records outputted at each step of a Pig script

Posted by "Alan Gates (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12875638#action_12875638 ] 

Alan Gates commented on PIG-1436:
---------------------------------

Russell,

Richard's already doing a lot of work in this area.  Check out  PIG-1389, PIG-908, PIG-864, PIG-809  to see if those will meet your needs.  If not, please discuss with him as his current project is to add script usage statistics.

> Print number of records outputted at each step of a Pig script
> --------------------------------------------------------------
>
>                 Key: PIG-1436
>                 URL: https://issues.apache.org/jira/browse/PIG-1436
>             Project: Pig
>          Issue Type: New Feature
>          Components: grunt
>    Affects Versions: 0.7.0
>            Reporter: Russell Jurney
>            Priority: Minor
>             Fix For: 0.8.0
>
>
> I often run a script multiple times, or have to go and look through Hadoop task logs, to figure out where I broke a long script in such a way that I get 0 records out of it.  I think this is a common problem.
> If someone can point me in the right direction, I can make a pass at this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (PIG-1436) Print number of records outputted at each step of a Pig script

Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olga Natkovich reassigned PIG-1436:
-----------------------------------

    Assignee: Richard Ding

I think Richard is already doing this as part of his stats work

> Print number of records outputted at each step of a Pig script
> --------------------------------------------------------------
>
>                 Key: PIG-1436
>                 URL: https://issues.apache.org/jira/browse/PIG-1436
>             Project: Pig
>          Issue Type: New Feature
>          Components: grunt
>    Affects Versions: 0.7.0
>            Reporter: Russell Jurney
>            Assignee: Richard Ding
>            Priority: Minor
>             Fix For: 0.8.0
>
>
> I often run a script multiple times, or have to go and look through Hadoop task logs, to figure out where I broke a long script in such a way that I get 0 records out of it.  I think this is a common problem.
> If someone can point me in the right direction, I can make a pass at this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.