You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@abdera.apache.org by "Xu Zhang (JIRA)" <ji...@apache.org> on 2008/04/04 02:45:30 UTC

[jira] Closed: (ABDERA-140) Using cached data does not give me the expected result

     [ https://issues.apache.org/jira/browse/ABDERA-140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xu Zhang closed ABDERA-140.
---------------------------

    Resolution: Fixed

Logged the bug for the wrong project.  Sorry!

> Using cached data does not give me the expected result
> ------------------------------------------------------
>
>                 Key: ABDERA-140
>                 URL: https://issues.apache.org/jira/browse/ABDERA-140
>             Project: Abdera
>          Issue Type: Bug
>            Reporter: Xu Zhang
>
> I was trying to run the following Pig script with the latest Pig stuff.  Since essentially I was streaming 2 identical sets of data, I was expecting the final result which is the count of the name field to contain all even numbers.  However, lots of odd number showed up in the actual result.
> {code}
> define X `perl -ne 'chomp $_; print "$_\n"' - ./user/pig/tests/data/singlefile/studenttab10k` cache('/user/pig/tests/data/singlefile/studenttab10k');
> A = load '/user/pig/tests/data/singlefile/studenttab10k';
> B = stream A through X as (name, age, gpa);
> C = group B by name;
> D = foreach C generate COUNT(B.$0);
> store D into 'results_22';
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.