Posted to dev@pig.apache.org by "Karina Hauser (JIRA)" <ji...@apache.org> on 2014/02/24 04:32:19 UTC

[jira] [Created] (PIG-3777) Pig 12.0 Documentation

Karina Hauser created PIG-3777:
----------------------------------

             Summary: Pig 12.0 Documentation
                 Key: PIG-3777
                 URL: https://issues.apache.org/jira/browse/PIG-3777
             Project: Pig
          Issue Type: Wish
          Components: documentation
    Affects Versions: 0.12.0
            Reporter: Karina Hauser
            Priority: Trivial


I think the example for SAMPLE is wrong.

In this example, a scalar expression is used (it will sample approximately 1000 records from the input).

a = load 'a.txt';
b = group a all;
c = foreach b generate COUNT(a) as num_rows;
e = sample d 1000/num_rows;

There is no relation d, and I get a "Projected field [num_rows] does not exist in schema" error. I think the last line needs to be:
e = sample a 1000/c.num_rows;


