Posted to dev@pig.apache.org by "Mridul Muralidharan (JIRA)" <ji...@apache.org> on 2011/04/08 10:37:05 UTC

[jira] [Commented] (PIG-1627) Flattening of bags with unknown schemas produces wrong schema

    [ https://issues.apache.org/jira/browse/PIG-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017347#comment-13017347 ] 

Mridul Muralidharan commented on PIG-1627:
------------------------------------------

The distinction between bytearray and an unknown schema is always confusing.
The description in https://issues.apache.org/jira/browse/PIG-1876, for example, indicates that an unknown schema implies bytearray (the description starts with: "Currently Pig map type is untyped, which means map value is always of bytearray(ie. unknown) type."), while this JIRA seems to indicate that this is not the case!

I have seen varying interpretations of what bytearray is supposed to mean in the JIRAs, Pig docs and Pig source code over the last 3+ years, not to mention in the various ilists and user source codebases. Some clarity in this regard would help.
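
A minimal sketch of the two cases being conflated, assuming a hypothetical input file 'data' (the alias and field names are also placeholders, and the describe output in the comments reflects the documented behavior rather than anything verified against a specific release):

{code}
-- No AS clause: the relation has no schema at all.
A = load 'data';
describe A;   -- Schema for A unknown.

-- Field named but not typed: the field defaults to bytearray.
B = load 'data' as (f1);
describe B;   -- B: {f1: bytearray}
{code}

In the first case neither the number of fields nor their types is known. In the second there is exactly one field whose type is bytearray. Reporting {bytearray} for a relation of the first kind therefore asserts more than is actually known.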

> Flattening of bags with unknown schemas produces wrong schema
> -------------------------------------------------------------
>
>                 Key: PIG-1627
>                 URL: https://issues.apache.org/jira/browse/PIG-1627
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.7.0
>            Reporter: Alan Gates
>            Assignee: Daniel Dai
>             Fix For: 0.9.0
>
>
> The following should produce an unknown schema:
> {code}
> A = load '/Users/gates/test/data/studenttab10';
> B = group A by $0;
> C = foreach B generate flatten(A);
> describe C;
> {code}
> Instead it gives
> {code}
> C: {bytearray}
> {code}
