Posted to dev@pig.apache.org by "Mridul Muralidharan (JIRA)" <ji...@apache.org> on 2011/04/08 10:37:05 UTC
[jira] [Commented] (PIG-1627) Flattening of bags with unknown schemas produces wrong schema
[ https://issues.apache.org/jira/browse/PIG-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017347#comment-13017347 ]
Mridul Muralidharan commented on PIG-1627:
------------------------------------------
The distinction between bytearray and an unknown schema is always confusing.
The description in https://issues.apache.org/jira/browse/PIG-1876, for example, indicates that an unknown schema implies bytearray (it starts with: "Currently Pig map type is untyped, which means map value is always of bytearray(ie. unknown) type."), while this JIRA seems to indicate that this is not the case!
I have seen varying interpretations of what bytearray is supposed to mean in the JIRAs, Pig docs and Pig source code over the last 3+ years, not to mention in the various ilists and user source codebases. Some clarity in this regard would be good and would reduce the confusion.
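To make the two notions concrete, here is a minimal sketch in Pig Latin that reuses the load path from the issue description. The field names (name, age, gpa) are assumptions chosen only for illustration, and the comments describe the behaviour I would expect from the docs rather than verified output:
{code}
-- No AS clause: nothing is known about A, not even how many fields it has.
A = load '/Users/gates/test/data/studenttab10';
describe A;  -- expected to report that the schema for A is unknown

-- AS clause without types (hypothetical field names): three fields are known,
-- and each one defaults to bytearray.
B = load '/Users/gates/test/data/studenttab10' as (name, age, gpa);
describe B;  -- expected to list name, age and gpa as bytearray
{code}
Read this way, a bytearray field is a known field whose type has not been declared, whereas an unknown schema says nothing about which fields exist at all. Whether that is the intended interpretation is exactly what needs clarifying.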
> Flattening of bags with unknown schemas produces wrong schema
> -------------------------------------------------------------
>
> Key: PIG-1627
> URL: https://issues.apache.org/jira/browse/PIG-1627
> Project: Pig
> Issue Type: Bug
> Components: impl
> Affects Versions: 0.7.0
> Reporter: Alan Gates
> Assignee: Daniel Dai
> Fix For: 0.9.0
>
>
> The following should produce an unknown schema:
> {code}
> A = load '/Users/gates/test/data/studenttab10';
> B = group A by $0;
> C = foreach B generate flatten(A);
> describe C;
> {code}
> Instead it gives
> {code}
> C: {bytearray}
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira