You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Cheolsoo Park (JIRA)" <ji...@apache.org> on 2014/03/27 00:56:15 UTC

[jira] [Commented] (PIG-3749) PigPerformance - data in the map gets lost during parsing

    [ https://issues.apache.org/jira/browse/PIG-3749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948679#comment-13948679 ] 

Cheolsoo Park commented on PIG-3749:
------------------------------------

I don't seem to be able to reproduce it. I used "keren	1	2	qt	3	4	5.0	aaaabbbb	mccccddddeeeedmffffgggghhhh" as input, and it gives me the following-
{code}
(keren	1	2	qt	3	4	5.0	aaaabbbb	mccccddddeeeemffffgggghhhh,,,,,,,,)
(keren	1	2	qt	3	4	5.0	aaaabbbb	mccccddddeeeemffffgggghhhh,,,)
{code}
I think I am not loading the data properly. Do you mind attaching a sample dataset to the jira?

Also, can you post a patch that can be easily applied with {{patch < filenamename}} in the root directory? Not a big deal for small patches, but it's helpful to reviewers.

Thanks!

> PigPerformance - data in the map gets lost during parsing
> ---------------------------------------------------------
>
>                 Key: PIG-3749
>                 URL: https://issues.apache.org/jira/browse/PIG-3749
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.12.0
>            Reporter: Keren Ouaknine
>            Assignee: Keren Ouaknine
>             Fix For: 0.12.1
>
>         Attachments: PIG-3749.patch
>
>
> Create a Pigmix sample dataset which looks as follow:
> keren	1	2	qt	3	4	5.0	aaaabbbb	mccccddddeeeedmffffgggghhhh
> Launch the following query:
> A = load 'page_views_sample.txt' using org.apache.pig.test.pigmix.udf.PigPerformanceLoader()
>     as (user, action, timespent, query_term, ip_addr, timestamp, estimated_revenue, page_info, page_links);
> store A into 'L1out_A';
> B = foreach A generate user, (int)action as action, (map[])page_info as page_info, flatten((bag{tuple(map[])})page_links) as page_links;
> store B into 'L1out_B';
> The result looks like this: 
> keren	1	[b#bbb,a#aaa]	[d#,e#eee,c#ccc]
> keren	1	[b#bbb,a#aaa]	[f#fff,g#ggg,h#hhh
> It is missing the 'ddd' value and a closing bracket.
> Thanks,
> Keren



--
This message was sent by Atlassian JIRA
(v6.2#6252)