You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Keren Ouaknine (JIRA)" <ji...@apache.org> on 2014/03/26 01:10:17 UTC
[jira] [Updated] (PIG-3749) PigPerformance - data in the map gets
lost during parsing
[ https://issues.apache.org/jira/browse/PIG-3749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Keren Ouaknine updated PIG-3749:
--------------------------------
Fix Version/s: 0.12.1
Release Note: Bug in PigPerformanceLoader when reading bytes, the loop which looks for a termination character in a map is missing the null value (Ascii=0)
Status: Patch Available (was: Open)
> PigPerformance - data in the map gets lost during parsing
> ---------------------------------------------------------
>
> Key: PIG-3749
> URL: https://issues.apache.org/jira/browse/PIG-3749
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.12.0
> Reporter: Keren Ouaknine
> Fix For: 0.12.1
>
>
> Create a Pigmix sample dataset which looks as follow:
> keren 1 2 qt 3 4 5.0 aaaabbbb mccccddddeeeedmffffgggghhhh
> Launch the following query:
> A = load 'page_views_sample.txt' using org.apache.pig.test.pigmix.udf.PigPerformanceLoader()
> as (user, action, timespent, query_term, ip_addr, timestamp, estimated_revenue, page_info, page_links);
> store A into 'L1out_A';
> B = foreach A generate user, (int)action as action, (map[])page_info as page_info, flatten((bag{tuple(map[])})page_links) as page_links;
> store B into 'L1out_B';
> The result looks like this:
> keren 1 [b#bbb,a#aaa] [d#,e#eee,c#ccc]
> keren 1 [b#bbb,a#aaa] [f#fff,g#ggg,h#hhh
> It is missing the 'ddd' value and a closing bracket.
> Thanks,
> Keren
--
This message was sent by Atlassian JIRA
(v6.2#6252)