You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Ivo Lenting (JIRA)" <ji...@apache.org> on 2016/06/27 20:34:52 UTC
[jira] [Created] (PIG-4938) [PiggyBank] XPath returns empty values
when using aggregation method
Ivo Lenting created PIG-4938:
--------------------------------
Summary: [PiggyBank] XPath returns empty values when using aggregation method
Key: PIG-4938
URL: https://issues.apache.org/jira/browse/PIG-4938
Project: Pig
Issue Type: Bug
Components: piggybank
Affects Versions: 0.15.0
Reporter: Ivo Lenting
Priority: Minor
I have a xml file which I want to parse using the piggybank XPath udf.
The xml is:
<Aa name="test1">
<Bb Cc="1"/>
<Bb Cc="1"/>
<Bb Cc="1"/>
<Bb Cc="1"/>
<Dd>test2</Dd>
</Aa>
The xpath contains a sum aggregate to sum all Cc values.
The complete pig script:
REGISTER piggybank.jar
DEFINE XPath org.apache.pig.piggybank.evaluation.xml.XPath();
DEFINE XPathAll org.apache.pig.piggybank.evaluation.xml.XPathAll();
XMLFile = LOAD '/demo/test.xml' using org.apache.pig.piggybank.storage.XMLLoader('Aa') as (xmlContents:chararray);
MyOutput = FOREACH XMLFile GENERATE XPathAll(xmlContents,'Aa/@name',true,false).$0 AS Aa:chararray,XPath(xmlContents,'sum(Aa/Bb/@Cc)') AS Cc:Double, XPath(xmlContents,'Aa/Dd') AS Dd:chararray;
STORE MyOutput INTO 'Output/MyOutput' USING PigStorage('|');
MyOutput:
test1||test2
So i'm missing the aggregate 4 in column 2.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)