You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Ivo Lenting (JIRA)" <ji...@apache.org> on 2016/06/27 20:34:52 UTC

[jira] [Created] (PIG-4938) [PiggyBank] XPath returns empty values when using aggregation method

Ivo Lenting created PIG-4938:
--------------------------------

             Summary: [PiggyBank] XPath returns empty values when using aggregation method
                 Key: PIG-4938
                 URL: https://issues.apache.org/jira/browse/PIG-4938
             Project: Pig
          Issue Type: Bug
          Components: piggybank
    Affects Versions: 0.15.0
            Reporter: Ivo Lenting
            Priority: Minor


I have a xml file which I want to parse using the piggybank XPath udf.

The xml is:
<Aa name="test1">	
	<Bb Cc="1"/>
	<Bb Cc="1"/>
	<Bb Cc="1"/>
	<Bb Cc="1"/>
	<Dd>test2</Dd>
</Aa>

The xpath contains a sum aggregate to sum all Cc values. 
The complete pig script:

REGISTER piggybank.jar
DEFINE XPath org.apache.pig.piggybank.evaluation.xml.XPath();
DEFINE XPathAll org.apache.pig.piggybank.evaluation.xml.XPathAll();
XMLFile = LOAD '/demo/test.xml' using org.apache.pig.piggybank.storage.XMLLoader('Aa') as (xmlContents:chararray);
MyOutput = FOREACH XMLFile GENERATE XPathAll(xmlContents,'Aa/@name',true,false).$0 AS Aa:chararray,XPath(xmlContents,'sum(Aa/Bb/@Cc)') AS Cc:Double, XPath(xmlContents,'Aa/Dd') AS Dd:chararray;
STORE MyOutput INTO 'Output/MyOutput' USING PigStorage('|');

MyOutput:
test1||test2

So i'm missing the aggregate 4 in column 2.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)