Posted to dev@pig.apache.org by "Rohini Palaniswamy (JIRA)" <ji...@apache.org> on 2015/10/05 22:51:26 UTC

[jira] [Updated] (PIG-3957) Refactor out resetting input key in TezDagBuilder

     [ https://issues.apache.org/jira/browse/PIG-3957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rohini Palaniswamy updated PIG-3957:
------------------------------------
    Assignee: Rohini Palaniswamy  (was: Daniel Dai)

   This causes some queries to fail with one of the errors below when a vertex takes input from both a union and a replicated join table. The input keys are overwritten in TezDagBuilder, so both of them end up pointing to the same input.

{code}
Caused by: java.io.IOException: Please check if you are invoking next() even after it returned false. For usage, please refer to KeyValueReader javadocs
	at org.apache.tez.runtime.library.api.KeyValueReader.hasCompletedProcessing(KeyValueReader.java:77)
	at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.moveToNextInput(UnorderedKVReader.java:190)
	at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next(UnorderedKVReader.java:118)
	at org.apache.pig.backend.hadoop.executionengine.tez.plan.operator.POValueInputTez.getNextTuple(POValueInputTez.java:124)
{code}

or 

{code}
Caused by: java.lang.ClassCastException: org.apache.pig.impl.io.NullableTuple cannot be cast to org.apache.pig.data.Tuple
	at org.apache.pig.backend.hadoop.executionengine.tez.plan.operator.POValueInputTez.getNextTuple(POValueInputTez.java:126)
	at org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.processInput(PhysicalOperator.java:307)
	at org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POForEach.getNextTuple(POForEach.java:252)
{code}
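
The overwrite described above can be illustrated with a small standalone Java sketch. It is not Pig's code and not the actual fix: the Reader class, the method names and the "scope-NN" keys are all hypothetical, chosen only to show how resetting the key on every reader makes two operators read the same input, while a targeted replacement leaves the unrelated reader alone.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class InputKeyOverwriteSketch {

    // Hypothetical stand-in for an operator that reads one named input
    // (in the spirit of POValueInputTez; this is NOT Pig's class).
    static final class Reader {
        final String name;
        String inputKey;

        Reader(String name, String inputKey) {
            this.name = name;
            this.inputKey = inputKey;
        }

        String read(Map<String, String> inputs) {
            return inputs.get(inputKey);
        }
    }

    // Problematic pattern: every reader gets the same key, so a reader
    // that belonged to a different input is silently redirected.
    static void resetAllKeys(List<Reader> readers, String newKey) {
        for (Reader r : readers) {
            r.inputKey = newKey;
        }
    }

    // Targeted pattern: only readers that currently point at oldKey are updated.
    static void replaceKey(List<Reader> readers, String oldKey, String newKey) {
        for (Reader r : readers) {
            if (r.inputKey.equals(oldKey)) {
                r.inputKey = newKey;
            }
        }
    }

    static List<String> readAll(List<Reader> readers, Map<String, String> inputs) {
        List<String> out = new ArrayList<>();
        for (Reader r : readers) {
            out.add(r.name + " <- " + r.read(inputs));
        }
        return out;
    }

    // Hypothetical inputs of one vertex: a union input and a replicated join input.
    static Map<String, String> sampleInputs() {
        Map<String, String> inputs = new HashMap<>();
        inputs.put("scope-11", "union rows");
        inputs.put("scope-20", "replicated rows");
        return inputs;
    }

    // The union reader still carries a stale key (scope-10) that must become scope-11.
    static List<Reader> sampleReaders() {
        return new ArrayList<>(Arrays.asList(
                new Reader("unionReader", "scope-10"),
                new Reader("replicatedReader", "scope-20")));
    }

    public static void main(String[] args) {
        Map<String, String> inputs = sampleInputs();

        List<Reader> overwritten = sampleReaders();
        resetAllKeys(overwritten, "scope-11");
        System.out.println("reset all : " + readAll(overwritten, inputs));

        List<Reader> targeted = sampleReaders();
        replaceKey(targeted, "scope-10", "scope-11");
        System.out.println("targeted  : " + readAll(targeted, inputs));
    }
}
```

In the first case both readers resolve to the union input. That matches the symptom reported here: one operator consumes data meant for another, so it either reads past the end of the input or receives a value of an unexpected type.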

> Refactor out resetting input key in TezDagBuilder
> -------------------------------------------------
>
>                 Key: PIG-3957
>                 URL: https://issues.apache.org/jira/browse/PIG-3957
>             Project: Pig
>          Issue Type: Improvement
>          Components: tez
>            Reporter: Daniel Dai
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>
> In TezDagBuilder, we reset the input key of every POPackage/POValueInputTez/POIdentityInOutTez, which is very confusing. We should refactor this out.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)