You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Rohini Palaniswamy (JIRA)" <ji...@apache.org> on 2015/10/05 22:51:26 UTC
[jira] [Updated] (PIG-3957) Refactor out resetting input key in
TezDagBuilder
[ https://issues.apache.org/jira/browse/PIG-3957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rohini Palaniswamy updated PIG-3957:
------------------------------------
Assignee: Rohini Palaniswamy (was: Daniel Dai)
This is causing some queries to fail with one of the below errors if a vertex contains input from both union and replicated join table. The input keys are overwritten in TezDAGBuilder and both of they end up pointing to same input.
{code}
Caused by: java.io.IOException: Please check if you are invoking next() even after it returned false. For usage, please refer to KeyValueReader javadocs
at org.apache.tez.runtime.library.api.KeyValueReader.hasCompletedProcessing(KeyValueReader.java:77)
at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.moveToNextInput(UnorderedKVReader.java:190)
at org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next(UnorderedKVReader.java:118)
at org.apache.pig.backend.hadoop.executionengine.tez.plan.operator.POValueInputTez.getNextTuple(POValueInputTez.java:124)
{code}
or
{code}
Caused by: java.lang.ClassCastException: org.apache.pig.impl.io.NullableTuple cannot be cast to org.apache.pig.data.Tuple
at org.apache.pig.backend.hadoop.executionengine.tez.plan.operator.POValueInputTez.getNextTuple(POValueInputTez.java:126)
at org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.processInput(PhysicalOperator.java:307)
at org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POForEach.getNextTuple(POForEach.java:252)
{code}
> Refactor out resetting input key in TezDagBuilder
> -------------------------------------------------
>
> Key: PIG-3957
> URL: https://issues.apache.org/jira/browse/PIG-3957
> Project: Pig
> Issue Type: Improvement
> Components: tez
> Reporter: Daniel Dai
> Assignee: Rohini Palaniswamy
> Fix For: 0.16.0
>
>
> In TezDagBuilder, we reset input key of all POPackage/POValueInputTez/POIdentityInOutTez, that is very confusing. We shall refactor these out.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)