Posted to issues@tez.apache.org by "Gunther Hagleitner (JIRA)" <ji...@apache.org> on 2013/10/04 23:33:42 UTC

[jira] [Created] (TEZ-529) Hive communicates state from RecordReader to Processor via JobConf

Gunther Hagleitner created TEZ-529:
--------------------------------------

             Summary: Hive communicates state from RecordReader to Processor via JobConf
                 Key: TEZ-529
                 URL: https://issues.apache.org/jira/browse/TEZ-529
             Project: Apache Tez
          Issue Type: Bug
    Affects Versions: 0.2.0
            Reporter: Gunther Hagleitner
            Assignee: Siddharth Seth
             Fix For: 0.2.0


Hive currently switches between operator pipelines and partition descriptors based on the map.input.file property.

In the CombineFileInputFormat case, Hive relies on the fact that CombineFileRecordReader sets this field every time a new file is processed. The processor then reads this field to set up the correct processing pipeline.

After the Tez refactor, RecordReader and TezProcessor use different job conf instances. As a result, Hive fails because map.input.file is never set or updated in the processor's conf.
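
The failure mode can be illustrated with a minimal, self-contained sketch. It does not use the real Hadoop, Hive, or Tez classes: a plain Map stands in for the job conf, and the class name, the openNextFile and selectPipeline helpers, and the file paths are all hypothetical names chosen for illustration. The only point is that a value written to one conf instance is invisible through a separate copy.

```java
import java.util.HashMap;
import java.util.Map;

public class ConfSharingSketch {
    // Property named in the issue; the Map below is only a stand-in for a job conf.
    static final String INPUT_FILE_KEY = "map.input.file";

    /** Hypothetical reader step: record the file currently being read in the reader's conf. */
    static void openNextFile(Map<String, String> readerConf, String path) {
        readerConf.put(INPUT_FILE_KEY, path);
    }

    /** Hypothetical processor step: choose a pipeline from the file seen in the processor's conf. */
    static String selectPipeline(Map<String, String> processorConf) {
        String file = processorConf.get(INPUT_FILE_KEY);
        return file == null ? "NO_PIPELINE" : "pipeline-for:" + file;
    }

    public static void main(String[] args) {
        // Case 1: reader and processor share one conf instance, so updates are visible.
        Map<String, String> shared = new HashMap<>();
        openNextFile(shared, "/warehouse/t/part=1/000000_0");
        System.out.println(selectPipeline(shared));

        // Case 2: the processor holds its own copy, taken before the reader updates its conf.
        Map<String, String> readerConf = new HashMap<>();
        Map<String, String> processorConf = new HashMap<>(readerConf);
        openNextFile(readerConf, "/warehouse/t/part=2/000000_0");
        // The processor's copy never sees the update, so no pipeline can be selected.
        System.out.println(selectPipeline(processorConf));
    }
}
```

In this sketch the first case resolves a pipeline and the second does not, which mirrors the situation described above where the processor's conf never receives the reader's map.input.file updates.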



--
This message was sent by Atlassian JIRA
(v6.1#6144)