Posted to dev@hive.apache.org by "Prasanth Jayachandran (JIRA)" <ji...@apache.org> on 2016/07/27 08:20:20 UTC

[jira] [Created] (HIVE-14354) Cleanup ORC reader interfaces and redundant metadata objects

Prasanth Jayachandran created HIVE-14354:
--------------------------------------------

             Summary: Cleanup ORC reader interfaces and redundant metadata objects
                 Key: HIVE-14354
                 URL: https://issues.apache.org/jira/browse/HIVE-14354
             Project: Hive
          Issue Type: Bug
          Components: ORC
    Affects Versions: 2.2.0
            Reporter: Prasanth Jayachandran
            Assignee: Prasanth Jayachandran


The ORC Reader interface has to be split so that a new, separate interface returns only metadata.
OrcTail should implement this new interface, which provides only metadata information about the file.
The OrcFileMetadata object contains a lot of redundant information that can be obtained from OrcTail. This class should be cleaned up and OrcTail should be reused instead.
Remove all references to List<OrcProto.Types> and replace them with TypeDescription.
SchemaEvolution objects are created in multiple places, even though the information required to create a SchemaEvolution is already present within the Reader (file types, projection and schema on read).
Schema on read can currently be specified only in RecordReader options. Reader options should also accept a schema-on-read type description.
Consolidate Reader options and RecordReader options.
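
A rough sketch of the first two items, for illustration only. Every name below (FileMetadataView, TailSketch, ReaderSketch, InMemoryReaderSketch, MetadataSplitDemo) is hypothetical and is not an actual ORC or Hive type, and a plain String stands in for TypeDescription. The idea shown is that metadata accessors live in their own interface, the tail object implements it, and the full reader extends it and delegates to the tail rather than keeping a second copy of the metadata.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical sketch: these are NOT the real ORC/Hive interfaces.

/** Metadata-only view of a file: no row access. */
interface FileMetadataView {
    long getNumberOfRows();
    String getSchema();              // assumption: String stands in for TypeDescription
    List<Long> getStripeOffsets();
}

/** Stand-in for OrcTail: holds parsed footer data and answers metadata queries. */
final class TailSketch implements FileMetadataView {
    private final long numberOfRows;
    private final String schema;
    private final List<Long> stripeOffsets;

    TailSketch(long numberOfRows, String schema, List<Long> stripeOffsets) {
        this.numberOfRows = numberOfRows;
        this.schema = schema;
        // Defensive copy so callers cannot mutate the tail's state.
        this.stripeOffsets = Collections.unmodifiableList(new ArrayList<>(stripeOffsets));
    }

    @Override public long getNumberOfRows() { return numberOfRows; }
    @Override public String getSchema() { return schema; }
    @Override public List<Long> getStripeOffsets() { return stripeOffsets; }
}

/** Full reader: everything the metadata view offers, plus row access. */
interface ReaderSketch extends FileMetadataView {
    Iterable<String> rows();
}

/** Reader that delegates all metadata calls to the tail instead of duplicating it. */
final class InMemoryReaderSketch implements ReaderSketch {
    private final FileMetadataView tail;
    private final List<String> data;

    InMemoryReaderSketch(FileMetadataView tail, List<String> data) {
        this.tail = tail;
        this.data = data;
    }

    @Override public long getNumberOfRows() { return tail.getNumberOfRows(); }
    @Override public String getSchema() { return tail.getSchema(); }
    @Override public List<Long> getStripeOffsets() { return tail.getStripeOffsets(); }
    @Override public Iterable<String> rows() { return data; }
}

final class MetadataSplitDemo {
    /** Needs only metadata, so it accepts either a tail or a full reader. */
    static String describe(FileMetadataView meta) {
        return "rows=" + meta.getNumberOfRows()
            + " schema=" + meta.getSchema()
            + " stripes=" + meta.getStripeOffsets().size();
    }

    public static void main(String[] args) {
        FileMetadataView tail =
            new TailSketch(2L, "struct<a:int,b:string>", Arrays.asList(3L));
        ReaderSketch reader =
            new InMemoryReaderSketch(tail, Arrays.asList("1,x", "2,y"));
        System.out.println(describe(tail));
        System.out.println(describe(reader));
    }
}
```

Code that needs only file metadata (for example, a footer cache) can then depend on the narrow interface and never touch row-reading machinery.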



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)