You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2015/09/01 03:45:46 UTC

[jira] [Created] (HIVE-11705) refactor SARG stripe filtering for ORC into a method

Sergey Shelukhin created HIVE-11705:
---------------------------------------

             Summary: refactor SARG stripe filtering for ORC into a method
                 Key: HIVE-11705
                 URL: https://issues.apache.org/jira/browse/HIVE-11705
             Project: Hive
          Issue Type: Bug
            Reporter: Sergey Shelukhin
            Assignee: Sergey Shelukhin


For footer cache PPD to metastore, we'd need a method to do the PPD. Tiny item to create it on OrcInputFormat.
For metastore path, these methods will be called from expression proxy similar to current objectstore expr filtering; it will change to have serialized sarg and column list to come from request instead of conf; includedCols/etc. will also come from request instead of assorted java objects. 
The types and stripe stats will need to be extracted from HBase. This is a little bit of a problem, since ideally we want to be inside HBase filter/coprocessor/.... I'd need to take a look to see if this is possible... since that filter would need to either deserialize orc, or we would need to store types and stats information in some other, non-ORC manner on write. The latter is probably a better idea, although it's dangerous because there's no sync between this code and ORC itself.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)