You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2015/02/13 03:17:11 UTC

[jira] [Created] (HIVE-9682) Use variable length blocks to store ORC stripes

Gopal V created HIVE-9682:
-----------------------------

             Summary: Use variable length blocks to store ORC stripes
                 Key: HIVE-9682
                 URL: https://issues.apache.org/jira/browse/HIVE-9682
             Project: Hive
          Issue Type: Improvement
          Components: File Formats
    Affects Versions: 1.2.0
            Reporter: Gopal V


HDFS-3689 is in hadoop-2.7 branch. 

This allows ORC to get 100% locality of stripes within a block without artificial data padding to prevent stripe splitting across blocks.

This can be used instead of block.padding internally to cut down 5% of ORC data sizes across the board.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)