Posted to issues@orc.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2017/05/27 00:58:04 UTC
[jira] [Comment Edited] (ORC-201) Brotli compression codec support
[ https://issues.apache.org/jira/browse/ORC-201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16027078#comment-16027078 ]
Gopal V edited comment on ORC-201 at 5/27/17 12:57 AM:
-------------------------------------------------------
Brotli is clearly aimed at performing better on long-form ASCII text data.
See the word transforms in Appendix B (and decode the static dictionary in Appendix A) - https://tools.ietf.org/html/rfc7932#appendix-B
The Hadoop benchmark does not naturally translate into a speedup for ORC, as Hadoop Gzip defaults to Zlib-6 (which is deflate_slow), while ORC sticks to deflate_fast only (Zlib 1,2,3).
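A rough way to see the level difference is to compress the same buffer at several zlib levels. The sketch below uses Python's standard zlib module rather than the ORC or Hadoop code paths; the helper name compressed_sizes and the sample text are made up for illustration. In zlib's configuration table, levels 1-3 select deflate_fast and levels 4-9 select deflate_slow, so levels 3 and 6 sit on either side of that boundary.

```python
import zlib


def compressed_sizes(data: bytes, levels=(1, 3, 6)) -> dict:
    """Return {level: compressed size in bytes} for each zlib level."""
    sizes = {}
    for level in levels:
        blob = zlib.compress(data, level)
        # Sanity check: the compressed blob must round-trip to the input.
        assert zlib.decompress(blob) == data
        sizes[level] = len(blob)
    return sizes


# Hypothetical sample: repetitive ASCII text, the kind of input discussed above.
sample = b"Brotli is aimed at long-form ASCII text data. " * 2000
sizes = compressed_sizes(sample)
for level, size in sizes.items():
    print(f"zlib level {level}: {len(sample)} -> {size} bytes")
```

Timing each call (for example with time.perf_counter) on representative column data would be needed before drawing any conclusion about ORC itself.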
was (Author: gopalv):
Brotli is clearly aimed at performing better at longform ASCII Text data.
See Appendix B transforms (and decode Appendix A) - https://tools.ietf.org/html/rfc7932#appendix-B
The hadoop benchmark does not naturally translate into a speedup for ORC as Hadoop Gzip defaults to Zlib-6 (which is deflate_slow), while Orc sticks only deflate_fast (Zlib 1,2,3).
> Brotli compression codec support
> --------------------------------
>
> Key: ORC-201
> URL: https://issues.apache.org/jira/browse/ORC-201
> Project: ORC
> Issue Type: New Feature
> Components: compression
> Reporter: Prasanth Jayachandran
>
> HADOOP-13126 is bringing the Brotli compression codec to Hadoop. ORC should add support for Brotli, as it seems to have better performance than SNAPPY.