You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@ant.apache.org by bu...@apache.org on 2011/05/24 13:59:01 UTC

DO NOT REPLY [Bug 45718] Enhance bzip2 for Hadoop

https://issues.apache.org/bugzilla/show_bug.cgi?id=45718

--- Comment #2 from Matthew J. Francis <mj...@gmail.com> 2011-05-24 11:59:01 UTC ---
For unrelated reasons I have implemented an alternative BZip2 encoder/decoder -
see http://code.google.com/p/jbzip2/

Although its default InputStream implementation behaves much like the current
CBZip2InputStream, a block level interface is also provided that can be wrapped
however is desired. This could perhaps be used to provide the functionality
requested.

Hadoop and/or Apache are welcome to adopt, adapt or simply learn from this
code. At the very least, it should be easier for a person of ordinary skill to
read it and gain some understanding of how the various algorithms fit together,
possibly helping to solve the "nobody understands this package" problem.

Anyone simply wishing to be demystified as to how BZip2 works is also most
welcome to get in touch. There's a certain amount of "Aha!" involved in
grasping how it all fits together, but none of the individual parts are
especially difficult to understand.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.