Posted to common-issues@hadoop.apache.org by "Slavik Krassovsky (JIRA)" <ji...@apache.org> on 2012/10/09 00:10:02 UTC
[jira] [Created] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Slavik Krassovsky created HADOOP-8900:
-----------------------------------------
Summary: BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
Key: HADOOP-8900
URL: https://issues.apache.org/jira/browse/HADOOP-8900
Project: Hadoop Common
Issue Type: Bug
Environment: Encountered failure when processing large GZIP file
Reporter: Slavik Krassovsky
Encountered failure when processing large GZIP file
• Gz: Failed in 1hrs, 13mins, 57sec with the error:
java.io.IOException: IO error in map input file hdfs://localhost:9000/Halo4/json_m/gz/NewFileCat.txt.gz
at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:242)
at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:216)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:435)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:371)
at org.apache.hadoop.mapred.Child$4.run(Child.java:266)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
at org.apache.hadoop.mapred.Child.main(Child.java:260)
Caused by: java.io.IOException: stored gzip size doesn't match decompressed size
at org.apache.hadoop.io.compress.zlib.BuiltInGzipDecompressor.executeTrailerState(BuiltInGzipDecompressor.java:389)
at org.apache.hadoop.io.compress.zlib.BuiltInGzipDecompressor.decompress(BuiltInGzipDecompressor.java:224)
at org.apache.hadoop.io.compress.DecompressorStream.decompress(DecompressorStream.java:82)
at org.apache.hadoop.io.compress.DecompressorStream.read(DecompressorStream.java:76)
at java.io.InputStream.read(InputStream.java:102)
at org.apache.hadoop.util.LineReader.readLine(LineReader.java:134)
at org.apache.hadoop.mapred.LineRecordReader.next(LineRecordReader.java:136)
at org.apache.hadoop.mapred.LineRecordReader.next(LineRecordReader.java:40)
at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:66)
at org.apache.hadoop.hive.ql.io.HiveRecordReader.doNext(HiveRecordReader.java:32)
at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.next(HiveContextAwareRecordReader.java:67)
at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:236)
... 9 more
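Note on the message in the trace above: the thread does not state a root cause, so the following is only background. The gzip trailer's ISIZE field stores the uncompressed length modulo 2^32 (RFC 1952), so for inputs of 4 GiB or more a size check can only compare the low 32 bits of the decompressed byte count. A minimal Java sketch of such a check; GzipTrailerSize, readIsize and sizeMatches are hypothetical names for illustration, not code from BuiltInGzipDecompressor or from the attached patches:

```java
final class GzipTrailerSize {
    /**
     * Reads the 4-byte little-endian ISIZE field starting at offset
     * and returns it as an unsigned value held in a long.
     */
    static long readIsize(byte[] trailer, int offset) {
        return (trailer[offset] & 0xffL)
             | (trailer[offset + 1] & 0xffL) << 8
             | (trailer[offset + 2] & 0xffL) << 16
             | (trailer[offset + 3] & 0xffL) << 24;
    }

    /**
     * Compares the stored ISIZE with the number of bytes actually
     * decompressed, keeping only the low 32 bits of the latter because
     * ISIZE is defined modulo 2^32.
     */
    static boolean sizeMatches(long storedIsize, long bytesDecompressed) {
        return storedIsize == (bytesDecompressed & 0xffffffffL);
    }

    public static void main(String[] args) {
        long fiveGiB = 5L * 1024 * 1024 * 1024;
        // Hypothetical trailer for a 5 GiB input: only the low 32 bits are stored.
        long stored = fiveGiB & 0xffffffffL;
        System.out.println(sizeMatches(stored, fiveGiB));
    }
}
```

A check that compares the stored value against the full 64-bit count, or against a 32-bit signed counter, would reject a valid large file; whether that is what happened here is an assumption.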
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13481990#comment-13481990 ]
Andy Isaacson commented on HADOOP-8900:
---------------------------------------
bq. Andy or Colin, can you please review the merged branch-1 patch.
hadoop-8900.branch-1.patch looks good to me. Thanks for the backport!
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
>
>
[jira] [Work stopped] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on HADOOP-8900 stopped by Andy Isaacson.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482333#comment-13482333 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-Mapreduce-trunk #1234 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1234/])
HADOOP-8900. I had mistakenly not credited the patch to the right person. Changing the contributor to Andy Isaacson. (Revision 1401144)
Result = SUCCESS
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1401144
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 1.2.0, 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Colin Patrick McCabe (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472855#comment-13472855 ]
Colin Patrick McCabe commented on HADOOP-8900:
----------------------------------------------
Another, perhaps easier way to avoid the disk space cost would be to have a gzip output stream and feed it into the gzip input stream (never touch disk).
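The suggestion above can be sketched roughly as follows. This is an illustration only: it uses the JDK's java.util.zip streams and piped streams rather than Hadoop's codec classes, and GzipPipeRoundTrip and roundTrip are hypothetical names, not part of any attached patch. A producer thread compresses generated data into a pipe while the caller decompresses from the other end, so nothing is written to disk.

```java
import java.io.IOException;
import java.io.PipedInputStream;
import java.io.PipedOutputStream;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

final class GzipPipeRoundTrip {
    /**
     * Compresses totalBytes zero bytes in a producer thread and returns
     * how many bytes the calling thread decompressed from the pipe.
     */
    static long roundTrip(long totalBytes) throws IOException, InterruptedException {
        PipedOutputStream pipeOut = new PipedOutputStream();
        PipedInputStream pipeIn = new PipedInputStream(pipeOut, 64 * 1024);

        Thread producer = new Thread(() -> {
            byte[] chunk = new byte[8192];
            // Closing the gzip stream writes the trailer and closes the pipe,
            // which signals end of stream to the reader.
            try (GZIPOutputStream gz = new GZIPOutputStream(pipeOut)) {
                long remaining = totalBytes;
                while (remaining > 0) {
                    int n = (int) Math.min(chunk.length, remaining);
                    gz.write(chunk, 0, n);
                    remaining -= n;
                }
            } catch (IOException e) {
                throw new RuntimeException(e);
            }
        });
        // Start the producer first: the GZIPInputStream constructor blocks
        // until the gzip header has been written to the pipe.
        producer.start();

        long decompressed = 0;
        byte[] buf = new byte[8192];
        try (GZIPInputStream in = new GZIPInputStream(pipeIn)) {
            int n;
            while ((n = in.read(buf)) != -1) {
                decompressed += n;
            }
        }
        producer.join();
        return decompressed;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(roundTrip(1_000_000L));
    }
}
```

A test for the large-file case would pass a totalBytes value above 4 GiB and swap the reader for the decompressor under test; that part is left out here because it depends on the Hadoop classes involved.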
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
>
>
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andy Isaacson updated HADOOP-8900:
----------------------------------
Status: Patch Available (was: Open)
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13478225#comment-13478225 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-trunk-Commit #2882 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/2882/])
HADOOP-8900. BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size. Contributed by Slavik Krassovsky. (Revision 1399377)
Result = SUCCESS
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1399377
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/io/compress/zlib/BuiltInGzipDecompressor.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/compress/TestCodec.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/file/tfile/TestVLong.java
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482248#comment-13482248 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-Yarn-trunk #12 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/12/])
HADOOP-8900. I had mistakenly not credited the patch to the right person. Changing the contributor to Andy Isaacson. (Revision 1401144)
Result = FAILURE
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1401144
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 1.2.0, 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13478895#comment-13478895 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-Yarn-trunk #7 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/7/])
HADOOP-8900. BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size. Contributed by Slavik Krassovsky. (Revision 1399377)
Result = FAILURE
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1399377
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/io/compress/zlib/BuiltInGzipDecompressor.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/compress/TestCodec.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/file/tfile/TestVLong.java
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
>
>
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Suresh Srinivas (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Suresh Srinivas updated HADOOP-8900:
------------------------------------
Summary: BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size (was: BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky))
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482298#comment-13482298 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-Hdfs-trunk #1204 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1204/])
HADOOP-8900. I had mistakenly not credited the patch to the right person. Changing the contributor to Andy Isaacson. (Revision 1401144)
Result = SUCCESS
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1401144
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 1.2.0, 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andy Isaacson updated HADOOP-8900:
----------------------------------
Attachment: hadoop8900-2.txt
New patch for trunk:
* fix all examples of {{long&0xffffffff}} in the tree (adds TestVLong).
* verified that 4GB+1 is the relevant edge case, 2GB+1 does not trigger the failure.
I'm still a bit unhappy at the long runtime, but 100 seconds is not *that* long by the standards of this test suite, so maybe it's worthwhile.
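For readers following along, here is a minimal sketch of why the {{long&0xffffffff}} pattern misbehaves and why 4GB+1 is the edge case while 2GB+1 is not. It is an illustration only, not the patch itself; the class and method names (MaskDemo, buggyLow32, fixedLow32) are hypothetical. In Java, {{0xffffffff}} is an int literal with value -1, so it is sign-extended to all 64 bits before the AND and masks nothing. The gzip trailer stores the uncompressed size modulo 2^32, so the two sides only disagree once the byte count passes 4GB (assuming the stored value is read as an unsigned 32-bit quantity).

```java
public class MaskDemo {
    // Buggy form: 0xffffffff is the int -1, widened to -1L, so the AND is a no-op.
    static long buggyLow32(long n) {
        return n & 0xffffffff;
    }

    // Intended form: the L suffix makes the mask a long with only the low 32 bits set.
    static long fixedLow32(long n) {
        return n & 0xffffffffL;
    }

    public static void main(String[] args) {
        long fourGbPlusOne = (1L << 32) + 1; // 4294967297
        long twoGbPlusOne = (1L << 31) + 1;  // 2147483649

        // Above 4GB the buggy mask keeps the high bits; a gzip trailer would hold 1.
        System.out.println(buggyLow32(fourGbPlusOne)); // prints 4294967297
        System.out.println(fixedLow32(fourGbPlusOne)); // prints 1

        // Below 4GB both forms agree, so no mismatch is reported.
        System.out.println(buggyLow32(twoGbPlusOne));  // prints 2147483649
        System.out.println(fixedLow32(twoGbPlusOne));  // prints 2147483649
    }
}
```

With the {{L}} suffix the masked byte count matches the modulo-2^32 value in the trailer for any input size.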
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472860#comment-13472860 ]
Slavik Krassovsky commented on HADOOP-8900:
-------------------------------------------
Andy, let's go with your patch.
+1
Could you please port it to branch-1 so that we can integrate it into branch-1-win?
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Assigned] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andy Isaacson reassigned HADOOP-8900:
-------------------------------------
Assignee: Andy Isaacson
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Colin Patrick McCabe (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473385#comment-13473385 ]
Colin Patrick McCabe commented on HADOOP-8900:
----------------------------------------------
bq. Nope, it only writes the compressed file to disk; gzip -1 compresses 4GB of zeros to 18 MiB.
Sorry, I missed the fact that the uncompressed file was never written to disk.
Looks good to me.
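The approach discussed here (stream zeros through a gzip compressor so that only the compressed bytes are ever stored) could be sketched roughly as follows. This is not the test from the patch; the class and method names (ZeroGzip, writeZeros, storedSize) are hypothetical, and it uses java.util.zip.GZIPOutputStream at its default compression level rather than gzip -1. The demo uses a small size so it runs quickly; passing 4GB+1 and a file-backed stream is the expensive case the thread is talking about.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.zip.GZIPOutputStream;

public class ZeroGzip {
    // Compress n zero bytes into sink, reusing one small buffer so the
    // uncompressed data is never materialized in memory or on disk.
    static void writeZeros(OutputStream sink, long n) throws IOException {
        byte[] zeros = new byte[64 * 1024];
        try (GZIPOutputStream gz = new GZIPOutputStream(sink)) {
            long left = n;
            while (left > 0) {
                int chunk = (int) Math.min((long) zeros.length, left);
                gz.write(zeros, 0, chunk);
                left -= chunk;
            }
        }
    }

    // Read the last four bytes of a gzip member: the uncompressed size
    // modulo 2^32, stored little-endian.
    static long storedSize(byte[] gzipBytes) {
        int off = gzipBytes.length - 4;
        return (gzipBytes[off] & 0xffL)
            | ((gzipBytes[off + 1] & 0xffL) << 8)
            | ((gzipBytes[off + 2] & 0xffL) << 16)
            | ((gzipBytes[off + 3] & 0xffL) << 24);
    }

    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeZeros(out, 100_000L);
        System.out.println("compressed bytes: " + out.size());
        System.out.println("stored size: " + storedSize(out.toByteArray())); // stored size: 100000
    }
}
```

Because the trailer holds the size modulo 2^32, storedSize would return 1 for an input of 4GB+1 zero bytes, which is the value a decompressor has to compare against.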
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482018#comment-13482018 ]
Hadoop QA commented on HADOOP-8900:
-----------------------------------
{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12550367/hadoop-8900.branch-1.patch
against trunk revision .
{color:red}-1 patch{color}. The patch command could not apply the patch.
Console output: https://builds.apache.org/job/PreCommit-HADOOP-Build/1660//console
This message is automatically generated.
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Suresh Srinivas (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482067#comment-13482067 ]
Suresh Srinivas commented on HADOOP-8900:
-----------------------------------------
Sorry, I had mistakenly credited this patch to Slavik. I have changed CHANGES.txt to credit this JIRA to Andy.
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 1.2.0, 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Slavik Krassovsky updated HADOOP-8900:
--------------------------------------
Affects Version/s: 1-win
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13478986#comment-13478986 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-Mapreduce-trunk #1229 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1229/])
HADOOP-8900. BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size. Contributed by Slavik Krassovsky. (Revision 1399377)
Result = FAILURE
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1399377
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/io/compress/zlib/BuiltInGzipDecompressor.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/compress/TestCodec.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/file/tfile/TestVLong.java
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Suresh Srinivas (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Suresh Srinivas updated HADOOP-8900:
------------------------------------
Fix Version/s: 2.0.3-alpha
3.0.0
+1 for the patch.
I committed the patch to trunk and branch-2. I will post it to branch-1, once the patch for it is posted.
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Suresh Srinivas (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13481907#comment-13481907 ]
Suresh Srinivas commented on HADOOP-8900:
-----------------------------------------
Andy or Colin, can you please review the merged branch-1 patch?
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Suresh Srinivas (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Suresh Srinivas updated HADOOP-8900:
------------------------------------
Resolution: Fixed
Fix Version/s: 1.2.0
Hadoop Flags: Reviewed
Status: Resolved (was: Patch Available)
I committed the patch to branch-1 also. Thank you Andy and Slavik.
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 1.2.0, 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Suresh Srinivas (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Suresh Srinivas updated HADOOP-8900:
------------------------------------
Attachment: hadoop-8900.branch-1.patch
Merged patch for branch-1.
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andy Isaacson updated HADOOP-8900:
----------------------------------
Attachment: hadoop8900.txt
Attaching patch which corrects this mask issue and adds a testcase which fails without the fix.
Unfortunately the testcase takes more than 30 seconds to run on my 2.5GHz Core i5, so I doubt that it should be run by default. The total runtime for TestCodec goes from 16 seconds to 99 seconds with testGzipLongOverflow enabled.
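The comment above refers only to a "mask issue" and does not show the code. The following is a minimal sketch of one way such a bug can arise, assuming the decompressor compares the gzip trailer's ISIZE field (the uncompressed size modulo 2^32, per RFC 1952) against a long byte count masked with an int literal. The class and method names are hypothetical and are not taken from the patch.

```java
public class GzipSizeMask {
    // Assumed buggy form: 0xffffffff is an int literal equal to -1. It is
    // widened to the long -1L (all 64 bits set), so the AND changes nothing.
    static long maskWithIntLiteral(long bytesWritten) {
        return bytesWritten & 0xffffffff;
    }

    // Assumed corrected form: 0xffffffffL is a long with only the low 32 bits
    // set, so the AND keeps the count modulo 2^32.
    static long maskWithLongLiteral(long bytesWritten) {
        return bytesWritten & 0xffffffffL;
    }

    public static void main(String[] args) {
        long fiveGiB = 5L << 30;                  // hypothetical decompressed size
        long storedIsize = fiveGiB % (1L << 32);  // what a gzip trailer would hold
        System.out.println(maskWithIntLiteral(fiveGiB) == storedIsize);   // false
        System.out.println(maskWithLongLiteral(fiveGiB) == storedIsize);  // true
    }
}
```

Under this assumption the comparison only goes wrong once the decompressed size reaches 4 GiB, which would be consistent with the failure being reported for a large file.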
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Attachments: hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472069#comment-13472069 ]
Hadoop QA commented on HADOOP-8900:
-----------------------------------
{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12548348/BuiltInGzipDecompressor2.patch
against trunk revision .
{color:red}-1 patch{color}. The patch command could not apply the patch.
Console output: https://builds.apache.org/job/PreCommit-HADOOP-Build/1585//console
This message is automatically generated.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900.txt
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Slavik Krassovsky updated HADOOP-8900:
--------------------------------------
Attachment: BuiltInGzipDecompressor2.patch
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900.txt
[jira] [Work started] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on HADOOP-8900 started by Andy Isaacson.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472998#comment-13472998 ]
Andy Isaacson commented on HADOOP-8900:
---------------------------------------
bq. It's kind of annoying to have to use 4GB of temporary space
Nope, it only writes the compressed file to disk; {{gzip -1}} compresses 4GB of zeros to 18 MiB.
bq. Could you please port it to branch-1 so that we could integrate it to branch-1-win
Slavik, thanks for the review!
I don't have very much experience on branch-1, would you like to take a shot at the port? Especially I don't know very much about the test framework differences. I will figure out the details and do the port later this week if you don't get to it first.
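The point about temporary space can be illustrated with a short sketch: a long run of zeros is streamed through a gzip compressor at the fastest level, so only the much smaller compressed form is ever held. This is not the testcase from the patch; the class name, method name, and buffer size are assumptions made for illustration, and the demo uses a small size so that it runs quickly.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.zip.Deflater;
import java.util.zip.GZIPOutputStream;

public class ZeroGzip {
    // Compresses totalBytes zero bytes and returns the gzip stream.
    // The uncompressed data is never stored; one small buffer is reused.
    static byte[] gzipZeros(long totalBytes) throws IOException {
        ByteArrayOutputStream compressed = new ByteArrayOutputStream();
        // The anonymous subclass reaches the protected Deflater field "def"
        // to select the fastest level, comparable to "gzip -1".
        try (GZIPOutputStream gz = new GZIPOutputStream(compressed) {
            { def.setLevel(Deflater.BEST_SPEED); }
        }) {
            byte[] zeros = new byte[64 * 1024];
            long remaining = totalBytes;
            while (remaining > 0) {
                int n = (int) Math.min(zeros.length, remaining);
                gz.write(zeros, 0, n);
                remaining -= n;
            }
        }
        return compressed.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        long size = 64L << 20;  // 64 MiB for a quick demo, not the 4GB from the thread
        byte[] gz = gzipZeros(size);
        System.out.println(size + " zero bytes -> " + gz.length + " compressed bytes");
    }
}
```

A real test for sizes above 4 GiB would write to a file or pipe the compressed bytes straight into the decompressor rather than buffering them in memory.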
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472025#comment-13472025 ]
Hadoop QA commented on HADOOP-8900:
-----------------------------------
{color:green}+1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12548337/hadoop8900.txt
against trunk revision .
{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:green}+1 core tests{color}. The patch passed unit tests in hadoop-common-project/hadoop-common.
{color:green}+1 contrib tests{color}. The patch passed contrib unit tests.
Test results: https://builds.apache.org/job/PreCommit-HADOOP-Build/1581//testReport/
Console output: https://builds.apache.org/job/PreCommit-HADOOP-Build/1581//console
This message is automatically generated.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13475397#comment-13475397 ]
Slavik Krassovsky commented on HADOOP-8900:
-------------------------------------------
Andy, sounds good, I'll port to branch-1.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13478969#comment-13478969 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-Hdfs-trunk #1199 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1199/])
HADOOP-8900. BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size. Contributed by Slavik Krassovsky. (Revision 1399377)
Result = SUCCESS
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1399377
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/io/compress/zlib/BuiltInGzipDecompressor.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/compress/TestCodec.java
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/io/file/tfile/TestVLong.java
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor throws
IOException - stored gzip size doesn't match decompressed size
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482148#comment-13482148 ]
Hudson commented on HADOOP-8900:
--------------------------------
Integrated in Hadoop-trunk-Commit #2910 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/2910/])
HADOOP-8900. I had mistakenly not credited the patch to the right person. Changing the contributor to Andy Isaacson. (Revision 1401144)
Result = SUCCESS
suresh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1401144
Files :
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
> BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size
> ---------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Fix For: 1.2.0, 3.0.0, 2.0.3-alpha
>
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop-8900.branch-1.patch, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13471908#comment-13471908 ]
Slavik Krassovsky commented on HADOOP-8900:
-------------------------------------------
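It's a quirk of Java that an integer-literal mask such as 0xffffffff is typed as int even when it is applied to a long value. As an int, 0xffffffff equals -1; when it is widened to long for the AND it is sign-extended to 0xffffffffffffffffL, so the mask leaves the long value unchanged. Legitimate, yet somewhat surprising.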
/**
 * Hadoop gzip issue repro
 * @author viatk
 */
public class Repro {
    public static void main(String[] args) {
        long smallLongValue = 665615408L;  // 0x027AC7C30
        long largeLongValue = 9255550000L; // 0x227AC7C30
        long largeValueWithIntMask = (largeLongValue & 0xffffffff);
        long largeValueWithLongMask = (largeLongValue & 0xffffffffL);
        System.out.println("smallLongValue= "+smallLongValue);
        System.out.println("largeLongValue= "+largeLongValue);
        System.out.println("largeValueWithIntMask ="+largeValueWithIntMask);
        System.out.println("largeValueWithLongMask ="+largeValueWithLongMask);
        System.out.println();
        if (largeValueWithIntMask != largeValueWithLongMask) {
            System.out.println("Here is your repro - largeValueWithIntMask != largeValueWithLongMask");
        }
        if (smallLongValue != largeValueWithIntMask) {
            System.out.println("Thus smallLongValue != largeValueWithIntMask");
        }
        if (smallLongValue == largeValueWithLongMask) {
            System.out.println("The fix is to compare Long values with long values with long masks.");
        }
    }
}
smallLongValue= 665615408
largeLongValue= 9255550000
largeValueWithIntMask =9255550000
largeValueWithLongMask =665615408
Here is your repro - largeValueWithIntMask != largeValueWithLongMask
Thus smallLongValue != largeValueWithIntMask
The fix is to compare Long values with long values with long masks.
.
Chuan Liu added a comment - 24/Aug/12 11:32 AM - edited
+1
We found this bug while working with an internal customer.
The bug exists on Linux as well.
The root cause is that we are comparing long values with an int mask.
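To connect the masking behaviour to the reported exception, here is a minimal sketch of a gzip trailer size check. It assumes the check compares the stored ISIZE field (the uncompressed length modulo 2^32, per RFC 1952) with the number of bytes decompressed; the class and method names are hypothetical and are not taken from BuiltInGzipDecompressor.

```java
// Hypothetical illustration only; not the Hadoop source.
final class IsizeCheck {
    // Buggy form: 0xffffffff is an int literal equal to -1. Widened to long it
    // becomes all ones, so the AND is a no-op and large counts never match.
    static boolean buggyMatches(long storedIsize, long bytesDecompressed) {
        return storedIsize == (bytesDecompressed & 0xffffffff);
    }

    // Fixed form: the long literal keeps only the low 32 bits of the count,
    // which is what ISIZE stores.
    static boolean fixedMatches(long storedIsize, long bytesDecompressed) {
        return storedIsize == (bytesDecompressed & 0xffffffffL);
    }

    public static void main(String[] args) {
        long bytesDecompressed = 9255550000L; // more than 4 GiB of output
        long storedIsize = 665615408L;        // 9255550000 mod 2^32
        System.out.println("buggy check passes: " + buggyMatches(storedIsize, bytesDecompressed));
        System.out.println("fixed check passes: " + fixedMatches(storedIsize, bytesDecompressed));
    }
}
```

With the values from the repro above, the buggy form rejects a valid file and the fixed form accepts it; for outputs under 4 GiB both forms agree, which would explain why only large files fail.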
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472582#comment-13472582 ]
Slavik Krassovsky commented on HADOOP-8900:
-------------------------------------------
The http://issues.apache.org/jira/secure/attachment/12548348/BuiltInGzipDecompressor2.patch is for 1-win branch.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Colin Patrick McCabe (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472616#comment-13472616 ]
Colin Patrick McCabe commented on HADOOP-8900:
----------------------------------------------
Looks good to me.
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Slavik Krassovsky (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472859#comment-13472859 ]
Slavik Krassovsky commented on HADOOP-8900:
-------------------------------------------
Andy, let's go with your patch.
+1
Could you please port it to branch-1 so that we can integrate it into branch-1-win?
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
>
>
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Colin Patrick McCabe (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472851#comment-13472851 ]
Colin Patrick McCabe commented on HADOOP-8900:
----------------------------------------------
It seems like the long runtime could be avoided by implementing an InputStream subclass that returns synthetic data. GZip is a block-based compression algorithm, so could one simply keep returning the same block over and over, followed by the 4-byte footer?
It's kind of annoying to have to use 4GB of temporary space for the unit tests as well. I only have 9GB free in total on my drive now, so a test like this could easily push me over the limit.
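A minimal sketch of the synthetic-stream idea, under stated assumptions: the class name SyntheticGzipStream and its countDecompressed helper are hypothetical and not taken from any attached patch. Instead of repeating a compressed block, it repeats one stored (uncompressed) deflate block, which is simpler to generate by hand. Note that the gzip trailer defined in RFC 1952 is 8 bytes: a CRC32 followed by the 4-byte ISIZE field, which holds the uncompressed length modulo 2^32.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.util.zip.CRC32;
import java.util.zip.GZIPInputStream;

/** Hypothetical test helper: emits a valid gzip stream without buffering it. */
final class SyntheticGzipStream extends InputStream {
  // Fixed 10-byte gzip header: magic, CM=8 (deflate), no flags, no mtime, OS unknown.
  private static final byte[] HEADER = {0x1f, (byte) 0x8b, 8, 0, 0, 0, 0, 0, 0, (byte) 0xff};

  private final byte[] payload;   // uncompressed bytes carried by every block
  private final byte[] block;     // one non-final stored deflate block
  private final long blockCount;
  private final CRC32 crc = new CRC32();
  private long blocksEmitted = 0;
  private boolean trailerEmitted = false;
  private byte[] cur = HEADER;    // chunk currently being served
  private int pos = 0;

  SyntheticGzipStream(byte[] payload, long blockCount) {
    if (payload.length > 0xffff || blockCount < 0) {
      throw new IllegalArgumentException("need payload <= 65535 bytes and blockCount >= 0");
    }
    this.payload = payload.clone();
    this.blockCount = blockCount;
    int len = payload.length;
    block = new byte[5 + len];
    block[0] = 0;                       // BFINAL=0, BTYPE=00 (stored)
    block[1] = (byte) len;              // LEN, little-endian
    block[2] = (byte) (len >>> 8);
    block[3] = (byte) ~len;             // NLEN = one's complement of LEN
    block[4] = (byte) (~len >>> 8);
    System.arraycopy(payload, 0, block, 5, len);
  }

  /** Moves to the next chunk; returns false once the trailer has been served. */
  private boolean advance() {
    if (trailerEmitted) {
      return false;
    }
    if (blocksEmitted < blockCount) {
      crc.update(payload);              // CRC covers every uncompressed byte
      blocksEmitted++;
      cur = block;
    } else {
      cur = new byte[13];
      cur[0] = 1;                       // final, empty stored block: 01 00 00 ff ff
      cur[3] = (byte) 0xff;
      cur[4] = (byte) 0xff;
      putLE32(cur, 5, crc.getValue());                // CRC32
      putLE32(cur, 9, blockCount * payload.length);   // ISIZE: low 32 bits only
      trailerEmitted = true;
    }
    pos = 0;
    return true;
  }

  private static void putLE32(byte[] b, int off, long v) {
    for (int i = 0; i < 4; i++) {
      b[off + i] = (byte) (v >>> (8 * i));
    }
  }

  @Override
  public int read() {
    while (pos == cur.length) {
      if (!advance()) {
        return -1;
      }
    }
    return cur[pos++] & 0xff;
  }

  @Override
  public int read(byte[] b, int off, int len) {
    if (len == 0) {
      return 0;
    }
    while (pos == cur.length) {
      if (!advance()) {
        return -1;
      }
    }
    int n = Math.min(len, cur.length - pos);
    System.arraycopy(cur, pos, b, off, n);
    pos += n;
    return n;
  }

  /** Decompresses with the JDK's GZIPInputStream and returns the byte count. */
  static long countDecompressed(InputStream gz) {
    try (GZIPInputStream in = new GZIPInputStream(gz)) {
      byte[] buf = new byte[8192];
      long total = 0;
      for (int n; (n = in.read(buf)) != -1; ) {
        total += n;
      }
      return total;
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }
}
```

Something like new SyntheticGzipStream(new byte[65535], 70000) would describe a little over 4 GiB of uncompressed data, which should be enough for ISIZE to wrap, while holding only one block in memory and using no temporary disk space. The CRC32 and the decompression itself still run over every byte, so this removes the disk footprint rather than all of the runtime.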
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 1-win, 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900-2.txt, hadoop8900.txt
[jira] [Commented] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13472867#comment-13472867 ]
Hadoop QA commented on HADOOP-8900:
-----------------------------------
{color:green}+1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12548490/hadoop8900-2.txt
against trunk revision .
{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:green}+1 tests included{color}. The patch appears to include 2 new or modified test files.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:green}+1 core tests{color}. The patch passed unit tests in hadoop-common-project/hadoop-common.
{color:green}+1 contrib tests{color}. The patch passed contrib unit tests.
Test results: https://builds.apache.org/job/PreCommit-HADOOP-Build/1592//testReport/
Console output: https://builds.apache.org/job/PreCommit-HADOOP-Build/1592//console
This message is automatically generated.
[jira] [Updated] (HADOOP-8900) BuiltInGzipDecompressor :
java.io.IOException: stored gzip size doesn't match decompressed size
(Slavik Krassovsky)
Posted by "Andy Isaacson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HADOOP-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andy Isaacson updated HADOOP-8900:
----------------------------------
Affects Version/s: 2.0.1-alpha
> BuiltInGzipDecompressor : java.io.IOException: stored gzip size doesn't match decompressed size (Slavik Krassovsky)
> -------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-8900
> URL: https://issues.apache.org/jira/browse/HADOOP-8900
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 2.0.1-alpha
> Environment: Encountered failure when processing large GZIP file
> Reporter: Slavik Krassovsky
> Assignee: Andy Isaacson
> Attachments: BuiltInGzipDecompressor2.patch, hadoop8900.txt