You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2012/05/13 12:16:17 UTC

[Bug 52991] Unexpected end of ZLIB input stream on embedded OLE extraction from PPT

https://issues.apache.org/bugzilla/show_bug.cgi?id=52991

RM <eu...@kontextwork.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|FIXED                       |---

--- Comment #2 from RM <eu...@kontextwork.de> ---
Verified on with the current trunk, revision 1337825, not fixed yet:

The source is a ppt, error is exactly the same:
xception in thread "main" org.apache.tika.exception.TikaException: TIKA-198:
Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@bd928a
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:248)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
    at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:126)
    at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:395)
    at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:97)
Caused by: org.apache.tika.io.TaggedIOException: Unexpected end of ZLIB input
stream
    at
org.apache.tika.io.TaggedInputStream.handleIOException(TaggedInputStream.java:133)
    at org.apache.tika.io.ProxyInputStream.read(ProxyInputStream.java:103)
    at org.apache.tika.io.ProxyInputStream.read(ProxyInputStream.java:99)
    at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
    at java.io.BufferedInputStream.read1(BufferedInputStream.java:258)
    at java.io.BufferedInputStream.read(BufferedInputStream.java:317)
    at java.io.FilterInputStream.read(FilterInputStream.java:90)
    at org.apache.tika.io.IOUtils.copyLarge(IOUtils.java:933)
    at org.apache.tika.io.IOUtils.copy(IOUtils.java:907)
    at org.apache.tika.io.TikaInputStream.getFile(TikaInputStream.java:536)
    at
org.apache.tika.io.TikaInputStream.getFileChannel(TikaInputStream.java:564)
    at
org.apache.tika.parser.microsoft.POIFSContainerDetector.getTopLevelNames(POIFSContainerDetector.java:335)
    at
org.apache.tika.parser.microsoft.POIFSContainerDetector.detect(POIFSContainerDetector.java:152)
    at
org.apache.tika.detect.CompositeDetector.detect(CompositeDetector.java:61)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:113)
    at org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
    at
org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:102)
    at
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedResource(AbstractPOIFSExtractor.java:68)
    at
org.apache.tika.parser.microsoft.HSLFExtractor.handleSlideEmbeddedResources(HSLFExtractor.java:236)
    at
org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:117)
    at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:188)
    at
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:160)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
    ... 5 more
Caused by: java.io.EOFException: Unexpected end of ZLIB input stream
    at java.util.zip.InflaterInputStream.fill(InflaterInputStream.java:223)
    at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:141)
    at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
    at java.io.BufferedInputStream.read1(BufferedInputStream.java:258)
    at java.io.BufferedInputStream.read(BufferedInputStream.java:317)
    at org.apache.tika.io.ProxyInputStream.read(ProxyInputStream.java:99)
    ... 26 more


---------
Debian Squeeze with tika from source ( also tried 1.0 and 1.1 )

-- 
You are receiving this mail because:
You are the assignee for the bug.