Posted to users@pdfbox.apache.org by Andreas Lehmkuehler <an...@lehmi.de> on 2013/08/08 18:44:26 UTC
Re: Unable to read PDF with embedded Black-And-White TIFF.
Hi,
On 29.07.2013 16:39, Lachezar Dobrev wrote:
> Hello colleagues.
>
> A month or two ago I started using PDFBox to read PDF files
> received from a scanner. Recently some of the users started receiving
> this error:
>
>> java.lang.RuntimeException: EOL encountered in black run.
>> at org.apache.pdfbox.filter.TIFFFaxDecoder.decodeNextScanline(TIFFFaxDecoder.java:677)
>> at org.apache.pdfbox.filter.TIFFFaxDecoder.decode2D(TIFFFaxDecoder.java:766)
>> at org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecodeFilter.java:120)
>> at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:295)
>> at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:237)
>> at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:172)
>> at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.getRGBImage(PDCcitt.java:155)
There are some known issues concerning the CCITT filter. Most likely yours is
related to them.
> Opening the same file with evince yields:
>> Syntax Error (40958): Missing 'endstream' or incorrect stream length
>
> But the file still displays and the image content is visible.
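The evince message may indicate that the stream's /Length entry does not match the data actually present before 'endstream'. A minimal sketch for checking that on a single extracted object follows. It uses only the Java standard library, not PDFBox. The class and method names are hypothetical, and it assumes a direct integer /Length, one stream per object, and no literal "endstream" inside the binary data.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PDFBox: a heuristic check of one PDF stream object.
class StreamLengthCheck {

    // Direct integer /Length only; an indirect reference such as "/Length 12 0 R" is rejected.
    private static final Pattern LENGTH =
            Pattern.compile("/Length\\s+(\\d++)(?!\\s+\\d+\\s+R)");

    // ISO-8859-1 maps every byte to exactly one char, so string indexes equal byte offsets.
    static String fromBytes(byte[] raw) {
        return new String(raw, StandardCharsets.ISO_8859_1);
    }

    // Returns the declared /Length, or -1 if no direct integer value is found.
    static int declaredLength(String obj) {
        Matcher m = LENGTH.matcher(obj);
        return m.find() ? Integer.parseInt(m.group(1)) : -1;
    }

    // True if the declared /Length equals the number of bytes between the
    // "stream" keyword and "endstream", allowing for one optional end-of-line
    // marker before "endstream". Returns false if either value is unknown.
    static boolean isConsistent(String obj) {
        int declared = declaredLength(obj);
        int start = obj.indexOf("stream");
        if (declared < 0 || start < 0) {
            return false;
        }
        start += "stream".length();
        // The keyword is followed by CRLF or LF before the data begins.
        if (obj.startsWith("\r\n", start)) {
            start += 2;
        } else if (obj.startsWith("\n", start)) {
            start += 1;
        }
        int end = obj.indexOf("endstream", start);
        if (end < 0) {
            return false;
        }
        String data = obj.substring(start, end);
        int raw = data.length();
        if (declared == raw) {
            return true;
        }
        if (data.endsWith("\r\n") && declared == raw - 2) {
            return true;
        }
        return (data.endsWith("\n") || data.endsWith("\r")) && declared == raw - 1;
    }

    public static void main(String[] args) {
        String good = "1 0 obj\n<< /Length 5 >>\nstream\nHELLO\nendstream\nendobj\n";
        String bad = "1 0 obj\n<< /Length 9 >>\nstream\nHELLO\nendstream\nendobj\n";
        System.out.println("good object consistent: " + isConsistent(good));
        System.out.println("bad object consistent:  " + isConsistent(bad));
    }
}
```

For a real file, the bytes of one object would first be passed through fromBytes. A mismatch only narrows the problem down; it does not by itself explain the exception in the CCITT decoder.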
>
> The files that exhibit this problem contain sensitive information, and
> I don't feel comfortable sharing them.
> However, if any of the developers needs a sample, I can probably
> provide one off-list.
Send it to me and I'll have a look to check if my assumption is correct or not.
> Please advise.
BR
Andreas Lehmkühler