You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by Andreas Lehmkuehler <an...@lehmi.de> on 2013/08/08 18:44:26 UTC

Re: Unable to read PDF with embedded Black-And-White TIFF.

Hi,

Am 29.07.2013 16:39, schrieb Lachezar Dobrev:
>    Hello colleagues.
>
>    Since a month or two I've started using PDF Box to read PDF files
> received from a scanner. Recently some of the users started receiving
> this error:
>
>> java.lang.RuntimeException: EOL encountered in black run.
>>   at org.apache.pdfbox.filter.TIFFFaxDecoder.decodeNextScanline(TIFFFaxDecoder.java:677)
>>   at org.apache.pdfbox.filter.TIFFFaxDecoder.decode2D(TIFFFaxDecoder.java:766)
>>   at org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecodeFilter.java:120)
>>   at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:295)
>>   at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:237)
>>   at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:172)
>>   at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.getRGBImage(PDCcitt.java:155)
There are some known issues concerning the CCITT filter. Most likely yours is
related to them.

>    Opening the same file with evince yields:
>> Syntax Error (40958): Missing 'endstream' or incorrect stream length
>
>    But the file still displays and the image content is visible.
>
>    The files that exhibit this problem have sensitive information, and
> I don't feel comfortable sharing those.
>    However if anyone of the developers needs a sample I can probably
> provide one off-list.
Send it to me and I'll have a look to check if my assumption is correct or not.

>    Please advise.

BR
Andreas Lehmkühler