Posted to users@pdfbox.apache.org by Lachezar Dobrev <l....@gmail.com> on 2013/07/29 16:39:45 UTC

Unable to read PDF with embedded Black-And-White TIFF.

  Hello colleagues.

  A month or two ago I started using PDFBox to read PDF files
received from a scanner. Recently some users started getting
this error:

> java.lang.RuntimeException: EOL encountered in black run.
>  at org.apache.pdfbox.filter.TIFFFaxDecoder.decodeNextScanline(TIFFFaxDecoder.java:677)
>  at org.apache.pdfbox.filter.TIFFFaxDecoder.decode2D(TIFFFaxDecoder.java:766)
>  at org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecodeFilter.java:120)
>  at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:295)
>  at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:237)
>  at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:172)
>  at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.getRGBImage(PDCcitt.java:155)

  Opening the same file with evince yields:
> Syntax Error (40958): Missing 'endstream' or incorrect stream length

  But the file still displays and the image content is visible.
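
  The evince message suggests that the declared /Length of a stream may
not match where 'endstream' actually sits in the file. Whether that is
what upsets the CCITT decoder here is only a guess. A minimal sketch of
such a check in plain Java, assuming the caller already knows the offset
of the first data byte and the declared length (the class and method
names are hypothetical, not part of PDFBox):

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not a PDFBox class.
final class StreamLengthCheck {
    private static final byte[] ENDSTREAM =
            "endstream".getBytes(StandardCharsets.US_ASCII);

    private StreamLengthCheck() {}

    /**
     * Returns true if "endstream" follows the stream data, allowing for one
     * optional end-of-line marker (CR, LF or CRLF) that is not counted in /Length.
     *
     * @param pdf            the raw bytes of the PDF file
     * @param dataStart      offset of the first data byte after "stream" + EOL
     * @param declaredLength the /Length value from the stream dictionary
     */
    static boolean lengthMatches(byte[] pdf, int dataStart, int declaredLength) {
        if (dataStart < 0 || declaredLength < 0
                || dataStart > pdf.length
                || declaredLength > pdf.length - dataStart) {
            return false; // declared length runs past the end of the file
        }
        int pos = dataStart + declaredLength;
        // Skip the optional end-of-line marker before the keyword.
        if (pos < pdf.length && pdf[pos] == '\r') {
            pos++;
        }
        if (pos < pdf.length && pdf[pos] == '\n') {
            pos++;
        }
        if (pos + ENDSTREAM.length > pdf.length) {
            return false;
        }
        for (int i = 0; i < ENDSTREAM.length; i++) {
            if (pdf[pos + i] != ENDSTREAM[i]) {
                return false;
            }
        }
        return true;
    }
}
```

  A false result for the image stream would be consistent with what
evince reports, but it does not by itself prove that the length is the
cause of the decoder error.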

  The files that exhibit this problem have sensitive information, and
I don't feel comfortable sharing those.
  However, if any of the developers needs a sample, I can probably
provide one off-list.

  Please advise.

Re: Unable to read PDF with embedded Black-And-White TIFF.

Posted by Andreas Lehmkuehler <an...@lehmi.de>.
Hi,

On 29.07.2013 16:39, Lachezar Dobrev wrote:
>    Hello colleagues.
>
>    Since a month or two I've started using PDF Box to read PDF files
> received from a scanner. Recently some of the users started receiving
> this error:
>
>> java.lang.RuntimeException: EOL encountered in black run.
>>   at org.apache.pdfbox.filter.TIFFFaxDecoder.decodeNextScanline(TIFFFaxDecoder.java:677)
>>   at org.apache.pdfbox.filter.TIFFFaxDecoder.decode2D(TIFFFaxDecoder.java:766)
>>   at org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecodeFilter.java:120)
>>   at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:295)
>>   at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:237)
>>   at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:172)
>>   at org.apache.pdfbox.pdmodel.graphics.xobject.PDCcitt.getRGBImage(PDCcitt.java:155)
There are some known issues concerning the CCITT filter. Most likely yours is
related to them.
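
Note that the exception in the trace is an unchecked RuntimeException, so
code that only catches IOException will not see it. Until the decoder
handles such streams, a caller could guard each image decode so that one
bad image does not abort the whole document. A minimal sketch in plain
Java; SafeDecode and tryDecode are hypothetical names, and the wrapped
call would be whatever triggers the decode (in the trace above,
getRGBImage()):

```java
import java.util.Optional;
import java.util.concurrent.Callable;

// Hypothetical helper, not a PDFBox class.
final class SafeDecode {
    private SafeDecode() {}

    /**
     * Runs the given decode step and returns its result, or an empty
     * Optional if the step throws. Catching Exception covers both checked
     * exceptions such as IOException and unchecked ones such as the
     * RuntimeException "EOL encountered in black run."
     */
    static <T> Optional<T> tryDecode(Callable<T> decoder) {
        try {
            return Optional.ofNullable(decoder.call());
        } catch (Exception e) {
            // Report and continue; the caller decides how to handle the gap.
            System.err.println("Skipping undecodable image: " + e);
            return Optional.empty();
        }
    }
}
```

Usage would look like SafeDecode.tryDecode(() -> image.getRGBImage()),
where image is the image object being read. This only hides the symptom;
the affected image is skipped, not repaired.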

>    Opening the same file with evince yields:
>> Syntax Error (40958): Missing 'endstream' or incorrect stream length
>
>    But the file still displays and the image content is visible.
>
>    The files that exhibit this problem have sensitive information, and
> I don't feel comfortable sharing those.
>    However if anyone of the developers needs a sample I can probably
> provide one off-list.
Send it to me and I'll take a look to check whether my assumption is correct.

>    Please advise.

BR
Andreas Lehmkühler