You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2009/02/02 12:39:59 UTC
[jira] Commented: (PDFBOX-81) Excetion while extracting images
[ https://issues.apache.org/jira/browse/PDFBOX-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669562#action_12669562 ]
Andreas Lehmkühler commented on PDFBOX-81:
------------------------------------------
JBIG2 is a (rarely??) used compression format espacially for bi-level (b/w) images such as faxes or scans and by now it is not supported by pdfbox, yet.
See also http://www.jpeg.org/jbig/jbigpt2.html
> Excetion while extracting images
> --------------------------------
>
> Key: PDFBOX-81
> URL: https://issues.apache.org/jira/browse/PDFBOX-81
> Project: PDFBox
> Issue Type: Bug
> Components: PDFReader
> Affects Versions: 0.8.0-incubator
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1259747
> Originally submitted by guzzil on 2005-08-15 02:40.
> when trying to extract images from I pdf, i get exceptions
> like
> Exception in thread "main" java.io.IOException: Unknown
> stream filter:COSName{JBIG2Decode}
> at
> org.pdfbox.filter.FilterManager.getFilter(FilterManager.java:116)
> at
> org.pdfbox.cos.COSStream.doDecode(COSStream.java:276)
> at
> org.pdfbox.cos.COSStream.doDecode(COSStream.java:240)
> at
> org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:173)
> at
> org.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:205)
> at
> org.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:458)
> at
> org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:131)
> at
> org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.write2OutputStream(PDPixelMap.java:153)
> at
> org.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:117)
> at
> org.pdfbox.ExtractImages.extractImages(ExtractImages.java:169)
> at
> org.pdfbox.ExtractImages.main(ExtractImages.java:73)
>
> The pdfs are scanned images, which are afterwards
> optimized with Adobe Acrobats "optimize" function.
>
> pdfimages from xpdf can extract the images.
>
> I can send you a pdf with this error (it is to big for an
> upload).
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES
> user_id=601708
> yes please upload the pdf to ftp.pdfbox.org and I will take a
> look at it.
> Ben Litchfield
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.