Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2009/02/02 12:39:59 UTC
[jira] Commented: (PDFBOX-81) Exception while extracting images

    [ https://issues.apache.org/jira/browse/PDFBOX-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669562#action_12669562 ] 

Andreas Lehmkühler commented on PDFBOX-81:
------------------------------------------

JBIG2 is a (probably rarely used) compression format intended especially for bi-level (black-and-white) images such as faxes or scans. PDFBox does not support it yet.

See also http://www.jpeg.org/jbig/jbigpt2.html
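To illustrate why the extraction fails, here is a minimal, self-contained sketch of a filter lookup by name. It is not PDFBox code: the class FilterLookupSketch, its methods, and the list of supported filters are assumptions made up for this example. Only the filter names themselves and the "Unknown stream filter:" message text are taken from the PDF format and from the stack trace quoted below.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch, not the PDFBox implementation.
public class FilterLookupSketch {
    // Illustrative set of standard PDF filter names; NOT PDFBox's actual registry.
    private static final Set<String> SUPPORTED = new HashSet<>(Arrays.asList(
            "FlateDecode", "LZWDecode", "DCTDecode", "CCITTFaxDecode",
            "ASCIIHexDecode", "ASCII85Decode", "RunLengthDecode"));

    // Returns the filter name if it is registered, otherwise throws,
    // mirroring the message seen in the reported stack trace.
    static String getFilter(String name) throws IOException {
        if (!SUPPORTED.contains(name)) {
            throw new IOException("Unknown stream filter:" + name);
        }
        return name;
    }

    // Lets a caller skip an image instead of aborting the whole extraction.
    static boolean canDecode(String name) {
        try {
            getFilter(name);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        for (String name : new String[] {"DCTDecode", "JBIG2Decode"}) {
            System.out.println(name
                    + (canDecode(name) ? ": extract" : ": skip (unsupported filter)"));
        }
    }
}
```

Under these assumptions, an extraction loop could check canDecode first and skip JBIG2-compressed images rather than stopping at the first one.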

> Exception while extracting images
> ---------------------------------
>
>                 Key: PDFBOX-81
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-81
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDFReader
>    Affects Versions: 0.8.0-incubator
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1259747
> Originally submitted by guzzil on 2005-08-15 02:40.
> When trying to extract images from a PDF, I get exceptions like:
> Exception in thread "main" java.io.IOException: Unknown stream filter:COSName{JBIG2Decode}
>         at org.pdfbox.filter.FilterManager.getFilter(FilterManager.java:116)
>         at org.pdfbox.cos.COSStream.doDecode(COSStream.java:276)
>         at org.pdfbox.cos.COSStream.doDecode(COSStream.java:240)
>         at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:173)
>         at org.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:205)
>         at org.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:458)
>         at org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:131)
>         at org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.write2OutputStream(PDPixelMap.java:153)
>         at org.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:117)
>         at org.pdfbox.ExtractImages.extractImages(ExtractImages.java:169)
>         at org.pdfbox.ExtractImages.main(ExtractImages.java:73)
>   
> The PDFs are scanned images that were afterwards
> optimized with Adobe Acrobat's "optimize" function.
>   
> pdfimages from xpdf can extract the images.  
>  
> I can send you a PDF with this error (it is too big for an
> upload).
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Yes, please upload the PDF to ftp.pdfbox.org and I will take a
> look at it.
> Ben Litchfield

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.