You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2014/06/01 00:11:01 UTC

[jira] [Comment Edited] (PDFBOX-2105) Support for multipage TIFFs in CCITTFactory, makes PDFBox capable of doing tiff2pdf

    [ https://issues.apache.org/jira/browse/PDFBOX-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14014834#comment-14014834 ] 

Tilman Hausherr edited comment on PDFBOX-2105 at 5/31/14 10:10 PM:
-------------------------------------------------------------------

Done for the 1.8 branch too in 1598926 and 1598928.


was (Author: tilman):
Done for the 1.8 branch too in 1598926.

> Support for multipage TIFFs in CCITTFactory, makes PDFBox capable of doing tiff2pdf
> -----------------------------------------------------------------------------------
>
>                 Key: PDFBOX-2105
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2105
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: PDModel
>    Affects Versions: 1.8.5, 1.8.6, 2.0.0
>            Reporter: Antti Lankila
>            Assignee: Tilman Hausherr
>            Priority: Minor
>              Labels: CCITTFaxDecode, ccitt, features, patch
>             Fix For: 1.8.6, 2.0.0
>
>         Attachments: pdfbox-multipagetiff.diff
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> I created a patch based on Sergey Ushakov's work that handles multipage TIFFs. This allows fast and efficient conversion from TIFF to PDF
> The general approach is to provide a new factory method that accepts an image (page) number, and then appropriate page number is located when the CCITT stream is being extracted.
> There's a minor inefficiency in this approach because the seek starts from the beginning for each page, causing O(N^2) algorithm when extracting every page, but maximum size for file appears to be 2 GB and the cost for finding a single page will still be low, so I bet this will never come up in practice.
> There is no method that tells how many pages TIFF files have. I opted to simply return null from the factory method that accepts page number if there is no such page, so users can use this as condition to break from a TIFF to PDF conversion loop.



--
This message was sent by Atlassian JIRA
(v6.2#6252)