Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2018/08/22 16:20:00 UTC

[jira] [Commented] (PDFBOX-4297) Allow to space efficiently analyse large PDFs

    [ https://issues.apache.org/jira/browse/PDFBOX-4297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16589087#comment-16589087 ] 

Tilman Hausherr commented on PDFBOX-4297:
-----------------------------------------

I think what you want is parsing on demand. If that existed, a tool that reports 1+2+3 would be just one tool among many.
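
To illustrate what reading on demand could look like at the lowest level, here is a minimal sketch using only the Java standard library. It is not PDFBox code and not a proposed implementation. It relies on the fact that a PDF ends with the keyword "startxref", the byte offset of the last cross-reference section, and "%%EOF", so that offset can be found by reading only the tail of the file. The class name TailScan, its method names and the 1024-byte window are assumptions made for this example. A real tool would then seek to the returned offset and read the trailer dictionary, whose /Encrypt entry indicates an encrypted file.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of PDFBox.
final class TailScan {

    // Assumption: "startxref" lies within the last 1024 bytes of the file,
    // which is typical for well-formed PDFs without trailing garbage.
    static final int TAIL_SIZE = 1024;

    // Returns the number that follows the last "startxref" keyword
    // in the given bytes, or -1 if the keyword or the number is missing.
    static long parseTail(byte[] tail) {
        // ISO-8859-1 maps each byte to exactly one char, so indices stay aligned.
        String text = new String(tail, StandardCharsets.ISO_8859_1);
        int keyword = text.lastIndexOf("startxref");
        if (keyword < 0) {
            return -1;
        }
        int pos = keyword + "startxref".length();
        // Skip the end-of-line characters between keyword and offset.
        while (pos < text.length() && Character.isWhitespace(text.charAt(pos))) {
            pos++;
        }
        int start = pos;
        while (pos < text.length() && text.charAt(pos) >= '0' && text.charAt(pos) <= '9') {
            pos++;
        }
        return pos > start ? Long.parseLong(text.substring(start, pos)) : -1;
    }

    // Reads at most TAIL_SIZE bytes from the end of the file, so memory use
    // does not depend on the size of the PDF.
    static long findStartXref(String path) throws IOException {
        try (RandomAccessFile file = new RandomAccessFile(path, "r")) {
            int len = (int) Math.min(TAIL_SIZE, file.length());
            byte[] tail = new byte[len];
            file.seek(file.length() - len);
            file.readFully(tail);
            return parseTail(tail);
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java TailScan <file.pdf>");
            return;
        }
        System.out.println(findStartXref(args[0]));
    }
}
```

The sketch stops at locating the cross-reference section. Resolving the catalog, the embedded-files name tree or signature dictionaries from there would still need an on-demand object parser, which is the missing piece referred to above.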

> Allow to space efficiently analyse large PDFs
> ---------------------------------------------
>
>                 Key: PDFBOX-4297
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4297
>             Project: PDFBox
>          Issue Type: Improvement
>            Reporter: Ralf Hauser
>            Priority: Major
>
> Assume you get a PDF of 300+ MB and need to know
> 1) the file names of embedded files if any
> 2) whether it is encrypted (symmetric or asymmetric)
> 3) certification level (and whether it is signed)
> This should not use more than 5 MB (extra) memory
>  
> P.S.: seems to be an example of https://pdfbox.apache.org/ideas.html  "Handle large PDF files"
>  


