You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2016/06/24 16:28:16 UTC

[jira] [Commented] (PDFBOX-3398) Text (XML) output of pdf structure

    [ https://issues.apache.org/jira/browse/PDFBOX-3398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15348528#comment-15348528 ] 

Tilman Hausherr commented on PDFBOX-3398:
-----------------------------------------

This sounds like replacing one complex format (PDF) with another complex format (XML). You could print the PDF to XPS, that is XML based. Compared to what you get with PDFDebugger, I'd say that an XML representation is not helpful.

> Text (XML) output of pdf structure
> ----------------------------------
>
>                 Key: PDFBOX-3398
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3398
>             Project: PDFBox
>          Issue Type: New Feature
>          Components: Parsing, Utilities
>            Reporter: Stefan Hegny
>            Priority: Minor
>
> It would be nice to have a text/xml representation output to pdf file of the entire document structure as can be browsed in the debugger window GUI. It would allow for easier searching and understanding of the structure. Not sure if it should be an option to PDFReader/PDFDebugger  or a separate class that might also be bundled into an app jar. I would even start working on it given the preferred base to start on



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org