Posted to dev@pdfbox.apache.org by "Matthew Jung (Jira)" <ji...@apache.org> on 2021/05/25 01:24:00 UTC

[jira] [Updated] (PDFBOX-5198) When merging multiple pdf ua documents, Tags become nested

     [ https://issues.apache.org/jira/browse/PDFBOX-5198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matthew Jung updated PDFBOX-5198:
---------------------------------
    Attachment: non_nested_tags_4documents_combined_using+adobe_pro.tif
                nested_tags_4documents_merged_using_pdfbox.tif

Hi Hausherr,
I am not sure how to attach the image files to the Jira case. These are not the source files, but they show how the tags look after 4 files were merged using PDFBox versus the same 4 files merged using Adobe Acrobat DC.
Thank you,
Matt Jung


    On Monday, May 24, 2021, 01:23:33 AM EDT, Ozeki Jung <oz...@yahoo.com> wrote:  
 
  Thank you, Hausherr. Let me work on getting the bad and good destination files.
Regards,
Matthew Jung
    On Friday, May 21, 2021, 11:39:02 PM EDT, Tilman Hausherr (Jira) <ji...@apache.org> wrote:  
 
 
    [ https://issues.apache.org/jira/browse/PDFBOX-5198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17349593#comment-17349593 ] 

Tilman Hausherr edited comment on PDFBOX-5198 at 5/22/21, 3:38 AM:
-------------------------------------------------------------------

Please attach two source files, a "bad" destination file and a "good" destination file. Also retry with 2.0.24 just to be sure.


was (Author: tilman):
Please attach two source files, a "bad" destination file and a "good" destination file.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


> When merging multiple pdf ua documents, Tags become nested
> ----------------------------------------------------------
>
>                 Key: PDFBOX-5198
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5198
>             Project: PDFBox
>          Issue Type: Wish
>    Affects Versions: 2.0.21
>            Reporter: Matthew Jung
>            Priority: Major
>         Attachments: nested_tags_4documents_merged_using_pdfbox.tif, non_nested_tags_4documents_combined_using+adobe_pro.tif
>
>
> When merging PDF/UA documents, the tags shown in Adobe Reader are nested. If 200 documents are merged, the tags are nested 200 levels deep. This does not appear to stop the JAWS screen reader from reading the document, but it may slow down performance when loading into a content repository.
> <DOCUMENT>
>           <DOCUMENT>
>                        <DOCUMENT>
> When Adobe DC is used to merge multiple documents, the tags are flattened:
> <DOCUMENT>
>      <DOCUMENT>
>      <DOCUMENT>
>      <DOCUMENT>
>  
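To make the difference between the two tag layouts concrete, here is a minimal sketch in plain Java. It does not use PDFBox and is not how PDFBox or Acrobat actually build the structure tree; the `Tag` class and the `mergeNested`, `mergeFlat`, and `depth` helpers are hypothetical names introduced only for illustration. It assumes the nested case chains each Document tag under the previous one, and the flat case places every Document tag directly under a single root.

```java
import java.util.ArrayList;
import java.util.List;

final class TagNestingSketch {

    /** Hypothetical stand-in for a structure element; not a PDFBox class. */
    static final class Tag {
        final String name;
        final List<Tag> kids = new ArrayList<>();

        Tag(String name) {
            this.name = name;
        }
    }

    /** Assumed nested layout: each Document tag hangs under the previous one. */
    static Tag mergeNested(int documentCount) {
        Tag root = new Tag("Document");
        Tag current = root;
        for (int i = 1; i < documentCount; i++) {
            Tag child = new Tag("Document");
            current.kids.add(child);
            current = child;
        }
        return root;
    }

    /** Assumed flat layout: every Document tag is a direct child of one root. */
    static Tag mergeFlat(int documentCount) {
        Tag root = new Tag("Document");
        for (int i = 0; i < documentCount; i++) {
            root.kids.add(new Tag("Document"));
        }
        return root;
    }

    /** Number of levels from this tag down to its deepest descendant. */
    static int depth(Tag tag) {
        int deepest = 0;
        for (Tag kid : tag.kids) {
            deepest = Math.max(deepest, depth(kid));
        }
        return 1 + deepest;
    }

    public static void main(String[] args) {
        System.out.println("nested depth: " + depth(mergeNested(200)));
        System.out.println("flat depth:   " + depth(mergeFlat(200)));
    }
}
```

Under these assumptions the nested layout grows one level per merged document, while the flat layout stays at two levels no matter how many documents are merged.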




---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org