You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/01/31 18:20:00 UTC

[jira] [Commented] (PDFBOX-5372) *LOADS of* "WARNING: key node000xxxxx already exists in destination IDTree"

    [ https://issues.apache.org/jira/browse/PDFBOX-5372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17484840#comment-17484840 ] 

Tilman Hausherr commented on PDFBOX-5372:
-----------------------------------------

What PDFBox version are you using?

The name warning means that these names are in both files, so your result structure tree might be useless.

> *LOADS of* "WARNING: key node000xxxxx already exists in destination IDTree"
> ---------------------------------------------------------------------------
>
>                 Key: PDFBOX-5372
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5372
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: nunop5
>            Priority: Major
>
> Hello,
> I've been using PDFBox since very very long, and it's been working very well 👍
>  
> However, for 2 sets of PDFs I'm trying to merge in the past day, for some reason I'm always getting LOADS of these Warnings, with a "out of memory" crash at the end.
> I believe (but could be wrong) that the "out of memory" is due to these endless Warnings.
> (heap is currently 15g)
>  
> But, does anyone know what could be causing these warnings? (anything in specific you'd suggest me to look at, in these PDFs?)
>  
> Thank you very much!
>  
> (... LOADS of these, redacted ...)
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018714 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018715 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018716 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018717 already exists in destination IDTree
> Jan 31, 2022 2:41:48 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018718 already exists in destination IDTree
> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>         at org.apache.pdfbox.io.ScratchFileBuffer.addPage(ScratchFileBuffer.java:132)
>         at org.apache.pdfbox.io.ScratchFileBuffer.ensureAvailableBytesInPage(ScratchFileBuffer.java:184)
>         at org.apache.pdfbox.io.ScratchFileBuffer.write(ScratchFileBuffer.java:236)
>         at org.apache.pdfbox.io.RandomAccessOutputStream.write(RandomAccessOutputStream.java:46)
>         at org.apache.pdfbox.cos.COSStream$2.write(COSStream.java:281)
>         at org.apache.pdfbox.io.IOUtils.copy(IOUtils.java:70)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:127)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:117)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:117)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:117)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:109)
>         at org.apache.pdfbox.multipdf.PDFCloneUtility.cloneForNewDocument(PDFCloneUtility.java:146)
>         at org.apache.pdfbox.multipdf.PDFMergerUtility.appendDocument(PDFMergerUtility.java:800)
>         at org.apache.pdfbox.multipdf.PDFMergerUtility.legacyMergeDocuments(PDFMergerUtility.java:459)
>         at org.apache.pdfbox.multipdf.PDFMergerUtility.mergeDocuments(PDFMergerUtility.java:346)
>         at org.apache.pdfbox.tools.PDFMerger.merge(PDFMerger.java:70)
>         at org.apache.pdfbox.tools.PDFMerger.main(PDFMerger.java:49)
>         at org.apache.pdfbox.tools.PDFBox.main(PDFBox.java:81)
>  
> –
>  
> EDIT - Increased the heap, and seems I got something new too:
>  
> Jan 31, 2022 10:50:46 AM org.apache.pdfbox.multipdf.PDFMergerUtility mergeIDTree
> WARNING: key node00018714 already exists in destination IDTree
> Jan 31, 2022 10:51:29 AM org.apache.pdfbox.cos.COSDocument finalize
> *WARNING: Warning: You did not close a PDF Document*
> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>  
> Not sure if this means something is actually wrong with one of the PDFs, or a consequence of going out of memory.
> (and how do I know what's the PDF it's referring to? :) )
> I'm using Pupeteer to generate the PDFs, so I assume it generated them well... (also, it's consistently failing even if I re-generate all of them)



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org