Posted to users@pdfbox.apache.org by Matthew Clemente <mj...@gmail.com> on 2018/07/24 00:07:44 UTC

Removing Embedded Index

First of all, thanks for all the feedback this mailing list has provided
to previous questions; it’s been incredibly helpful.

I have a pdf created with Acrobat that has an embedded index that I would
like to remove.

When I use the debugger, I can see it under "Root/PieceInfo/SearchIndex".

I’ve attached an example pdf and a screenshot of the debugger, with the
path.

Is there a way that I can access and remove the embedded search index with
PDFBox?

Thanks!

-- 
Matthew Clemente

Re: Removing Embedded Index

Posted by Matthew Clemente <mj...@gmail.com>.
That did it! And that approach is really helpful to know; I didn’t realize
I could access elements that way.

Thank you!

-- 
Matthew Clemente

From: Tilman Hausherr <th...@t-online.de>
Reply: users@pdfbox.apache.org <us...@pdfbox.apache.org>
Date: July 24, 2018 at 12:26:17 AM
To: users@pdfbox.apache.org <us...@pdfbox.apache.org>
Subject: Re: Removing Embedded Index

Am 24.07.2018 um 02:07 schrieb Matthew Clemente:
> First of all, thanks for all the feedback this mailing list has
> provided to previous questions; it’s been incredibly helpful.
>
> I have a pdf created with Acrobat that has an embedded index that I
> would like to remove.
>
> When I use the debugger, I can see it under
> "Root/PieceInfo/SearchIndex".
>
> I’ve attached an example pdf and a screenshot of the debugger, with
> the path.
>
> Is there a way that I can access and remove the embedded search index
> with PDFBox?

Your attachments didn't get through; you'll have to upload them to a
file-hosting service.

This will likely work:

((COSDictionary) document.getDocumentCatalog().getCOSObject().getItem("PieceInfo")).removeItem(COSName.getPDFName("SearchIndex"));


Tilman


>
> Thanks!
>
> --
> Matthew Clemente
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org

Re: Removing Embedded Index

Posted by Tilman Hausherr <TH...@t-online.de>.
Am 24.07.2018 um 02:07 schrieb Matthew Clemente:
> First of all, thanks for all the feedback this mailing list has
> provided to previous questions; it’s been incredibly helpful.
>
> I have a pdf created with Acrobat that has an embedded index that I 
> would like to remove.
>
> When I use the debugger, I can see it under "Root/PieceInfo/SearchIndex".
>
> I’ve attached an example pdf and a screenshot of the debugger, with 
> the path.
>
> Is there a way that I can access and remove the embedded search index 
> with PDFBox?

Your attachments didn't get through; you'll have to upload them to a
file-hosting service.

This will likely work:

((COSDictionary) document.getDocumentCatalog().getCOSObject().getItem("PieceInfo")).removeItem(COSName.getPDFName("SearchIndex"));

Tilman
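
For readers who want the one-liner above in a complete program, here is a
minimal sketch, assuming the PDFBox 2.0.x API (PDDocument.load; PDFBox 3.x
loads documents differently). The class name RemoveSearchIndex, the helper
removeSearchIndex, and the command-line arguments are hypothetical names
chosen for illustration. It uses getDictionaryObject rather than getItem so
that an indirectly referenced PieceInfo dictionary is resolved, and it checks
types instead of casting blindly.

```java
import java.io.File;
import java.io.IOException;

import org.apache.pdfbox.cos.COSBase;
import org.apache.pdfbox.cos.COSDictionary;
import org.apache.pdfbox.cos.COSName;
import org.apache.pdfbox.pdmodel.PDDocument;

public class RemoveSearchIndex {

    private static final COSName PIECE_INFO = COSName.getPDFName("PieceInfo");
    private static final COSName SEARCH_INDEX = COSName.getPDFName("SearchIndex");

    /**
     * Removes the Root/PieceInfo/SearchIndex entry if it exists.
     * Returns true if an entry was removed, false if there was nothing to remove.
     */
    static boolean removeSearchIndex(PDDocument document) {
        COSDictionary catalog = document.getDocumentCatalog().getCOSObject();

        // getDictionaryObject follows indirect references; getItem would not.
        COSBase pieceInfo = catalog.getDictionaryObject(PIECE_INFO);
        if (!(pieceInfo instanceof COSDictionary)) {
            return false; // no PieceInfo dictionary in this document
        }

        COSDictionary pieceInfoDict = (COSDictionary) pieceInfo;
        if (!pieceInfoDict.containsKey(SEARCH_INDEX)) {
            return false; // PieceInfo exists but has no SearchIndex entry
        }

        pieceInfoDict.removeItem(SEARCH_INDEX);
        return true;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: RemoveSearchIndex <input.pdf> <output.pdf>");
            return;
        }
        // Assumption: PDFBox 2.0.x, where PDDocument.load(File) is available.
        try (PDDocument document = PDDocument.load(new File(args[0]))) {
            boolean removed = removeSearchIndex(document);
            document.save(new File(args[1]));
            System.out.println(removed ? "SearchIndex removed" : "No SearchIndex found");
        }
    }
}
```

Saving to a separate output file leaves the original untouched, which makes
it easy to compare both files in the PDFBox debugger afterwards.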

