You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by lalit gupta <la...@yahoo.com.INVALID> on 2017/04/10 11:32:33 UTC
Help urgently
Hi team , I have got and assignment where I need to extract some information from the PDF file and then based on that info I need to split that file into multiple files.
Here now I splitting is not a problem but I am finding difficult to extract the information.
E.g.
Fund name : abcPortfolio I'd : 1234
Now suppose I want to extract fund name and portfolio I'd then how to achieve this . There may be many fields in PDF which I need to extract. Could u suggest or send sample code .
Sent from Yahoo Mail on Android
Re: Help urgently
Posted by Tilman Hausherr <TH...@t-online.de>.
Am 10.04.2017 um 13:32 schrieb lalit gupta:
> Hi team , I have got and assignment where I need to extract some information from the PDF file and then based on that info I need to split that file into multiple files.
> Here now I splitting is not a problem but I am finding difficult to extract the information.
> E.g.
> Fund name : abcPortfolio I'd : 1234
> Now suppose I want to extract fund name and portfolio I'd then how to achieve this . There may be many fields in PDF which I need to extract. Could u suggest or send sample code .
>
> Sent from Yahoo Mail on Android
Have you found out whether the information is in an AcroForm field or
part of the normal content stream? (Use the printFields example and the
ExtractText command line tool)
Can you share a sample PDF?
Tilman
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org