You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2014/10/23 20:19:34 UTC
[jira] [Closed] (PDFBOX-1066) There is no functionlaity of reading
the text line by line with its input field
[ https://issues.apache.org/jira/browse/PDFBOX-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andreas Lehmkühler closed PDFBOX-1066.
--------------------------------------
Resolution: Not a Problem
Assignee: Andreas Lehmkühler
PDFs aren't organized in lines. So, if you want to read a pdf line by line you have to extract the whole text first. It should be easy to process that result line by line without PDFBox.
> There is no functionlaity of reading the text line by line with its input field
> -------------------------------------------------------------------------------
>
> Key: PDFBOX-1066
> URL: https://issues.apache.org/jira/browse/PDFBOX-1066
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.7.3
> Environment: Windows
> Reporter: Nishant
> Assignee: Andreas Lehmkühler
> Labels: patch
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> I am trying to read the PDF texts along with its input type like textfield/checkboxes. What i found is TextStripper is pasing the whole document and retuning the string in getText(). And using Acroform.getfields i am able ot get all fields.
> But I have perticuler requierment of reading the texts and its input type. Do we have any class/method which can resolve this issue.
> Its very urgent.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)