You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2014/10/23 20:19:34 UTC

[jira] [Closed] (PDFBOX-1066) There is no functionlaity of reading the text line by line with its input field

     [ https://issues.apache.org/jira/browse/PDFBOX-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler closed PDFBOX-1066.
--------------------------------------
    Resolution: Not a Problem
      Assignee: Andreas Lehmkühler

PDFs aren't organized in lines. So, if you want to read a pdf line by line you have to extract the whole text first. It should be easy to process that result line by line without PDFBox.


> There is no functionlaity of reading the text line by line with its input field
> -------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1066
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1066
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 0.7.3
>         Environment: Windows
>            Reporter: Nishant
>            Assignee: Andreas Lehmkühler
>              Labels: patch
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> I am trying to read the PDF texts along with its input type like textfield/checkboxes. What i found is TextStripper is pasing the whole document and retuning the string in getText(). And using Acroform.getfields i am able ot get all fields. 
> But I have perticuler requierment of reading the texts and its input type. Do we have any class/method which can resolve this issue. 
> Its very urgent.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)