You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2014/04/29 12:43:21 UTC

[jira] [Closed] (PDFBOX-2043) While Reading a PDF which contains Image the Content of the PDF is misaligned in the resulting text.

     [ https://issues.apache.org/jira/browse/PDFBOX-2043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler closed PDFBOX-2043.
--------------------------------------

    Resolution: Not a Problem
      Assignee: Andreas Lehmkühler

[~venkateshmpr] There is some space but no spaces. The given pdf doesn't contain any spaces at the postion you've mentioned and PDFBox doesn't add some spaces to preserve the (visual) formatting. It simply extracts the whole text of a pdf. So everything works as expected.

Set to closed

> While Reading a PDF which contains Image the Content of the PDF is misaligned in the resulting text.
> ----------------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-2043
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2043
>             Project: PDFBox
>          Issue Type: Bug
>         Environment: Visual Studio 2005
>            Reporter: Venkatesan
>            Assignee: Andreas Lehmkühler
>         Attachments: Result.txt, Sample.pdf, Space.jpg
>
>
> We are trying to read content of a PDF file, The PDF has images in the header. We use the PDFTextStripper.getText() method. After calling this method the resulting text is misaligned compare to the Original PDF.



--
This message was sent by Atlassian JIRA
(v6.2#6252)