You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Sigitas Limontas (JIRA)" <ji...@apache.org> on 2013/12/06 15:27:35 UTC
[jira] [Comment Edited] (PDFBOX-1797) PDFText2HTML incorrectly
interprets indentation
[ https://issues.apache.org/jira/browse/PDFBOX-1797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13841296#comment-13841296 ]
Sigitas Limontas edited comment on PDFBOX-1797 at 12/6/13 2:26 PM:
-------------------------------------------------------------------
Lorem.pdf - test pdf file
out.html - result html
expected.html - expected result
was (Author: sigitas):
Test PDF
> PDFText2HTML incorrectly interprets indentation
> -----------------------------------------------
>
> Key: PDFBOX-1797
> URL: https://issues.apache.org/jira/browse/PDFBOX-1797
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.8.3
> Reporter: Sigitas Limontas
> Attachments: Lorem.pdf, expected.html, out.html
>
>
> PDFText2HTML incorrectly interprets one line paragraphs.
--
This message was sent by Atlassian JIRA
(v6.1#6144)