You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2017/05/09 10:15:04 UTC
[jira] [Comment Edited] (PDFBOX-3674) Incorrect ordering of fatha
-- potentially indicative of larger issue with RTL
[ https://issues.apache.org/jira/browse/PDFBOX-3674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15999993#comment-15999993 ]
Andreas Lehmkühler edited comment on PDFBOX-3674 at 5/9/17 10:14 AM:
---------------------------------------------------------------------
[~tilman] you're right my "fix" was nonsense and it doesn't fix anything :-(
was (Author: lehmi):
[~tilman] you're right my "fix" was nonsense. :-(
> Incorrect ordering of fatha -- potentially indicative of larger issue with RTL
> ------------------------------------------------------------------------------
>
> Key: PDFBOX-3674
> URL: https://issues.apache.org/jira/browse/PDFBOX-3674
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Reporter: Tim Allison
> Assignee: Andreas Lehmkühler
> Priority: Minor
>
> On TIKA-2257, [~ccreutzig] shared a file that triggers PDFBox to flip the order of the fatha. I suspect this is happening in {{normalizeAdd}} within PDFTextStripper, but I'm not familiar enough with the code to diagnose and fix.
> I confirmed this is still happening in trunk.
> Triggering file and the start of a diagnosis is available on the Tika issue.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org