You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pdfbox.apache.org by "Justin LeFebvre (JIRA)" <ji...@apache.org> on 2009/04/06 21:35:12 UTC

[jira] Issue Comment Edited: (PDFBOX-105) Missing repeated characters

    [ https://issues.apache.org/jira/browse/PDFBOX-105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12696162#action_12696162 ] 

Justin LeFebvre edited comment on PDFBOX-105 at 4/6/09 12:33 PM:
-----------------------------------------------------------------

Cannot reproduce this problem. This issue should be resolved. 

      was (Author: justinl):
    Cannot reproduce this problem,  though there is a separate issue where question mark on the first page is backwards. 
  
> Missing repeated characters
> ---------------------------
>
>                 Key: PDFBOX-105
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-105
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1339022
> Originally submitted by nobody on 2005-10-26 16:44.
> When I extract the first line of text from each page, 
> sometimes a 2nd letter in a 2 character repeated letter 
> is dropped. Like: aplication instead of application
> So I did:
> stripper.setSuppressDuplicateOverlappingText(false);
> This helped, but is this a reliable fix and expected 
> behavior?
> example pdf included
> brp@cayuse.com
> [attachment on SourceForge]
> http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1339022&file_id=153919
> RP.pdf (application/pdf), 215389 bytes
> example with headings

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.