You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Bernard (JIRA)" <ji...@apache.org> on 2010/05/06 01:22:10 UTC

[jira] Created: (PDFBOX-721) Regression from 1.1.0 to 0.7.8 on accents

Regression from 1.1.0 to 0.7.8 on accents
-----------------------------------------

                 Key: PDFBOX-721
                 URL: https://issues.apache.org/jira/browse/PDFBOX-721
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 1.1.0
         Environment: Android
            Reporter: Bernard


The included file text extraction was OK on 0.7.8, but not on 1.1.0

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (PDFBOX-721) Regression from 1.1.0 to 0.7.8 on accents

Posted by "Bernard (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bernard updated PDFBOX-721:
---------------------------

    Attachment: 4-4-ObjectifExport.pdf

> Regression from 1.1.0 to 0.7.8 on accents
> -----------------------------------------
>
>                 Key: PDFBOX-721
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-721
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.1.0
>         Environment: Android
>            Reporter: Bernard
>         Attachments: 4-4-ObjectifExport.pdf
>
>
> The included file text extraction was OK on 0.7.8, but not on 1.1.0

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Closed: (PDFBOX-721) Regression from 1.1.0 to 0.7.8 on accents

Posted by "Bernard (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bernard closed PDFBOX-721.
--------------------------

    Fix Version/s: 1.1.0
       Resolution: Later

It seems the accent are correct, but can not be rendered using Android fonts.  A conversion between Unicode and html should be done :


            	if(c==321)	s1 = "&egrave;";
            	else if(c==216)	s1 = "&eacute;";
            	else if(c==223)	s1 = "&ucirc;";
            	else if(c==8217)	s1 = "'";
            	else if(c==338)	s1 = "&ecirc;";
            	else if(c==248)	s1 = "&ucirc;";
            	

> Regression from 1.1.0 to 0.7.8 on accents
> -----------------------------------------
>
>                 Key: PDFBOX-721
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-721
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.1.0
>         Environment: Android
>            Reporter: Bernard
>             Fix For: 1.1.0
>
>         Attachments: 4-4-ObjectifExport.pdf
>
>
> The included file text extraction was OK on 0.7.8, but not on 1.1.0

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PDFBOX-721) Regression from 1.1.0 to 0.7.8 on accents

Posted by "Andreas Lehmkühler (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PDFBOX-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864847#action_12864847 ] 

Andreas Lehmkühler commented on PDFBOX-721:
-------------------------------------------

What exactly goes wrong, can you be more specific, please.

The current trunk seems to work like a charme.

P.S.: Correct me if I'm wrong , but the mentioned 0.7.8 version has to be a typo. AFAIK there isn't no such version.

> Regression from 1.1.0 to 0.7.8 on accents
> -----------------------------------------
>
>                 Key: PDFBOX-721
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-721
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.1.0
>         Environment: Android
>            Reporter: Bernard
>         Attachments: 4-4-ObjectifExport.pdf
>
>
> The included file text extraction was OK on 0.7.8, but not on 1.1.0

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.