Posted to fop-dev@xmlgraphics.apache.org by bu...@apache.org on 2010/08/31 11:26:56 UTC

DO NOT REPLY [Bug 49849] New: PDF links do only support ISO encoding

https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

           Summary: PDF links do only support ISO encoding
           Product: Fop
           Version: all
          Platform: PC
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: pdf
        AssignedTo: fop-dev@xmlgraphics.apache.org
        ReportedBy: maximilian.aster@boc-eu.com


The current version of FOP (1.0) supports only "ISO-8859-1" encoding for
PDF actions such as links.

See PDFDocument.java

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

DO NOT REPLY [Bug 49849] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

--- Comment #1 from maximilian.aster@boc-eu.com 2010-08-31 05:32:47 EDT ---
Created an attachment (id=25963)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=25963)
test case

Test case with some Polish characters


DO NOT REPLY [Bug 49849] [PATCH] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

--- Comment #8 from Glenn Adams <ga...@apache.org> 2012-04-24 05:56:11 UTC ---
(In reply to comment #6)
> see comment 5

Max, I am still awaiting your input as requested above. If I see no further
input by April 30, I will close this bug for lack of the requested information.
Regards, Glenn


DO NOT REPLY [Bug 49849] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

maximilian.aster@boc-eu.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED

--- Comment #3 from maximilian.aster@boc-eu.com 2010-08-31 05:37:24 EDT ---
See patch


DO NOT REPLY [Bug 49849] [PATCH] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

--- Comment #5 from Glenn Adams <gl...@skynav.com> 2012-04-01 21:08:49 UTC ---
(In reply to comment #3)
> See patch

A brief look at this patch shows that it simply changes the output encoding
used for the PDFDocument.encode() function as follows:

-    public static final String ENCODING = "ISO-8859-1";
+    public static final String ENCODING = "UTF-8";

I believe this is incorrect. PDF files employ three string types:

(1) byte string (unspecified encoding)
(2) ASCII string (US-ASCII encoding)
(3) text string (either PDFDocEncoding or UTF-16BE)

Since (1) the encode() mechanism is used in a variety of contexts and (2) no
explicit use of UTF-8 is made by PDF, it would be incorrect to simply change
the output encoding returned by encode().

See ISO/IEC 32000 (2008), Section 7.9.2 for details.
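
As a rough illustration of string type (3), here is a minimal sketch of how
a text string might be encoded. It is not FOP code: the class and method
names are hypothetical. It assumes printable US-ASCII characters can be
written as single bytes (PDFDocEncoding agrees with ASCII in that range) and
that anything else is written as UTF-16BE preceded by the byte order mark
FE FF.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical sketch, not part of FOP: encodes a PDF "text string".
public class PdfTextStringSketch {

    /** True if every char is printable US-ASCII (0x20..0x7E). */
    public static boolean isPrintableAscii(String s) {
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c < 0x20 || c > 0x7E) {
                return false;
            }
        }
        return true;
    }

    /** Single bytes when all chars are printable ASCII, else BOM + UTF-16BE. */
    public static byte[] encodeTextString(String s) {
        if (isPrintableAscii(s)) {
            return s.getBytes(StandardCharsets.US_ASCII);
        }
        byte[] utf16 = s.getBytes(StandardCharsets.UTF_16BE);
        byte[] out = new byte[utf16.length + 2];
        out[0] = (byte) 0xFE; // byte order mark, first byte
        out[1] = (byte) 0xFF; // byte order mark, second byte
        System.arraycopy(utf16, 0, out, 2, utf16.length);
        return out;
    }

    /** Formats bytes as a PDF hexadecimal string literal, e.g. <6C696E6B>. */
    public static String toHexString(byte[] bytes) {
        StringBuilder sb = new StringBuilder("<");
        for (byte b : bytes) {
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.append('>').toString();
    }

    public static void main(String[] args) {
        // Plain ASCII stays one byte per character.
        System.out.println(toHexString(encodeTextString("link")));
        // "Lodz" with Polish letters (U+0141, U+00F3, d, U+017A) needs UTF-16BE.
        System.out.println(toHexString(encodeTextString("\u0141\u00F3d\u017A")));
    }
}
```

The point of the sketch is that the choice of encoding depends on the string
type and its content, which a single global ENCODING constant cannot express.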

This patch needs to be reworked to take these details into account.
Furthermore, the description of this bug is inadequate; it does not
explain what the problem actually is:

* Is the problem that the text content of basic-link is not rendered with
Polish characters? If so, it is a font selection problem, not a character
encoding problem.

* Is it related to the character encoding used in the /Filespec dictionary for
the link annotation?

In any case, the present patch MUST NOT be applied.


DO NOT REPLY [Bug 49849] [PATCH] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

--- Comment #7 from Glenn Adams <gl...@skynav.com> 2012-04-07 01:41:47 UTC ---
resetting P2 open bugs to P3 pending further review


DO NOT REPLY [Bug 49849] [PATCH] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

Glenn Adams <gl...@skynav.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Priority|P2                          |P3


DO NOT REPLY [Bug 49849] [PATCH] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

Vincent Hennebert <vh...@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|PDF links do only support   |[PATCH] PDF links do only
                   |ISO encoding                |support ISO encoding


DO NOT REPLY [Bug 49849] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

--- Comment #2 from maximilian.aster@boc-eu.com 2010-08-31 05:33:51 EDT ---
Created an attachment (id=25964)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=25964)
patch to utf-8

Changes the encoding to UTF-8


DO NOT REPLY [Bug 49849] [PATCH] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

Glenn Adams <gl...@skynav.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|REOPENED                    |NEEDINFO

--- Comment #6 from Glenn Adams <gl...@skynav.com> 2012-04-01 21:09:51 UTC ---
see comment 5


DO NOT REPLY [Bug 49849] PDF links do only support ISO encoding

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49849

Vincent Hennebert <vh...@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|FIXED                       |

--- Comment #4 from Vincent Hennebert <vh...@gmail.com> 2010-08-31 09:27:21 EDT ---
Hi,

Thanks for your patch. This bug should remain open until it has actually been
committed. Otherwise we will lose track of it.

Vincent
