You are viewing a plain text version of this content. The canonical link for it is here.
Posted to fop-users@xmlgraphics.apache.org by "Griffin,Sean" <SG...@CERNER.COM> on 2009/04/30 16:04:22 UTC

Incorrect display of accented characters in plain font

Fellow FOP users,
I am looking for some help in an issue I am investigating for one of my clients that actually doesn't appear to have much to do with FOP at all.  However, since there's a good chance that someone on this list knows more than I do about how fonts map to glyphs in PostScript, I figure this might be a good pool to ask.

The issue is that a PDF generated from FOP with accented characters stored in the FO documented as Windows-1252 encoding, in either Helvetica or Times fonts, will display just fine when viewed in a PDF viewer, and they'll print just fine when printed from the PDF viewer.  However, when that PDF is ran through a PostScript interpreter (Preview SDK from Liberty Systems), the accented characters are mangled.  For example, é converts to D.  What's strange is that this only happens if it's plain font -- not bold, italic, or bold-italic.

Now I understand that plain, bold, italic, and bold-italic are 4 different fonts, and I've looked through the PostScript specification to see how the interpreter is supposed to map the font to the glyph, but I still can't figure out what's going on.  And finally, when trying this same code execution here in house, in the US (the client is in France), I cannot reproduce the problem.

Does anyone know if PostScript interpreters look anywhere on the PC itself (this is Windows) to find the font glyph information, or does anyone know why the decoding of characters would be different for one font than the other, even within the same font family?

Thanks ahead of time for any clues you can offer.

Sean Griffin | MSVC Architect | Cerner Corporation | 816.201.1599 | sgriffin@cerner.com | www.cerner.com <http://www.cerner.com/> 


----------------------------------------------------------------------
CONFIDENTIALITY NOTICE This message and any included attachments are from Cerner Corporation and are intended only for the addressee. The information contained in this message is confidential and may constitute inside or non-public information under international, federal, or state securities laws. Unauthorized forwarding, printing, copying, distribution, or use of such information is strictly prohibited and may be unlawful. If you are not the addressee, please promptly delete this message and notify the sender of the delivery error by e-mail or you may call Cerner's corporate offices in Kansas City, Missouri, U.S.A at (+1) (816)221-1024.

RE: Incorrect display of accented characters in plain font

Posted by "Griffin,Sean" <SG...@CERNER.COM>.
I need to clarify something I have below.  It should be this:

 

when that PDF is ran through a PostScript interpreter (Preview SDK from Liberty Systems) and sent to the printer, the accented characters are mangled on the printed output.

                                                                                                                                                                                                                                                         

From: Griffin,Sean [mailto:SGRIFFIN@CERNER.COM] 
Sent: Thursday, April 30, 2009 9:04 AM
To: fop-users@xmlgraphics.apache.org
Subject: Incorrect display of accented characters in plain font

 

Fellow FOP users,

I am looking for some help in an issue I am investigating for one of my clients that actually doesn't appear to have much to do with FOP at all.  However, since there's a good chance that someone on this list knows more than I do about how fonts map to glyphs in PostScript, I figure this might be a good pool to ask.

The issue is that a PDF generated from FOP with accented characters stored in the FO documented as Windows-1252 encoding, in either Helvetica or Times fonts, will display just fine when viewed in a PDF viewer, and they'll print just fine when printed from the PDF viewer.  However, when that PDF is ran through a PostScript interpreter (Preview SDK from Liberty Systems), the accented characters are mangled.  For example, é converts to D.  What's strange is that this only happens if it's plain font -- not bold, italic, or bold-italic.

Now I understand that plain, bold, italic, and bold-italic are 4 different fonts, and I've looked through the PostScript specification to see how the interpreter is supposed to map the font to the glyph, but I still can't figure out what's going on.  And finally, when trying this same code execution here in house, in the US (the client is in France), I cannot reproduce the problem.

Does anyone know if PostScript interpreters look anywhere on the PC itself (this is Windows) to find the font glyph information, or does anyone know why the decoding of characters would be different for one font than the other, even within the same font family?

Thanks ahead of time for any clues you can offer.

Sean Griffin | MSVC Architect | Cerner Corporation | 816.201.1599 | sgriffin@cerner.com <ma...@cerner.com>  | www.cerner.com <http://www.cerner.com/> 

________________________________

CONFIDENTIALITY NOTICE This message and any included attachments are from Cerner Corporation and are intended only for the addressee. The information contained in this message is confidential and may constitute inside or non-public information under international, federal, or state securities laws. Unauthorized forwarding, printing, copying, distribution, or use of such information is strictly prohibited and may be unlawful. If you are not the addressee, please promptly delete this message and notify the sender of the delivery error by e-mail or you may call Cerner's corporate offices in Kansas City, Missouri, U.S.A at (+1) (816)221-1024.