You are viewing a plain text version of this content. The canonical link for it is here.
Posted to fop-users@xmlgraphics.apache.org by Keen Tim <Ti...@nrm.qld.gov.au> on 2002/03/01 06:55:27 UTC
RE: Displaying characters in PDF

Well, I've finally had a chance to look at this. Here's what happened:

FWIW I'm using FOP 0.20.3 (the latest binary release).

If I try an encoding of 'cp1252' or 'windows-1252' in the xml file I get
an error saying that the encoding is not supported. Not happy with this
I added the latest xerces.jar (1.4.4) to CLASSPATH and it worked with an
encoding of 'windows-1252'. The same happened when I added
xercesImpl.jar (Xerces-2.0.0) to CLASSPATH.

Thank you Rainer, once again, for your help with this problem.

I've copied this to the dev list so a developer can pick this up and
change whatever it is they need to change if they so desire.

Cheers

Tim

-----Original Message-----
From: Keen Tim 
Sent: Tuesday, 5 February 2002 9:25
To: fop-user@xml.apache.org
Subject: RE: Displaying characters in PDF


Rainer,

I think you hit the nail on the head. Initially the character encoding
of the source was not supported by the xml-parser of fop. This led me to
use iso-8859-1 just to get it working. As you suggest this is probably
not the appropriate encoding.

As you guessed the most likely source of this data is a windows editor,
so on your sound advice I looked at those web sites and will try
'cp1252' and 'windows-1252'.

Thanks, once again, for your help. If I have any more problems I will
certainly let you (and others) know.

Cheers

Tim




-----Original Message-----
From: Rainer Garus [mailto:rainer.garus@arcor.de]
Sent: Tuesday, 5 February 2002 7:07
To: fop-user@xml.apache.org
Subject: RE: Displaying characters in PDF


> I'm fairly dumb when it comes to fonts. My problem is that I don't
> control the source of the character and I would expect that in most
> cases the characters that are a problem are in ANSI set from 127-255.
> From what you're saying I'll have to pre-parse the string and replace
it
> with the matching character in Adobe. Is that right? Is there another
> way?

If the character encoding of the source is supported by the xml-parser
of fop then you have to do nothing. The xml-parser maps the characters
to the correct unicode values. Fop use the xerces parser, see
http://xml.apache.org/xerces-j/faq-general.html#faq-8 for the supported
encodings.

What character encoding do you use? It seems to be an extended ascii
encoding which contains printable characters in the interval 128 .. 159,
so it is not iso-8895-1. You can try cp1252 (latin 1 windows) (see
http://czyborra.com/charsets/iso8859.html).

Rainer Garus



************************************************************************
The information in this e-mail together with any attachments is
intended only for the person or entity to which it is addressed
and may contain confidential and/or privileged material.

Any form of review, disclosure, modification, distribution
and/or publication of this e-mail message is prohibited.  

If you have received this message in error, you are asked to
inform the sender as quickly as possible and delete this message
and any copies of this message from your computer and/or your
computer system network.  
************************************************************************




************************************************************************
The information in this e-mail together with any attachments is
intended only for the person or entity to which it is addressed
and may contain confidential and/or privileged material.

Any form of review, disclosure, modification, distribution
and/or publication of this e-mail message is prohibited.  

If you have received this message in error, you are asked to
inform the sender as quickly as possible and delete this message
and any copies of this message from your computer and/or your
computer system network.  
************************************************************************