Posted to users@cocoon.apache.org by Jesse Reynolds <li...@va.com.au> on 2001/08/28 07:08:57 UTC
MacRoman text encoding and C2
Hello
I've been using the following xml declaration in Cocoon 1 with
excellent results:
<?xml version="1.0" encoding="MacRoman"?>
(because our live XML data is coming from a database on a Mac and
there are some non-ASCII characters we're sending out that need to be
recognised appropriately by Cocoon)
Anyway, Cocoon 2 doesn't seem to like this at all... I get the following error:
org.apache.cocoon.ProcessingException: Failed to execute
pipeline.:java.lang.RuntimeException: Problem in getTransformer:The
encoding "MacRoman" is not supported.
So, is the MacRoman text encoding simply not supported by
Cocoon 2, or is there something I can do to make this work?
Thanks
-jesse
--
Jesse Reynolds - Virtual Artists Pty Ltd - http://www.va.com.au
jesse (at) va.com.au
---------------------------------------------------------------------
Please check that your question has not already been answered in the
FAQ before posting. <http://xml.apache.org/cocoon/faqs.html>
To unsubscribe, e-mail: <co...@xml.apache.org>
For additional commands, e-mail: <co...@xml.apache.org>
Re: MacRoman text encoding and C2
Posted by Gianugo Rabellino <g....@bibop.it>.
On Tue, Aug 28, 2001 at 05:19:24PM +0200, Fulvio Picecchi wrote:
> <?php
> $XMLdoc_Latin1 = "Some characters: òàùèéì";
> $XMLdoc_UTF8 = iconv("ISO-8859-1", "UTF-8", $XMLdoc_Latin1);
> $XMLdoc_MacRoman = iconv("ISO-8859-1", "MacRoman", $XMLdoc_Latin1);
> ?>
> Docs here: http://www.php.net/manual/en/ref.iconv.php
> I can try to find some solutions in C2 but I'm really a newbie.
> I'm really sorry Jesse to not be able to help you. And for speaking
> about PHP in a wrong place ...
Might not be that wrong. While I do think that there might be a
much better solution, there should be a sort of quick and dirty
way to use PHP right inside Cocoon. Look at the PHP generator
(it does not get compiled by default, it needs the PHP libraries).
Hope it helps,
--
Gianugo Rabellino
Re: MacRoman text encoding and C2
Posted by Fulvio Picecchi <fu...@prm.it>.
Well, I'm really sorry but I can't really help you 'cause I'm absolutely
new to Cocoon. I'm just learning, for the moment. I come from PHP
development and this operation in PHP is done in an extremely fast way
with the iconv() function:
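<?php
// Assumes this script file is itself saved as ISO-8859-1, so the literal holds Latin-1 bytes.
$XMLdoc_Latin1 = "Some characters: òàùèéì";
// Convert the Latin-1 string to UTF-8 and to MacRoman.
$XMLdoc_UTF8 = iconv("ISO-8859-1", "UTF-8", $XMLdoc_Latin1);
$XMLdoc_MacRoman = iconv("ISO-8859-1", "MacRoman", $XMLdoc_Latin1);
?>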
Docs here: http://www.php.net/manual/en/ref.iconv.php
I can try to find a solution in C2, but I'm really a newbie.
I'm really sorry, Jesse, not to be able to help you. And for talking
about PHP in the wrong place ...
Sincerely,
--
Fulvio Picecchi
Jesse Reynolds wrote:
> Thanks Fulvio
>
> Looks like the JDK's native2ascii is what I want, only I'd need to make
> my own Generator out of it or something in order to perform the
> translation live, yeah?
>
> And that scares me!
>
> Do you think a Generator is the right way to go here? (One that
> basically does the same as the XML file generator or the serverpages
> generator, but performs the translation to UTF-8 beforehand.)
>
> Cheers
>
> Jesse
>
Re: MacRoman text encoding and C2
Posted by "Piroumian, Konstantin" <KP...@flagship.ru>.
Another option might be to post your problem to the Xerces or Xalan dev
mailing list. If C1 supports that encoding, then I think Xerces/Xalan does
the job, so either there is a reason the support was removed, or they simply
won't add it until somebody asks them for it.
Also, as it's open source, you can look at the other encoder classes in
Xalan, write your own, and send it to xalan-dev.
Regards,
Konstantin Piroumian.
----- Original Message -----
From: "Jesse Reynolds" <li...@va.com.au>
To: <co...@xml.apache.org>
Sent: Tuesday, August 28, 2001 5:37 PM
Subject: Re: MacRoman text encoding and C2
> Thanks Fulvio
>
> Looks like the JDK's native2ascii is what I want, only I'd need to
> make my own Generator out of it or something in order to perform the
> translation live, yeah?
>
> And that scares me!
>
> Do you think a Generator is the right way to go here? (One that
> basically does the same as the XML file generator or the serverpages
> generator, but performs the translation to UTF-8 beforehand.)
>
> Cheers
>
> Jesse
>
Re: MacRoman text encoding and C2
Posted by Jesse Reynolds <li...@va.com.au>.
Thanks Fulvio
Looks like the JDK's native2ascii is what I want, only I'd need to
make my own Generator out of it or something in order to perform the
translation live, yeah?
And that scares me!
Do you think a Generator is the right way to go here? (One that
basically does the same as the XML file generator or the serverpages
generator, but performs the translation to UTF-8 beforehand.)
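For what it's worth, the "translate before parsing" idea does not strictly need native2ascii. Below is a minimal sketch, not Cocoon's Generator API: it uses only standard JAXP/SAX classes and assumes the JDK in use ships the x-MacRoman charset (MacRoman is an alias for it). The bytes are decoded by a Reader, so the parser is handed characters; according to the SAX InputSource documentation, a parser given a character stream disregards the encoding named in the XML declaration. The class and method names are made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.Charset;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

public class MacRomanSaxSource {

    /** Wraps raw MacRoman bytes so the parser receives already-decoded characters. */
    static InputSource macRomanSource(InputStream rawBytes) {
        // Assumption: the running JDK provides x-MacRoman (extended charsets).
        Reader decoded = new InputStreamReader(rawBytes, Charset.forName("x-MacRoman"));
        return new InputSource(decoded);
    }

    /** Parses the stream and returns its character data, to show the decoding worked. */
    static String textOf(InputStream rawBytes) throws Exception {
        final StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }
        };
        SAXParserFactory.newInstance().newSAXParser().parse(macRomanSource(rawBytes), handler);
        return text.toString();
    }

    public static void main(String[] args) throws Exception {
        // The declaration still says MacRoman; the Reader decides the decoding.
        String xml = "<?xml version=\"1.0\" encoding=\"MacRoman\"?><p>cherch\u00e9</p>";
        byte[] bytes = xml.getBytes(Charset.forName("x-MacRoman"));
        System.out.println(textOf(new ByteArrayInputStream(bytes)));
    }
}
```

Whether something like this can be dropped into a C2 pipeline depends on how the Generator obtains its input, which this sketch does not cover.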
Cheers
Jesse
At 3:01 PM +0200 28/8/01, Fulvio Picecchi wrote:
>Sorry Jesse, I didn't know that C1 supports platform-dependent
>charset encodings ...
>Anyway, if you ever need to do this conversion, maybe these tools
>can help you:
>- Francois Pinard has written "recode" for Linux/Unix:
>http://www.iro.umontreal.ca/contrib/recode/
>- But I think you already have what you need. Look inside the bin/
>directory of your JDK; you'll find a tool called native2ascii that
>you can use this way:
>native2ascii -encoding MacRoman example.xml example.jtx
>native2ascii -reverse -encoding UTF-8 example.jtx example.xml
>The supported encodings:
>http://java.sun.com/j2se/1.3.0/docs/guide/intl/encoding.doc.html
>And a little help for this tool:
>http://java.sun.com/j2se/1.3.0/docs/tooldocs/win32/native2ascii.html
>
>Hope this can help you. Sincerely.
>--
>Fulvio Picecchi
--
Jesse Reynolds - Virtual Artists Pty Ltd - http://www.va.com.au
jesse (at) va.com.au
Re: MacRoman text encoding and C2
Posted by Fulvio Picecchi <fu...@prm.it>.
Sorry Jesse, I didn't know that C1 supports platform-dependent charset
encodings ...
Anyway, if you ever need to do this conversion, maybe these tools can
help you:
- Francois Pinard has written "recode" for Linux/Unix:
http://www.iro.umontreal.ca/contrib/recode/
- But I think you already have what you need. Look inside the bin/
directory of your JDK; you'll find a tool called native2ascii that you
can use this way:
native2ascii -encoding MacRoman example.xml example.jtx
native2ascii -reverse -encoding UTF-8 example.jtx example.xml
The supported encodings:
http://java.sun.com/j2se/1.3.0/docs/guide/intl/encoding.doc.html
And a little help for this tool:
http://java.sun.com/j2se/1.3.0/docs/tooldocs/win32/native2ascii.html
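The two native2ascii steps above amount to "decode as MacRoman, re-encode as UTF-8". A minimal Java sketch of the same conversion in one pass follows; it assumes the JDK provides the x-MacRoman charset (MacRoman is an alias), and the class name and the input/output arguments are placeholders, not part of any existing tool.

```java
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Reader;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class MacRomanToUtf8 {

    // Assumption: x-MacRoman is available in the running JDK.
    static final Charset MAC_ROMAN = Charset.forName("x-MacRoman");

    /** Reads MacRoman bytes from in and writes the same text to out as UTF-8. */
    static void transcode(InputStream in, OutputStream out) throws IOException {
        Reader reader = new InputStreamReader(in, MAC_ROMAN);
        Writer writer = new OutputStreamWriter(out, StandardCharsets.UTF_8);
        char[] buffer = new char[4096];
        int count;
        while ((count = reader.read(buffer)) != -1) {
            writer.write(buffer, 0, count);
        }
        writer.flush(); // push any buffered characters through the encoder
    }

    public static void main(String[] args) throws IOException {
        // Usage: java MacRomanToUtf8 <macroman-input> <utf8-output>
        try (InputStream in = new FileInputStream(args[0]);
             OutputStream out = new FileOutputStream(args[1])) {
            transcode(in, out);
        }
    }
}
```

Neither this sketch nor native2ascii touches the XML declaration, so after converting, the encoding="MacRoman" in the first line of the file still has to be changed to UTF-8 (or removed) before a parser reads the bytes.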
Hope this can help you. Sincerely.
--
Fulvio Picecchi
Jesse Reynolds wrote:
> Thanks Fulvio for your kind thoughts.
>
> I'm fully aware that it would be much better if our database could send
> out UTF-8, but that's a lot of work that I'd rather not do. Cocoon1 has
> been doing a perfect job of understanding MacRoman to date; it would
> seem a great pity to have lost this functionality.
>
> There must be a way of doing it in a pipeline somehow. Hopefully someone
> has already written the code.
>
> Cheers
>
> Jesse
...
Re: MacRoman text encoding and C2
Posted by Jesse Reynolds <li...@va.com.au>.
Thanks Fulvio for your kind thoughts.
I'm fully aware that it would be much better if our database could
send out UTF-8, but that's a lot of work that I'd rather not do.
Cocoon1 has been doing a perfect job of understanding MacRoman to
date; it would seem a great pity to have lost this functionality.
There must be a way of doing it in a pipeline somehow. Hopefully
someone has already written the code.
Cheers
Jesse
At 12:12 PM +0200 28/8/01, Fulvio Picecchi wrote:
>Don't really know if Cocoon2 supports MacRoman, but when composing
>XML docs it would always be better to _not_ use a platform-dependent
>encoding like MacRoman (Mac) or Cp1252 (Windows).
>
>If you try writing something like this in MacRoman <<Je l'ai cherché
>de tous bords, tous côtés>> a PC user would see <<Je lÕai cherchŽ
>de tous bords, tous c™tŽs>>.
>
>In general the best way would be to use UTF-8 (if you have a text
>editor that supports it), the same encoding as this e-mail message,
>which covers the characters of all the world's languages, or one of
>the Latin encodings that are ISO standards: Latin-1 (ISO 8859-1), etc.
>Sincerely.
>
>--
>Fulvio Picecchi
--
Jesse Reynolds - Virtual Artists Pty Ltd - http://www.va.com.au
jesse (at) va.com.au
Re: MacRoman text encoding and C2
Posted by Fulvio Picecchi <fu...@prm.it>.
Don't really know if Cocoon2 supports MacRoman, but when composing XML
docs it would always be better to _not_ use a platform-dependent
encoding like MacRoman (Mac) or Cp1252 (Windows).
If you try writing something like this in MacRoman <<Je l'ai cherché de
tous bords, tous côtés>> a PC user would see <<Je lÕai cherchŽ de tous
bords, tous c™tŽs>>.
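The garbling described above can be reproduced in a few lines. This is only an illustrative sketch: it assumes the JDK provides the x-MacRoman and windows-1252 charsets, that the apostrophe in the original was the typographic one (U+2019), and the class and method names are invented for the example. Non-ASCII characters are written as Unicode escapes so the source file's own encoding does not matter.

```java
import java.nio.charset.Charset;

public class MojibakeDemo {

    /** Encodes text with one charset, then decodes the same bytes with another. */
    static String misread(String text, String writtenAs, String readAs) {
        byte[] bytes = text.getBytes(Charset.forName(writtenAs));
        return new String(bytes, Charset.forName(readAs));
    }

    public static void main(String[] args) {
        // "Je l'ai cherché de tous bords, tous côtés" with a typographic apostrophe.
        String original = "Je l\u2019ai cherch\u00e9 de tous bords, tous c\u00f4t\u00e9s";
        // Written on a Mac (MacRoman), read on a PC (Cp1252 / windows-1252).
        String seenOnPc = misread(original, "x-MacRoman", "windows-1252");
        System.out.println(seenOnPc);
    }
}
```

The byte values explain the result: MacRoman stores é as 0x8E, ô as 0x99 and the typographic apostrophe as 0xD5, and Cp1252 reads those same bytes as Ž, ™ and Õ.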
In general the best way would be to use UTF-8 (if you have a text editor
that supports it), the same encoding as this e-mail message, which
covers the characters of all the world's languages, or one of the Latin
encodings that are ISO standards: Latin-1 (ISO 8859-1), etc.
Sincerely.
--
Fulvio Picecchi
Jesse Reynolds wrote:
> Hello
>
> I've been using the following xml declaration in Cocoon 1 with excellent
> results:
>
> <?xml version="1.0" encoding="MacRoman"?>
>
> (because our live XML data is coming from a database on a Mac and there
> are some non-ASCII characters we're sending out that need to be
> recognised appropriately by Cocoon)
>
> Anyway, Cocoon 2 doesn't seem to like this at all... I get the following
> error:
>
> org.apache.cocoon.ProcessingException: Failed to execute
> pipeline.:java.lang.RuntimeException: Problem in getTransformer:The
> encoding "MacRoman" is not supported.
>
>
> So, is the MacRoman text encoding just completely not supported by
> Cocoon 2? or is there something I can do to make this work?
>
> Thanks
>
> -jesse
>
>
--
Fulvio Picecchi