You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@lenya.apache.org by qMax <qm...@mediasoft.ru> on 2005/01/31 11:02:19 UTC
using UTF-8 in titles, meta and file content
If I get right,
in order to properly use utf encoding in navigation titles and meta data,
one needs to change all serializers: "xhtml-iso-8859-1", "html-iso-8859-1", "text-iso-8859-1"
into "xhtml","html" and "text" at least in files
lenya/info.xmap
lenya/usecase.xmap
this also automagically makes all files to be stored in utf-8 as well,
and thus - manually viewable and editable w/out any black magic.
otherwise all national characters became totally unusable in titles and meta
(they got converted to &#dddd; and then stored as &dddd;),
and files are stored in latin-1 as well, holding all characters
now could someone point me:
what was the reason to put iso-8859-1 there ?
will my "fix" break some other aspects ?
and if so - where should i also to tune ?
--
qMax
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@lenya.apache.org
For additional commands, e-mail: user-help@lenya.apache.org
Re: using UTF-8 in titles, meta and file content
Posted by "Gregor J. Rothfuss" <gr...@apache.org>.
Rolf Kulemann wrote:
>>and also to change all serializers
>>in global-sitemap.xmap and sitemap.xmap
>>to have utf-8 encoding.
>>
>>i hope this will be fixed in future versions.
the only laggards in the 1.2 tree are lucene.xmap and admin.xmap.
everything else uses utf-8 already.
--
Gregor J. Rothfuss
COO, Wyona Content Management Solutions http://wyona.com
Apache Lenya http://lenya.apache.org
gregor.rothfuss@wyona.com gregor@apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org
Re: using UTF-8 in titles, meta and file content
Posted by Rolf Kulemann <ro...@apache.org>.
On Thu, 2005-02-10 at 10:10, qMax wrote:
> Monday, January 31, 2005, 4:02:19 PM, qmax@mediasoft.ru wrote:
>
> q> If I get right,
>
> q> in order to properly use utf encoding in navigation titles and meta data,
> q> one needs to change all serializers: "xhtml-iso-8859-1",
> q> "html-iso-8859-1", "text-iso-8859-1"
> q> into "xhtml","html" and "text" at least in files
> q> lenya/info.xmap
> q> lenya/usecase.xmap
> actually, everywhere in *.xmap files
In general you are right.
>
> and also to change all serializers
> in global-sitemap.xmap and sitemap.xmap
> to have utf-8 encoding.
>
> i hope this will be fixed in future versions.
I hope so too. We recently switched some parts to utf-8 in the 1.2.1
release. It was a bit experimental, since we had a lot of problems with
utf-8 in the past. However, I guess we made it except for some small
exceptions, like you encountered.
Would you mind to create a bugzilla entry and submit the patches?
Thanks a lot.
--
Rolf Kulemann
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org
Re: using UTF-8 in titles, meta and file content
Posted by qMax <qm...@mediasoft.ru>.
Monday, January 31, 2005, 4:02:19 PM, qmax@mediasoft.ru wrote:
q> If I get right,
q> in order to properly use utf encoding in navigation titles and meta data,
q> one needs to change all serializers: "xhtml-iso-8859-1",
q> "html-iso-8859-1", "text-iso-8859-1"
q> into "xhtml","html" and "text" at least in files
q> lenya/info.xmap
q> lenya/usecase.xmap
actually, everywhere in *.xmap files
and also to change all serializers
in global-sitemap.xmap and sitemap.xmap
to have utf-8 encoding.
i hope this will be fixed in future versions.
--
qMax
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@lenya.apache.org
For additional commands, e-mail: user-help@lenya.apache.org
Re: using UTF-8 in titles, meta and file content
Posted by "Gregor J. Rothfuss" <gr...@apache.org>.
qMax wrote:
> If I get right,
>
> in order to properly use utf encoding in navigation titles and meta data,
> one needs to change all serializers: "xhtml-iso-8859-1", "html-iso-8859-1", "text-iso-8859-1"
> into "xhtml","html" and "text" at least in files
> lenya/info.xmap
> lenya/usecase.xmap
>
> this also automagically makes all files to be stored in utf-8 as well,
> and thus - manually viewable and editable w/out any black magic.
correct
> otherwise all national characters became totally unusable in titles and meta
> (they got converted to &#dddd; and then stored as &dddd;),
> and files are stored in latin-1 as well, holding all characters
>
> now could someone point me:
> what was the reason to put iso-8859-1 there ?
ignorance, at the time.
> will my "fix" break some other aspects ?
make sure you don't mix encodings
> and if so - where should i also to tune ?
should be fine. actually, most pipelines in 1.2 already use utf-8. the
only pipelines i found that still use iso are lucene and one in admin.
--
Gregor J. Rothfuss
COO, Wyona Content Management Solutions http://wyona.com
Apache Lenya http://lenya.apache.org
gregor.rothfuss@wyona.com gregor@apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@lenya.apache.org
For additional commands, e-mail: user-help@lenya.apache.org