Posted to user@lenya.apache.org by qMax <qm...@mediasoft.ru> on 2005/01/31 11:02:19 UTC

using UTF-8 in titles, meta and file content

If I get it right,

in order to properly use UTF-8 encoding in navigation titles and meta data,
one needs to change all serializers: "xhtml-iso-8859-1", "html-iso-8859-1", "text-iso-8859-1"
into "xhtml", "html" and "text", at least in the files
lenya/info.xmap
lenya/usecase.xmap

this also automagically makes all files get stored in utf-8,
and thus manually viewable and editable without any black magic.
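
A rough sketch of the renaming described above, in Python. It assumes the serializer names appear in the sitemaps as double-quoted attribute values spelled exactly as listed; the function name drop_iso_suffix and the sample map:serialize line are illustrative and not taken from the Lenya sources.

```python
import re

# Matches the three quoted serializer names listed above and captures the
# part that precedes the "-iso-8859-1" suffix.
ISO_SERIALIZER = re.compile(r'"(xhtml|html|text)-iso-8859-1"')


def drop_iso_suffix(sitemap_text: str) -> str:
    """Rename "xhtml-iso-8859-1" etc. to "xhtml" etc. in sitemap text."""
    return ISO_SERIALIZER.sub(r'"\1"', sitemap_text)


# Hypothetical sitemap line, for illustration only.
sample = '<map:serialize type="xhtml-iso-8859-1"/>'
print(drop_iso_suffix(sample))
```

Since this is a plain text substitution, it would also rename any serializer declared under one of those names, so it is safer to try it on a copy of the *.xmap files and review the diff.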

otherwise all national characters become totally unusable in titles and meta
(they get converted to &#dddd; and then stored as &amp;dddd;),
and files are stored in latin-1 as well, holding all characters
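
The double escaping described above can be reproduced outside Lenya. The following Python sketch is only an illustration of the general mechanism, not of Lenya's actual code path: a character that ISO-8859-1 cannot represent is first replaced by a numeric character reference, and a second escaping pass then escapes the ampersand of that reference. The sample character is an arbitrary choice.

```python
from xml.sax.saxutils import escape

title = "\u0436"  # a Cyrillic letter, not representable in ISO-8859-1

# Step 1: serializing to ISO-8859-1 replaces the character with a
# numeric character reference.
as_latin1 = title.encode("iso-8859-1", errors="xmlcharrefreplace").decode("iso-8859-1")

# Step 2: escaping the already-serialized text again turns the "&" of the
# reference into "&amp;", so the reference no longer resolves.
stored = escape(as_latin1)

print(as_latin1)  # &#1078;
print(stored)     # &amp;#1078;
```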

now could someone tell me:
what was the reason for putting iso-8859-1 there?
will my "fix" break anything else?
and if so, where else should I make changes?

-- 
 qMax


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@lenya.apache.org
For additional commands, e-mail: user-help@lenya.apache.org


Re: using UTF-8 in titles, meta and file content

Posted by "Gregor J. Rothfuss" <gr...@apache.org>.
Rolf Kulemann wrote:

>>and also to change all serializers
>>in global-sitemap.xmap and sitemap.xmap
>>to have utf-8 encoding.
>>
>>i hope this will be fixed in future versions.

the only laggards in the 1.2 tree are lucene.xmap and admin.xmap. 
everything else uses utf-8 already.


-- 
Gregor J. Rothfuss
COO, Wyona       Content Management Solutions    http://wyona.com
Apache Lenya                              http://lenya.apache.org
gregor.rothfuss@wyona.com                       gregor@apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lenya.apache.org
For additional commands, e-mail: dev-help@lenya.apache.org


Re: using UTF-8 in titles, meta and file content

Posted by Rolf Kulemann <ro...@apache.org>.
On Thu, 2005-02-10 at 10:10, qMax wrote:
> Monday, January 31, 2005, 4:02:19 PM, qmax@mediasoft.ru wrote:
> 
> q> If I get right,
> 
> q> in order to properly use utf encoding in navigation titles and meta data,
> q> one needs to change all serializers: "xhtml-iso-8859-1",
> q> "html-iso-8859-1", "text-iso-8859-1"
> q> into "xhtml","html" and "text" at least in files
> q> lenya/info.xmap
> q> lenya/usecase.xmap
> actually, everywhere in *.xmap files

In general you are right.

> 
> and also to change all serializers
> in global-sitemap.xmap and sitemap.xmap
> to have utf-8 encoding.
> 
> i hope this will be fixed in future versions.

I hope so too. We recently switched some parts to utf-8 in the 1.2.1
release. It was a bit experimental, since we had a lot of problems with
utf-8 in the past. However, I think we got there, apart from a few small
exceptions like the ones you encountered.

Would you mind creating a Bugzilla entry and submitting the patches?

Thanks a lot.

-- 
Rolf Kulemann




Re: using UTF-8 in titles, meta and file content

Posted by qMax <qm...@mediasoft.ru>.
Monday, January 31, 2005, 4:02:19 PM, qmax@mediasoft.ru wrote:

q> If I get right,

q> in order to properly use utf encoding in navigation titles and meta data,
q> one needs to change all serializers: "xhtml-iso-8859-1",
q> "html-iso-8859-1", "text-iso-8859-1"
q> into "xhtml","html" and "text" at least in files
q> lenya/info.xmap
q> lenya/usecase.xmap
actually, everywhere in the *.xmap files

one also needs to change all serializers
in global-sitemap.xmap and sitemap.xmap
to use utf-8 encoding.

i hope this will be fixed in future versions.

-- 
 qMax




Re: using UTF-8 in titles, meta and file content

Posted by "Gregor J. Rothfuss" <gr...@apache.org>.
qMax wrote:
> If I get right,
> 
> in order to properly use utf encoding in navigation titles and meta data,
> one needs to change all serializers: "xhtml-iso-8859-1", "html-iso-8859-1", "text-iso-8859-1"
> into "xhtml","html" and "text" at least in files
> lenya/info.xmap
> lenya/usecase.xmap
> 
> this also automagically makes all files to be stored in utf-8 as well,
> and thus - manually viewable and editable w/out any black magic.

correct

> otherwise all national characters became totally unusable in titles and meta
> (they got converted to &#dddd; and then stored as &amp;dddd;),
> and files are stored in latin-1 as well, holding all characters
> 
> now could someone point me:
> what was the reason to put iso-8859-1 there ?

ignorance, at the time.

> will my "fix" break some other aspects ?

make sure you don't mix encodings
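
A small Python illustration of what mixing encodings looks like in practice, independent of Lenya: bytes written as UTF-8 but read back as ISO-8859-1 turn each non-ASCII character into two wrong ones. The sample character is an arbitrary choice.

```python
text = "\u00e9"  # e with acute accent

# Written by a pipeline that serializes as UTF-8 ...
raw = text.encode("utf-8")

# ... but read back by one that assumes ISO-8859-1.
misread = raw.decode("iso-8859-1")

print(len(text), len(misread))  # 1 2
print(misread)                  # Ã©

# Reading with the encoding that was used for writing round-trips cleanly.
assert raw.decode("utf-8") == text
```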

> and if so - where should i also to tune ?

should be fine. actually, most pipelines in 1.2 already use utf-8. the 
only pipelines i found that still use iso are lucene and one in admin.

-- 
Gregor J. Rothfuss
COO, Wyona       Content Management Solutions    http://wyona.com
Apache Lenya                              http://lenya.apache.org
gregor.rothfuss@wyona.com                       gregor@apache.org
