Posted to users@cocoon.apache.org by Gernot Koller <gr...@gmx.at> on 2003/06/20 17:58:28 UTC

HTML Generator, JTidy, charencoding

Hi!

I'm reading HTML pages from various URLs and converting them to XHTML 
using JTidy (as the HTML Generator does). I know that I can configure 
JTidy to use a certain encoding by calling, for example, 
setCharEncoding(Configuration.UTF8);.
My problem is that the character encoding is very often specified only 
within the HTML document itself, using tags like 
<meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">.
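
A rough sketch of one possible workaround, using only the JDK: peek at the first few kilobytes of the page, look for a charset declaration in a meta tag, and rewind the stream before handing it to the parser. The class name MetaCharsetSniffer, the method name sniff, and the 4096-byte limit are invented for illustration. Nothing here is part of JTidy or Cocoon, and translating the returned name into whatever encoding setting JTidy expects is left open.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of JTidy or Cocoon.
public class MetaCharsetSniffer {
    // How many leading bytes to inspect; an arbitrary choice for this sketch.
    private static final int PEEK_LIMIT = 4096;

    // Matches charset=NAME inside a <meta ...> tag, with or without quotes.
    private static final Pattern CHARSET = Pattern.compile(
            "<meta[^>]*charset\\s*=\\s*[\"']?([A-Za-z0-9._:-]+)",
            Pattern.CASE_INSENSITIVE);

    /** Returns the charset name declared in the given bytes, or the fallback. */
    public static String sniff(byte[] head, String fallback) {
        int len = Math.min(head.length, PEEK_LIMIT);
        // ISO-8859-1 maps every byte to one char, so this decoding cannot fail
        // and the ASCII-only meta tag stays readable.
        String text = new String(head, 0, len, StandardCharsets.ISO_8859_1);
        Matcher m = CHARSET.matcher(text);
        return m.find() ? m.group(1) : fallback;
    }

    /** Peeks at a mark-supporting stream, then rewinds it to where it started. */
    public static String sniff(InputStream in, String fallback) throws IOException {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("wrap the stream in a BufferedInputStream");
        }
        in.mark(PEEK_LIMIT);
        byte[] buf = new byte[PEEK_LIMIT];
        int total = 0;
        while (total < buf.length) {
            int n = in.read(buf, total, buf.length - total);
            if (n < 0) {
                break; // end of stream before the limit was reached
            }
            total += n;
        }
        in.reset(); // the parser can now read the document from the beginning
        byte[] head = new byte[total];
        System.arraycopy(buf, 0, head, 0, total);
        return sniff(head, fallback);
    }

    public static void main(String[] args) throws IOException {
        String html = "<html><head><meta http-equiv=\"content-type\" "
                + "content=\"text/html; charset=ISO-8859-1\"></head><body/></html>";
        InputStream in = new BufferedInputStream(
                new ByteArrayInputStream(html.getBytes(StandardCharsets.ISO_8859_1)));
        System.out.println(sniff(in, "UTF-8")); // prints ISO-8859-1
    }
}
```

This ignores the HTTP Content-Type header and byte order marks, and it assumes the declaration appears within the first 4096 bytes, so it is at best a partial answer.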

Any tricks for solving this problem?

thx,

Gernot


-- 
DI Gernot Koller
