You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by Justin Mason <jm...@jmason.org> on 2004/02/24 19:46:35 UTC
Re: Testing markup tags
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Aleksander Adamowski writes:
> John Hardin wrote:
>
> >ITYM it fools recipes that expect the tag to all be on a single line,
> >which is my point. I don't think SA is actually parsing the HTML (beyond
> >the uri stuff).
> >
> >
> That's a pity, using a standard HTML parsing module from CPAN would
> offload the work involving handling of maliciously malformed HTML syntax
> to that external module and its maintainer.
SpamAssassin does use a std HTML parsing module, HTML::Parser. Plus
it does some work itself to emulate popular MUA behaviour. Not sure
where John got the 'not actually parsing' idea from....
- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS
iD8DBQFAO5wLQTcbUG5Y7woRAjRwAKDT9J56hj4T7RWCGMne4CSBSSphRACfYETa
3MKaKSqRlWFHc3r+Z3ZoU/Y=
=KlDO
-----END PGP SIGNATURE-----