Posted to users@spamassassin.apache.org by Justin Mason <jm...@jmason.org> on 2004/02/24 19:46:35 UTC

Re: Testing markup tags

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Aleksander Adamowski writes:
> John Hardin wrote:
> 
> >ITYM it fools recipes that expect the tag to all be on a single line,
> >which is my point. I don't think SA is actually parsing the HTML (beyond
> >the uri stuff).
> >  
> >
> That's a pity, using a standard HTML parsing module from CPAN would 
> offload the work involving handling of maliciously malformed HTML syntax 
> to that external module and its maintainer.

SpamAssassin does use a std HTML parsing module, HTML::Parser.   Plus
it does some work itself to emulate popular MUA behaviour.  Not sure
where John got the 'not actually parsing' idea from....
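
To illustrate the difference being discussed, here is a minimal sketch of
why a real parser is not fooled by a tag split across lines, while a
line-oriented recipe is. It uses Python's standard html.parser purely for
illustration; it is not HTML::Parser and not SpamAssassin's code. The
sample markup, the regex, and the names SAMPLE, LINE_RULE, TagCollector,
line_rule_hits and parsed_tags are all assumptions made up for this
example.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample: an <a> tag deliberately split across several lines.
SAMPLE = '<a\n  href="http://example.com/"\n>click</a>\n'

# Hypothetical line-oriented rule that expects the whole tag on one line.
LINE_RULE = re.compile(r'<a\s[^>]*href=', re.IGNORECASE)


def line_rule_hits(text):
    """Return the lines that the single-line rule matches."""
    return [line for line in text.splitlines() if LINE_RULE.search(line)]


class TagCollector(HTMLParser):
    """Collect start tags and their attributes as the parser sees them."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append((tag, dict(attrs)))


def parsed_tags(text):
    """Parse the text as HTML and return the collected start tags."""
    collector = TagCollector()
    collector.feed(text)
    collector.close()
    return collector.tags


if __name__ == "__main__":
    print("line rule hits:", line_rule_hits(SAMPLE))
    print("parsed tags:   ", parsed_tags(SAMPLE))
```

The line rule finds nothing in the sample because no single line contains
both the tag name and the href attribute, whereas the parser reports one
"a" start tag with its href regardless of where the line breaks fall.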

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFAO5wLQTcbUG5Y7woRAjRwAKDT9J56hj4T7RWCGMne4CSBSSphRACfYETa
3MKaKSqRlWFHc3r+Z3ZoU/Y=
=KlDO
-----END PGP SIGNATURE-----