Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2004/11/15 01:24:48 UTC
[Bug 3111] RFE: rule to look for repeated URIs in body
http://bugzilla.spamassassin.org/show_bug.cgi?id=3111
felicity@kluge.net changed:
What    |Removed                     |Added
----------------------------------------------------------------------------
Summary |RFE: rule to look for       |RFE: rule to look for
        |repeated URIs in body       |repeated URIs in body
------- Additional Comments From felicity@kluge.net 2004-11-14 16:24 -------
I think the supposition is invalid. I've seen plenty of legit newsletters which repeat the same URL
multiple times.
There are also a number of issues:
- checking for the exact same URL is open to easy circumvention, either random hostnames or ?id=%RNDCHAR, etc.
- limiting to the hostname part is open to the same issue and will FP like mad
- limiting to domain is just going to FP like mad
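To make the three trade-offs above concrete, here is a rough sketch in Python (not SpamAssassin code). The function names, the "exact"/"host"/"domain" levels, and the sample URLs are all made up for illustration, and the "domain" reduction is a naive last-two-labels guess that gets things like co.uk wrong.

```python
from collections import Counter
from urllib.parse import urlsplit


def uri_key(uri, level):
    """Reduce a URI to the key used for duplicate counting.

    level is one of "exact", "host", or "domain" (hypothetical names).
    """
    if level == "exact":
        return uri
    host = (urlsplit(uri).hostname or "").lower()
    if level == "host":
        return host
    if level == "domain":
        # Naive assumption: the registered domain is the last two labels.
        return ".".join(host.split(".")[-2:])
    raise ValueError(f"unknown level: {level}")


def max_repeats(uris, level):
    """Return the highest repeat count among the URIs at the given level."""
    counts = Counter(uri_key(u, level) for u in uris)
    return max(counts.values(), default=0)


# Spam that randomizes both the hostname and a query parameter.
spammy = [
    "http://a1.example.com/buy?id=x7f",
    "http://b2.example.com/buy?id=q9z",
    "http://c3.example.com/buy?id=m2k",
]

# A legitimate newsletter linking to its own site several times.
newsletter = [
    "http://news.example.org/issue/1",
    "http://news.example.org/issue/2",
    "http://www.example.org/unsubscribe",
]

for level in ("exact", "host", "domain"):
    print(level, max_repeats(spammy, level), max_repeats(newsletter, level))
```

With these samples, the randomized spam never repeats at the exact or hostname level, and at the domain level it scores the same as the newsletter, which is the circumvention and FP problem described above.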
So I think we can test for the same URL, but I think even if it's useful now, it won't be after we add it as a rule.
It'd be pretty easy in Util::uri_list_canonify() to count the number of duplicates, and/or come up with
data about how many were listed 1 time, 2 times, 3 times, etc.
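As a rough illustration of the kind of data meant here, a sketch in Python (not the Perl in Util::uri_list_canonify(), whose internals are not shown in this message). The duplicate_stats name and the sample URIs are hypothetical, and the input is assumed to be a plain list of already-extracted URI strings.

```python
from collections import Counter


def duplicate_stats(uris):
    """Count duplicates and build a histogram of repeat counts.

    Returns (duplicates, histogram), where duplicates is the number of
    URIs beyond the first occurrence of each distinct URI, and
    histogram maps "times listed" to "number of distinct URIs".
    """
    per_uri = Counter(uris)
    duplicates = sum(n - 1 for n in per_uri.values())
    histogram = Counter(per_uri.values())
    return duplicates, dict(sorted(histogram.items()))


# Hypothetical sample: one URI listed 3 times, one twice, one once.
uris = [
    "http://example.com/a",
    "http://example.com/a",
    "http://example.com/a",
    "http://example.com/b",
    "http://example.org/",
    "http://example.org/",
]

duplicates, histogram = duplicate_stats(uris)
print(duplicates)  # 3
print(histogram)   # {1: 1, 2: 1, 3: 1}
```

Collecting the histogram over a ham and spam corpus would show whether repeat counts separate the two at all before anyone commits to a rule.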