You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2004/11/15 01:24:48 UTC

[Bug 3111] RFE: rule to look for repeated URIs in body

http://bugzilla.spamassassin.org/show_bug.cgi?id=3111

felicity@kluge.net changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|RFE: rule to look for       |RFE: rule to look for
                   |repeated URIs in body       |repeated URIs in body



------- Additional Comments From felicity@kluge.net  2004-11-14 16:24 -------
I think the supposition is invalid.  I've seen plenty of legit newsletters which repeat the same URL 
multiple times.

There's also a number of issues:

- checking for the same exact URL is open to easy circumvention, either random hostnames or ?
id=%RNDCHAR, etc.
- limiting to the hostname part is open to the same issue and will FP like mad
- limiting to domain is just going to FP like mad

So I think we can test for same URL, but I think if it's useful now, it won't be after we add it as a rule.

It's be pretty easy in Util::uri_list_canonify() to count the number of duplicates, and/or come up with 
data about how many were listed 1 time, 2 times, 3 times, etc.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.