You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2007/03/07 14:26:11 UTC

[Bug 5369] New: HTML_TINY_FONT too broad

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5369

           Summary: HTML_TINY_FONT too broad
           Product: Spamassassin
           Version: 3.1.8
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: normal
          Priority: P5
         Component: Rules
        AssignedTo: dev@spamassassin.apache.org
        ReportedBy: achowe@snert.com


20_html_tests.cf:rawbody HTML_TINY_FONT /\<.*font\-size\:[ \"]*[01][^0-9]+.*\>/i
20_html_tests.cf:describe HTML_TINY_FONT        body contains 1 or 0-point font
50_scores.cf:score HTML_TINY_FONT 2.607 1.425 3.393 2.324


I'm seeing lots of FP related to HTML_TINY_FONT (score 3.4) driving the score
up. Either the score is too high or the pattern to broad. For example 

<div align="center" style="font-size:0.8em;">

is perfectly visible print, but triggers HTML_TINY_FONT. 

I would suggest refining the pattern. One possible would be

/\<.*font\-size\:[ \"]*(?:1pt|0(?:\.[01][0-9]*)?[^.0-9]+).*\>/i

The above should match

0pt
1pt
0em
0.0em
0.0123em
0.1em

while exclude

2pt    (is it readable?)
3pt    (is it readable?)
0.2em  (is it readable?)
0.8em  (readable)



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 5369] HTML_TINY_FONT too broad

Posted by bu...@bugzilla.spamassassin.org.
http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5369


jm@jmason.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED
   Target Milestone|Undefined                   |3.2.0




------- Additional Comments From jm@jmason.org  2007-03-08 11:20 -------
fixed as of 3.2.0; HTML_TINY_FONT didn't meet quality thresholds and was thrown out.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 5369] HTML_TINY_FONT too broad

Posted by bu...@bugzilla.spamassassin.org.
http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5369


gfk@logidac.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |gfk@logidac.com




------- Additional Comments From gfk@logidac.com  2007-03-29 08:06 -------
*** Bug 5394 has been marked as a duplicate of this bug. ***



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.