Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2010/03/31 16:27:16 UTC

[Bug 6398] New: SUBJ_ALL_CAPS matches arabic letters in subject when replied

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6398

           Summary: SUBJ_ALL_CAPS matches arabic letters in subject when
                    replied
           Product: Spamassassin
           Version: unspecified
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P5
         Component: Rules
        AssignedTo: dev@spamassassin.apache.org
        ReportedBy: luther.blissett@gmx.net


Created an attachment (id=4729)
 --> (https://issues.apache.org/SpamAssassin/attachment.cgi?id=4729)
Mailheader example

Hi

When a mail is sent with an Arabic subject (e.g. بسم الله الرحمن الرحيم), a reply
or forward causes the SUBJ_ALL_CAPS rule to match (e.g. "AW: بسم الله الرحمن
الرحيم" or "FWD: بسم الله الرحمن الرحيم").

This might also be the case for other languages (e.g. Hindi, Thai).

Kindest regards,
Luther

-- 
Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

[Bug 6398] SUBJ_ALL_CAPS matches arabic letters in subject when replied

Posted by bu...@bugzilla.spamassassin.org.
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6398

Mark Martinec <Ma...@ijs.si> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |DUPLICATE

--- Comment #3 from Mark Martinec <Ma...@ijs.si> 2010-03-31 14:53:28 UTC ---
> Looks like a duplicate of bug 5859...

Indeed. Marking as such.

*** This bug has been marked as a duplicate of bug 5859 ***

[Bug 6398] SUBJ_ALL_CAPS matches arabic letters in subject when replied

Posted by bu...@bugzilla.spamassassin.org.
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6398

John Wilcock <jo...@tradoc.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |john@tradoc.fr

--- Comment #1 from John Wilcock <jo...@tradoc.fr> 2010-03-31 14:41:23 UTC ---
Looks like a duplicate of bug 5859...

[Bug 6398] SUBJ_ALL_CAPS matches arabic letters in subject when replied

Posted by bu...@bugzilla.spamassassin.org.
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6398

Mark Martinec <Ma...@ijs.si> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
          Component|Rules                       |Libraries
           Platform|Other                       |All
            Version|unspecified                 |3.3.1
   Target Milestone|Undefined                   |3.3.2

--- Comment #2 from Mark Martinec <Ma...@ijs.si> 2010-03-31 14:49:25 UTC ---
> When a mail is sent with an Arabic subject (e.g. بسم الله الرحمن الرحيم), a reply
> or forward causes the SUBJ_ALL_CAPS rule to match (e.g. "AW: بسم الله الرحمن
> الرحيم" or "FWD: بسم الله الرحمن الرحيم").
> This might also be the case for other languages (e.g. Hindi, Thai).

Here is the attached header field sample:

  Subject: =?utf-8?Q?AW:_=D8=A7=D9=84=D9=84=D9=87_=D9=83=D8=A8=D8=B1?=
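For reference, the sample above can be decoded with Python's standard
library. This is only an illustration of what the header field contains; it
is not part of SpamAssassin, and the helper name decode_subject is made up
for this sketch.

```python
from email.header import decode_header, make_header

# The encoded-word from the attached sample (RFC 2047, Q encoding, utf-8).
raw = "=?utf-8?Q?AW:_=D8=A7=D9=84=D9=84=D9=87_=D9=83=D8=A8=D8=B1?="


def decode_subject(value: str) -> str:
    """Decode RFC 2047 encoded-words into a Unicode string."""
    return str(make_header(decode_header(value)))


decoded = decode_subject(raw)
# Charset label declared by each encoded-word in the header value.
charsets = [charset for _, charset in decode_header(raw)]

print(decoded)
print(charsets)
```

The decoded text is the two ASCII capitals of the German reply prefix "AW"
followed by Arabic letters, and the only charset label present is utf-8.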

Should CHARSETS_LIKELY_TO_FP_AS_CAPS in Constants.pm include 'utf-8'?

Shouldn't the "=hh" sequences in QP-encoded strings be exempt from this
test entirely?

Also, shouldn't the B-encoded (base64) MIME strings be exempt entirely
from this test?
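To make the questions above concrete, here is a minimal sketch in Python of
how a caps test behaves on this subject once it is decoded. It is not
SpamAssassin's implementation: the function names, the reply-prefix list and
both checks are assumptions made for illustration only.

```python
import base64
import re
from email.header import decode_header, make_header


def decode_subject(value: str) -> str:
    """Decode RFC 2047 encoded-words into a Unicode string."""
    return str(make_header(decode_header(value)))


def strip_reply_prefix(subject: str) -> str:
    """Drop leading reply/forward markers (hypothetical prefix list)."""
    return re.sub(r"^(?:(?:re|aw|fwd?)\s*:\s*)+", "", subject, flags=re.IGNORECASE)


def naive_all_caps(subject: str) -> bool:
    """Assumed naive test: 'all caps' means no ASCII lowercase letter."""
    return re.search(r"[a-z]", subject) is None


def cased_all_caps(subject: str) -> bool:
    """Unicode-aware test: needs a cased letter, and none may be lowercase."""
    return subject.isupper()


qp = "=?utf-8?Q?AW:_=D8=A7=D9=84=D9=84=D9=87_=D9=83=D8=A8=D8=B1?="
text = decode_subject(qp)

# The same text carried as a B-encoded (base64) word decodes identically,
# so a test on the decoded text does not depend on the transfer encoding.
b64 = "=?utf-8?B?" + base64.b64encode(text.encode("utf-8")).decode("ascii") + "?="
same_text = decode_subject(b64) == text

fires_naive = naive_all_caps(text)                      # no [a-z] anywhere
fires_with_prefix = cased_all_caps(text)                # only "AW" is cased
fires_without_prefix = cased_all_caps(strip_reply_prefix(text))

print(same_text, fires_naive, fires_with_prefix, fires_without_prefix)
```

In this sketch the naive test fires on the decoded subject, and the
Unicode-aware test fires only because of the "AW" prefix; once the prefix is
stripped, no cased letters remain and it no longer matches. That is
consistent with the report that the rule hits on replies and forwards.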