Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2010/03/31 16:52:43 UTC

[Bug 5859] SUBJ_ALL_CAPS fires on non-Latin subjects containing a few Latin characters that are all capitals

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=5859

--- Comment #2 from Mark Martinec <Ma...@ijs.si> 2010-03-31 14:52:42 UTC ---
Copied from Bug 6398 comment 2:

Luther:
When sending a mail with an Arabic subject (e.g. بسم الله الرحمن الرحيم), a reply
or forward causes the SUBJ_ALL_CAPS pattern to match (e.g. "AW: بسم الله الرحمن
الرحيم" or "FWD: بسم الله الرحمن الرحيم").

This might also be the case for other languages (e.g. Hindi or Thai).


Mark:
Here is the attached header field sample:
  Subject: =?utf-8?Q?AW:_=D8=A7=D9=84=D9=84=D9=87_=D9=83=D8=A8=D8=B1?=
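To see why this sample can trip an all-caps test, it may help to decode it. The following is a minimal Python sketch using only the standard library, not SpamAssassin's own code; the helper name decode_subject is hypothetical. It decodes the encoded word and collects the ASCII letters, which are the only characters that have a notion of upper or lower case here.

```python
from email.header import decode_header

# The header field value from the sample above.
raw = "=?utf-8?Q?AW:_=D8=A7=D9=84=D9=84=D9=87_=D9=83=D8=A8=D8=B1?="


def decode_subject(value):
    """Decode RFC 2047 encoded words into a single Unicode string."""
    parts = []
    for chunk, charset in decode_header(value):
        if isinstance(chunk, bytes):
            chunk = chunk.decode(charset or "ascii", errors="replace")
        parts.append(chunk)
    return "".join(parts)


subject = decode_subject(raw)

# Only the ASCII letters can be upper or lower case in this subject.
latin_letters = [c for c in subject if c.isascii() and c.isalpha()]

print(subject)
print(latin_letters)
```

The only cased letters in the decoded subject are the "AW" reply prefix, so a test that merely checks for the absence of lowercase Latin letters would see nothing to contradict "all caps".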

Should the CHARSETS_LIKELY_TO_FP_AS_CAPS in Constants.pm include 'utf-8'?

Shouldn't the "=hh" escapes in QP-encoded strings be exempt from this test entirely?

Also, shouldn't the B-encoded (base64) MIME strings be exempt entirely
from this test?
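
As a rough illustration of the last two questions, here is a Python sketch of what such exemptions could look like. It is an assumption-laden sketch, not how SpamAssassin implements SUBJ_ALL_CAPS: the function names caps_test_text and looks_all_caps and the min_letters threshold are all hypothetical. It drops "=hh" escapes from Q-encoded words, drops B-encoded words entirely, and then applies a simple caps check to whatever is left.

```python
import re

# Matches one RFC 2047 encoded word: =?charset?encoding?payload?=
ENCODED_WORD = re.compile(r"=\?([^?]+)\?([bBqQ])\?([^?]*)\?=")


def caps_test_text(raw_subject):
    """Return the text a caps test would see with the proposed exemptions."""
    def strip(match):
        encoding, payload = match.group(2).upper(), match.group(3)
        if encoding == "B":
            # B-encoded (base64) words are exempt entirely.
            return ""
        # Q-encoded: drop the =hh escapes; "_" stands for a space.
        return re.sub(r"=[0-9A-Fa-f]{2}", "", payload).replace("_", " ")
    return ENCODED_WORD.sub(strip, raw_subject)


def looks_all_caps(text, min_letters=4):
    """Hypothetical check: enough ASCII letters, and none of them lowercase."""
    letters = [c for c in text if c.isascii() and c.isalpha()]
    return len(letters) >= min_letters and all(c.isupper() for c in letters)


sample = "=?utf-8?Q?AW:_=D8=A7=D9=84=D9=84=D9=87_=D9=83=D8=A8=D8=B1?="
remaining = caps_test_text(sample)
print(repr(remaining), looks_all_caps(remaining))
```

Note that the exemptions alone still leave the "AW:" prefix behind, so in this sketch it is the assumed minimum letter count that keeps the sample from matching.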
