You are viewing a plain text version of this content. The canonical link for it is here.
Posted to infrastructure-issues@apache.org by "Greg Stein (JIRA)" <ji...@apache.org> on 2016/06/10 23:47:21 UTC

[jira] [Commented] (INFRA-11880) add charset spam filter

    [ https://issues.apache.org/jira/browse/INFRA-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15325521#comment-15325521 ] 

Greg Stein commented on INFRA-11880:
------------------------------------

Reinforcing the "major" priority (IMO). The bulk of spam to our ASF mailing lists, which lead to moderation are in non-latin character sets. Enabling the charset filter would drop moderation *easily* by 80%.


> add charset spam filter
> -----------------------
>
>                 Key: INFRA-11880
>                 URL: https://issues.apache.org/jira/browse/INFRA-11880
>             Project: Infrastructure
>          Issue Type: Planned Work
>          Components: Mailing Lists
>            Reporter: Greg Stein
>            Assignee: Chris Lambertus
>
> Much of the recent spam to the lists that I moderate use cyrillic or some middle eastern character set (hebrew? arabian?). We do not allow non-English on *most* of our mailing lists, so such messages should have a seriously high spam score. The same would apply to most/all non-latin charsets (eg. also chinese/japanese/etc).
> AOO may have lists where such is allowed, so it may be necessary to have an "escape hatch" for the filter on a per-list basis.
> In email, Chris mentioned that SpamAssassin appears to have some language-based filters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)