Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2011/05/09 08:57:16 UTC

[Bug 6584] New: RTF parser for rendered body

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6584

             Bug #: 6584
           Summary: RTF parser for rendered body
           Product: Spamassassin
           Version: SVN Trunk (Latest Devel Version)
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: P2
         Component: Libraries
        AssignedTo: dev@spamassassin.apache.org
        ReportedBy: hege@hege.li
    Classification: Unclassified


While working on bug 6582, I found many messages with text/rtf and
text/richtext attachments. These are (or were) also popular in spam.

Such RTF parts usually include _lots_ of formatting and may have large images
or similar objects embedded. It serves no purpose to scan that redundant data
in the rendered body array (we can use rawbody if desired).

Basic tag stripping could be done with just a few regexes; there are lots of
examples and CPAN modules around.
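
As a rough illustration of the "few regexes" idea, here is a minimal sketch.
It is written in Python for brevity and is not SpamAssassin code; the function
name rtf_to_text, the list of skipped destinations, and the cp1252 decoding of
\'hh escapes are all assumptions made for the example. It ignores \u Unicode
escapes and \bin data, so it is a starting point rather than a real parser.

```python
import re

# Escaped literals (\\, \{, \}) are swapped for placeholders so that the
# brace matching below only ever sees real group delimiters.
_ESCAPE = {"\\": "\x00", "{": "\x01", "}": "\x02"}
_UNESCAPE = str.maketrans({"\x00": "\\", "\x01": "{", "\x02": "}"})

# Innermost group starting with \* or a destination that holds no body text.
# The destination list is an assumed, partial one.
_SKIP_GROUP = re.compile(
    r"\{\\(?:\*|(?:fonttbl|colortbl|stylesheet|info|pict|object|header|footer)"
    r"(?![a-z]))[^{}]*\}"
)
# Any other innermost group: its content is kept, the braces are dropped.
_PLAIN_GROUP = re.compile(r"\{([^{}]*)\}")
# Control words, \'hh hex escapes, other control symbols, raw CR/LF, braces.
_CONTROL = re.compile(
    r"\\([a-z]+)(-?\d+)? ?|\\'([0-9a-f]{2})|\\(.)|[{}\r\n]+", re.I | re.S
)
_BREAKS = {"par": "\n", "line": "\n", "tab": "\t"}


def _control(match):
    word, _param, hexcode, _symbol = match.groups()
    if word:
        return _BREAKS.get(word.lower(), "")
    if hexcode:
        # Assumption: the document uses the Windows-1252 code page.
        return bytes.fromhex(hexcode).decode("cp1252", "replace")
    return ""  # other control symbols, raw newlines, stray braces


def rtf_to_text(rtf):
    text = re.sub(r"\\([\\{}])", lambda m: _ESCAPE[m.group(1)], rtf)
    while True:
        # Drop non-text groups first; only unwrap once none are left, so that
        # text inside a skipped destination never leaks into the output.
        text, skipped = _SKIP_GROUP.subn("", text)
        if skipped:
            continue
        text, unwrapped = _PLAIN_GROUP.subn(lambda m: m.group(1), text)
        if not unwrapped:
            break
    text = _CONTROL.sub(_control, text)
    return text.translate(_UNESCAPE).strip()


sample = (
    r"{\rtf1\ansi{\fonttbl{\f0 Arial;}}"
    r"{\*\generator Example;}"
    r"\f0\fs20 Hello \b world\b0 !\par Caf\'e9 \{ok\}}"
)
print(rtf_to_text(sample))
```

The group pass runs before the control-word pass so that font tables, embedded
pictures and similar destinations are removed whole instead of leaving their
payload behind as text.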
