You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by Ronnie Tartar <rt...@symbiostech.com> on 2006/12/22 20:26:32 UTC

FuzzyOcr questions

I have a Qmail Toaster setup.  I have everything working except the fuzzyocr.  Should it have information in the header about being scanned?

Here is a header but I don't see the fuzzyocr plugin working

X-Spam-Status: Yes, score=10.5 required=1.0 tests=EXTRA_MPART_TYPE,
     HTML_IMAGE_ONLY_16,HTML_MESSAGE,HTML_SHORT_LINK_IMG_2,INVALID_TZ_GMT,
     MY_CID_AND_ARIAL2,MY_CID_AND_CLOSING,MY_CID_AND_STYLE,
     MY_CID_ARIAL2_CLOSING,MY_CID_ARIAL_STYLE,SARE_GIF_ATTACH,
     SARE_GIF_STOX autolearn=no version=3.1.7
X-Spam-Report: 
     * 1.1 INVALID_TZ_GMT Invalid date in header (wrong GMT/UTC timezone)
     * 0.8 EXTRA_MPART_TYPE Header has extraneous Content-type:...type= entry
     * 0.0 HTML_MESSAGE BODY: HTML included in message
     * 0.6 HTML_IMAGE_ONLY_16 BODY: HTML: images with 1200-1600 bytes of words
     * 0.8 SARE_GIF_ATTACH FULL: Email has a inline gif
     * 1.1 MY_CID_ARIAL_STYLE SARE cid arial2 style
     * 1.0 HTML_SHORT_LINK_IMG_2 HTML is very short with a linked image
     * 0.9 MY_CID_AND_CLOSING SARE cid and closing
     * 0.7 MY_CID_AND_STYLE SARE cid and style
     * 0.7 MY_CID_AND_ARIAL2 SARE CID and Arial2
     * 1.2 MY_CID_ARIAL2_CLOSING SARE cid arial2 closing
     * 1.7 SARE_GIF_STOX Inline Gif with little HTML

spamassassin -D --lint  shows fuzzyocr loading?

[2261] dbg: plugin: fixed relative path: /etc/mail/spamassassin/FuzzyOcr.pm
[2261] dbg: plugin: loading FuzzyOcr from /etc/mail/spamassassin/FuzzyOcr.pm
[2261] dbg: plugin: registered FuzzyOcr=HASH(0xb305908)
[2261] dbg: plugin: FuzzyOcr=HASH(0xb305908) implements 'parse_config'
[2261] dbg: FuzzyOcr: Option verbose = 1
[2261] dbg: FuzzyOcr: Option logfile = /etc/mail/spamassassin/FuzzyOcr.log
[2261] dbg: FuzzyOcr: Option global_wordlist = /etc/mail/spamassassin/FuzzyOcr.words
[2261] dbg: FuzzyOcr: Valid search path: /usr/local/bin
[2261] dbg: FuzzyOcr: Valid search path: /usr/bin
[2261] dbg: config: allowing user rules!
[2261] dbg: plugin: Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0xac9d804) implements 'finish_parsing_end'
[2261] dbg: plugin: FuzzyOcr=HASH(0xb305908) implements 'finish_parsing_end'
[2261] dbg: replacetags: replacing tags
[2261] dbg: replacetags: done replacing tags
[2261] dbg: FuzzyOcr: Using gifsicle => /usr/bin/gifsicle
[2261] dbg: FuzzyOcr: Cannot find executable for giffix
[2261] dbg: FuzzyOcr: Cannot find executable for giftext
[2261] dbg: FuzzyOcr: Cannot find executable for gifinter
[2261] dbg: FuzzyOcr: Cannot find executable for giftopnm
[2261] dbg: FuzzyOcr: Cannot find executable for jpegtopnm
[2261] dbg: FuzzyOcr: Cannot find executable for pngtopnm
[2261] dbg: FuzzyOcr: Cannot find executable for bmptopnm
[2261] dbg: FuzzyOcr: Cannot find executable for tifftopnm
[2261] dbg: FuzzyOcr: Cannot find executable for ppmhist
[2261] dbg: FuzzyOcr: Cannot find executable for pamfile
[2261] dbg: FuzzyOcr: Using gocr => /usr/local/bin/gocr
[2261] dbg: FuzzyOcr: Using ocrad => /usr/local/bin/ocrad
[2261] dbg: FuzzyOcr: Loaded <49> words from "/etc/mail/spamassassin/FuzzyOcr.words"
[2261] dbg: FuzzyOcr: Using scan: $gocr -i $pfile
[2261] dbg: FuzzyOcr: Using scan: $gocr -l 180 -d 2 -i $pfile


Any help would be greatly appreciated.

Thanks

Re: FuzzyOcr questions

Posted by decoder <de...@own-hero.net>.
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Ronnie Tartar wrote:
> I have a Qmail Toaster setup.  I have everything working except the
>  fuzzyocr.  Should it have information in the header about being
> scanned?
>
> Here is a header but I don't see the fuzzyocr plugin working
>
> *X-Spam-Status:* Yes, score=10.5 required=1.0
> tests=EXTRA_MPART_TYPE,
>
> HTML_IMAGE_ONLY_16,HTML_MESSAGE,HTML_SHORT_LINK_IMG_2,INVALID_TZ_GMT,
>  MY_CID_AND_ARIAL2,MY_CID_AND_CLOSING,MY_CID_AND_STYLE,
> MY_CID_ARIAL2_CLOSING,MY_CID_ARIAL_STYLE,SARE_GIF_ATTACH,
> SARE_GIF_STOX autolearn=no version=3.1.7 *X-Spam-Report:* * 1.1
> INVALID_TZ_GMT Invalid date in header (wrong GMT/UTC timezone) *
> 0.8 EXTRA_MPART_TYPE Header has extraneous Content-type:...type=
> entry * 0.0 HTML_MESSAGE BODY: HTML included in message * 0.6
> HTML_IMAGE_ONLY_16 BODY: HTML: images with 1200-1600 bytes of words
>  * 0.8 SARE_GIF_ATTACH FULL: Email has a inline gif * 1.1
> MY_CID_ARIAL_STYLE SARE cid arial2 style * 1.0
> HTML_SHORT_LINK_IMG_2 HTML is very short with a linked image * 0.9
> MY_CID_AND_CLOSING SARE cid and closing * 0.7 MY_CID_AND_STYLE SARE
> cid and style * 0.7 MY_CID_AND_ARIAL2 SARE CID and Arial2 * 1.2
> MY_CID_ARIAL2_CLOSING SARE cid arial2 closing * 1.7 SARE_GIF_STOX
> Inline Gif with little HTML spamassassin -D --lint  shows fuzzyocr
> loading?
>
> [2261] dbg: plugin: fixed relative path:
> /etc/mail/spamassassin/FuzzyOcr.pm [2261] dbg: plugin: loading
> FuzzyOcr from /etc/mail/spamassassin/FuzzyOcr.pm [2261] dbg:
> plugin: registered FuzzyOcr=HASH(0xb305908) [2261] dbg: plugin:
> FuzzyOcr=HASH(0xb305908) implements 'parse_config' [2261] dbg:
> FuzzyOcr: Option verbose = 1 [2261] dbg: FuzzyOcr: Option logfile =
>  /etc/mail/spamassassin/FuzzyOcr.log [2261] dbg: FuzzyOcr: Option
> global_wordlist = /etc/mail/spamassassin/FuzzyOcr.words [2261] dbg:
> FuzzyOcr: Valid search path: /usr/local/bin [2261] dbg: FuzzyOcr:
> Valid search path: /usr/bin [2261] dbg: config: allowing user
> rules! [2261] dbg: plugin:
> Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0xac9d804) implements
> 'finish_parsing_end' [2261] dbg: plugin: FuzzyOcr=HASH(0xb305908)
> implements 'finish_parsing_end' [2261] dbg: replacetags: replacing
> tags [2261] dbg: replacetags: done replacing tags [2261] dbg:
> FuzzyOcr: Using gifsicle => /usr/bin/gifsicle [2261] dbg: FuzzyOcr:
> Cannot find executable for giffix [2261] dbg: FuzzyOcr: Cannot find
> executable for giftext [2261] dbg: FuzzyOcr: Cannot find executable
> for gifinter [2261] dbg: FuzzyOcr: Cannot find executable for
> giftopnm [2261] dbg: FuzzyOcr: Cannot find executable for jpegtopnm
>  [2261] dbg: FuzzyOcr: Cannot find executable for pngtopnm [2261]
> dbg: FuzzyOcr: Cannot find executable for bmptopnm [2261] dbg:
> FuzzyOcr: Cannot find executable for tifftopnm [2261] dbg:
> FuzzyOcr: Cannot find executable for ppmhist [2261] dbg: FuzzyOcr:
> Cannot find executable for pamfile

Can't you read? You need to tell the plugin where those binaries are
located, if they are not in the standard locations. Did you even
satisfy the dependencies?


Chris

> [2261] dbg: FuzzyOcr: Using gocr => /usr/local/bin/gocr [2261] dbg:
> FuzzyOcr: Using ocrad => /usr/local/bin/ocrad [2261] dbg: FuzzyOcr:
> Loaded <49> words from "/etc/mail/spamassassin/FuzzyOcr.words"
> [2261] dbg: FuzzyOcr: Using scan: $gocr -i $pfile [2261] dbg:
> FuzzyOcr: Using scan: $gocr -l 180 -d 2 -i $pfile
>
> Any help would be greatly appreciated.
>
> Thanks

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFjDOzJQIKXnJyDxURAjAVAKCWT4V1yhFl4kyHoIzRCKHJQLnsQgCePc1A
gwbjOF8+3Se2F8wafm7iuJc=
=BZ1h
-----END PGP SIGNATURE-----


RE: FuzzyOcr questions

Posted by Gary V <mr...@hotmail.com>.
>I have a Qmail Toaster setup.  I have everything working except the 
>fuzzyocr.  Should it have information in the header about being scanned?
>
>Here is a header but I don't see the fuzzyocr plugin working
>
>X-Spam-Status: Yes, score=10.5 required=1.0 tests=EXTRA_MPART_TYPE,
>      HTML_IMAGE_ONLY_16,HTML_MESSAGE,HTML_SHORT_LINK_IMG_2,INVALID_TZ_GMT,
>      MY_CID_AND_ARIAL2,MY_CID_AND_CLOSING,MY_CID_AND_STYLE,
>      MY_CID_ARIAL2_CLOSING,MY_CID_ARIAL_STYLE,SARE_GIF_ATTACH,
>      SARE_GIF_STOX autolearn=no version=3.1.7
>X-Spam-Report:
>      * 1.1 INVALID_TZ_GMT Invalid date in header (wrong GMT/UTC timezone)
>      * 0.8 EXTRA_MPART_TYPE Header has extraneous Content-type:...type= 
>entry
>      * 0.0 HTML_MESSAGE BODY: HTML included in message
>      * 0.6 HTML_IMAGE_ONLY_16 BODY: HTML: images with 1200-1600 bytes of 
>words
>      * 0.8 SARE_GIF_ATTACH FULL: Email has a inline gif
>      * 1.1 MY_CID_ARIAL_STYLE SARE cid arial2 style
>      * 1.0 HTML_SHORT_LINK_IMG_2 HTML is very short with a linked image
>      * 0.9 MY_CID_AND_CLOSING SARE cid and closing
>      * 0.7 MY_CID_AND_STYLE SARE cid and style
>      * 0.7 MY_CID_AND_ARIAL2 SARE CID and Arial2
>      * 1.2 MY_CID_ARIAL2_CLOSING SARE cid arial2 closing
>      * 1.7 SARE_GIF_STOX Inline Gif with little HTML
>
>spamassassin -D --lint  shows fuzzyocr loading?
>
>[2261] dbg: plugin: fixed relative path: /etc/mail/spamassassin/FuzzyOcr.pm
>[2261] dbg: plugin: loading FuzzyOcr from 
>/etc/mail/spamassassin/FuzzyOcr.pm
>[2261] dbg: plugin: registered FuzzyOcr=HASH(0xb305908)
>[2261] dbg: plugin: FuzzyOcr=HASH(0xb305908) implements 'parse_config'
>[2261] dbg: FuzzyOcr: Option verbose = 1
>[2261] dbg: FuzzyOcr: Option logfile = /etc/mail/spamassassin/FuzzyOcr.log
>[2261] dbg: FuzzyOcr: Option global_wordlist = 
>/etc/mail/spamassassin/FuzzyOcr.words
>[2261] dbg: FuzzyOcr: Valid search path: /usr/local/bin
>[2261] dbg: FuzzyOcr: Valid search path: /usr/bin
>[2261] dbg: config: allowing user rules!
>[2261] dbg: plugin: Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0xac9d804) 
>implements 'finish_parsing_end'
>[2261] dbg: plugin: FuzzyOcr=HASH(0xb305908) implements 
>'finish_parsing_end'
>[2261] dbg: replacetags: replacing tags
>[2261] dbg: replacetags: done replacing tags
>[2261] dbg: FuzzyOcr: Using gifsicle => /usr/bin/gifsicle
>[2261] dbg: FuzzyOcr: Cannot find executable for giffix
>[2261] dbg: FuzzyOcr: Cannot find executable for giftext
>[2261] dbg: FuzzyOcr: Cannot find executable for gifinter
>[2261] dbg: FuzzyOcr: Cannot find executable for giftopnm
>[2261] dbg: FuzzyOcr: Cannot find executable for jpegtopnm
>[2261] dbg: FuzzyOcr: Cannot find executable for pngtopnm
>[2261] dbg: FuzzyOcr: Cannot find executable for bmptopnm
>[2261] dbg: FuzzyOcr: Cannot find executable for tifftopnm
>[2261] dbg: FuzzyOcr: Cannot find executable for ppmhist
>[2261] dbg: FuzzyOcr: Cannot find executable for pamfile
>[2261] dbg: FuzzyOcr: Using gocr => /usr/local/bin/gocr
>[2261] dbg: FuzzyOcr: Using ocrad => /usr/local/bin/ocrad
>[2261] dbg: FuzzyOcr: Loaded <49> words from 
>"/etc/mail/spamassassin/FuzzyOcr.words"
>[2261] dbg: FuzzyOcr: Using scan: $gocr -i $pfile
>[2261] dbg: FuzzyOcr: Using scan: $gocr -l 180 -d 2 -i $pfile
>
>
>Any help would be greatly appreciated.
>
>Thanks

FuzzyOcr is disabled when mail scores above focr_autodisable_score. Check 
this setting in FuzzyOcr.cf. Set to a high level for testing purposes, but 
put it back down afterwards to save processing time and energy.

Gary V

_________________________________________________________________
>From photos to predictions, The MSN Entertainment Guide to Golden Globes has 
it all. http://tv.msn.com/tv/globes2007/