You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by Robert Nicholson <ro...@elastica.com> on 2007/01/28 02:28:29 UTC

FuzzyOCR not hitting on this at all Fwd: LOG! nuclear defendant

Fuzzy OCR isn't getting any hits on this mail. Anybody know why?

Begin forwarded message:

> From: "ernest.buttiens" <er...@pandora.be>
> Date: January 27, 2007 5:36:40 PM CST
> To: <er...@rp2.tralix.com>
> Cc: <er...@rp2.tralix.com>,  
> <er...@rp2.tralix.com>,  
> <er...@rp2.tralix.com>
> Subject: LOG! nuclear defendant
> Received: from rrcs-24-105-152-45.nyc.biz.rr.com ([24.105.152.45])  
> by kermit.lizardhill.com with esmtp (Exim 4.62) (envelope-from  
> <er...@fedex.com>) id 1HAx9O-000AkY-7M for  
> errorrobert@elastica.com; Sat, 27 Jan 2007 15:40:03 -0800
> Received: from ALPG ([10.90.166.159]) by  
> rrcs-24-105-152-45.nyc.biz.rr.com (8.13.4/8.13.4) with SMTP id  
> g4121914545813e5Br010965 for  
> <er...@rp2.tralix.com>; Sat,  
> 27 Jan 2007 18:42:24 -0500 (CDT) (envelope-from  
> ernest.buttiens@pandora.be)
> Message-Id: <00...@ALPG>
> X-Priority: 3
> X-Msmail-Priority: Normal
> X-Mailer: Microsoft Outlook Express 6.00.2900.3028
> X-Mimeole: Produced By Microsoft MimeOLE V6.00.2900.3028MIME- 
> Version: 1.0
> Content-Type: multipart/mixed; boundary="----------=_45BBE2DA. 
> 1C011582"
> X-Log: Yes
> Lines: 428
>
>     SPAM ignoring because of BAYES_99
>
> From: "ernest.buttiens" <er...@pandora.be>
> Date: January 27, 2007 5:36:40 PM CST
> To: <er...@rp2.tralix.com>
> Cc: <er...@rp2.tralix.com>,  
> <er...@rp2.tralix.com>,  
> <er...@rp2.tralix.com>
> Subject: nuclear defendant
>
>
> hngrbmx ijuwt ntikq gdjln 
> oysc fhwcntd ufyjyit 

Re: FuzzyOCR not hitting on this at all Fwd: LOG! nuclear defendant

Posted by Robert Nicholson <ro...@gmail.com>.
I do notice some faint lines thru the images and I'm wondering if  
this is enough to confuse gocr

Looks like those lines are there to deliberately throw of ocr scanning.

Fuzzy OCR log just shows

[2007-01-27 19:30:42] Debug mode: Starting FuzzyOcr...
[2007-01-27 19:30:42] Debug mode: Attempting to load personal  
wordlist...
[2007-01-27 19:30:42] Debug mode: No personal wordlist found,  
skipping...
[2007-01-27 19:30:42] Debug mode: Analyzing file with content-type  
"image/jpeg"
[2007-01-27 19:30:42] Debug mode: Recognized file type: 2
[2007-01-27 19:30:42] Debug mode: Calculating the image hash...
[2007-01-27 19:30:43] Debug mode: Hash not yet known to the database,  
saving for later db storage...
[2007-01-27 19:30:43] Debug mode: FuzzyOcr ending successfully...

On Jan 27, 2007, at 7:28 PM, Robert Nicholson wrote:

> Fuzzy OCR isn't getting any hits on this mail. Anybody know why?
>
> Begin forwarded message:
>
>> From: "ernest.buttiens" <er...@pandora.be>
>> Date: January 27, 2007 5:36:40 PM CST
>> To: <er...@rp2.tralix.com>
>> Cc:  
>> <er...@rp2.tralix.com>,  
>> <er...@rp2.tralix.com>,  
>> <er...@rp2.tralix.com>
>> Subject: LOG! nuclear defendant
>> Received: from rrcs-24-105-152-45.nyc.biz.rr.com ([24.105.152.45])  
>> by kermit.lizardhill.com with esmtp (Exim 4.62) (envelope-from  
>> <er...@fedex.com>) id 1HAx9O-000AkY-7M for  
>> errorrobert@elastica.com; Sat, 27 Jan 2007 15:40:03 -0800
>> Received: from ALPG ([10.90.166.159]) by  
>> rrcs-24-105-152-45.nyc.biz.rr.com (8.13.4/8.13.4) with SMTP id  
>> g4121914545813e5Br010965 for  
>> <er...@rp2.tralix.com>; Sat,  
>> 27 Jan 2007 18:42:24 -0500 (CDT) (envelope-from  
>> ernest.buttiens@pandora.be)
>> Message-Id: <00...@ALPG>
>> X-Priority: 3
>> X-Msmail-Priority: Normal
>> X-Mailer: Microsoft Outlook Express 6.00.2900.3028
>> X-Mimeole: Produced By Microsoft MimeOLE V6.00.2900.3028MIME- 
>> Version: 1.0
>> Content-Type: multipart/mixed; boundary="----------=_45BBE2DA. 
>> 1C011582"
>> X-Log: Yes
>> Lines: 428
>>
>>     SPAM ignoring because of BAYES_99
>>
>> From: "ernest.buttiens" <er...@pandora.be>
>> Date: January 27, 2007 5:36:40 PM CST
>> To: <er...@rp2.tralix.com>
>> Cc:  
>> <er...@rp2.tralix.com>,  
>> <er...@rp2.tralix.com>,  
>> <er...@rp2.tralix.com>
>> Subject: nuclear defendant
>>
>>
>> hngrbmx ijuwt ntikq gdjln
> <cheese wheel.jpg>
>>
>> oysc fhwcntd ufyjyit
> <cheese wheel.jpg>
>


Re: FuzzyOCR not hitting on this at all Fwd: LOG! nuclear defendant

Posted by Robert Nicholson <ro...@gmail.com>.
Well I cannot see why for myself can I? I mean in this case it's  
simply not matching any of the words when they are clearly visible.

On Jan 27, 2007, at 8:16 PM, René Berber wrote:

> Robert Nicholson wrote:
>
>> Fuzzy OCR isn't getting any hits on this mail. Anybody know why?
> [snip]
>
> You can see for yourself, use `spamassassin -x -t -D FuzzyOcr <  
> sample.eml`.
> -- 
> René Berber
>


Re: FuzzyOCR not hitting on this at all Fwd: LOG! nuclear defendant

Posted by René Berber <r....@computer.org>.
Robert Nicholson wrote:

> Fuzzy OCR isn't getting any hits on this mail. Anybody know why?
[snip]

You can see for yourself, use `spamassassin -x -t -D FuzzyOcr < sample.eml`.
-- 
René Berber