You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@spamassassin.apache.org by "Peter H. Lemieux" <ph...@cyways.com> on 2006/10/16 15:47:26 UTC

FuzzyOCR (and gocr) can't detect HGH spams

I get a lot of messages with a gif ad for HGH drugs with this image: 
http://www.crystalmail.net/hgh.gif.  FuzzyOCR doesn't return anything 
because gocr doesn't show any text.  I've tried various -i settings for 
gocr from 1 to 254 and get gibberish at all settings.

For instance, 'gocr -i 180 hgh.gif' yields:

     lI__c_tc)r _rc_hc_rihc_Ll _cnLl .h1c_Llic_;cll_ _u__c_c __ihc LI
          l c htc)hlc_rc)c_c_ B llr_ll l hc r_cp_


     _ t4____ __cc_'un ic) __'ri_c _ hH3s, t_k   _ ,r o_E,y _h K E,_
     _ ,_ics r _ sncu)._r. t.ihk). lhirkrr x_))  '   gg __, r
     _ Krvc)_H t)r r_irk cct .__             _
                          O _' Y O ___ TE_ E
      _Lncl nLnn __ mc)R hnrtb

Results at other -i settings are about the same.

System is CentOS 4.3
gocr is at version 0.37 (from rpmforge)
netpbm is version 10.25

Any hints?

Peter