You are viewing a plain text version of this content. The canonical link for it is here.
Posted to ruleqa@spamassassin.apache.org by "Kevin A. McGrail" <KM...@PCCC.com> on 2012/08/12 05:39:57 UTC

Rules published

Woohoo! We had 20 masscheckers and met the threshold!

  HAM CONTRIBUTORS FOUND: 20 (required 10)
SPAM CONTRIBUTORS FOUND: 20 (required 10)


  HAM: 156383 (150000 required)
SPAM: 341595 (150000 required)



Regards,
KAM



Re: Rules published

Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
On 8/13/2012 1:00 PM, Benny Pedersen wrote:
> Den 2012-08-13 17:28, Kevin A. McGrail skrev:
>
>> I'm sorry Benny but I don't understand what you mean with either of
>> these statements?
>
> you said that i am not a developper, now i sent you 2 patch files for 
> spamassassin ?, unfair ?

I really don't know what you are talking about with this statement either.

You wrote: "who will run dos2unix on 72_active.cf now ? "

What does this mean?  Do you see an issue with 72_active.cf?  Is there a 
CRLF issue you think is causing issues or annoyance?


You also wrote: "you said that i am not a developper, now i sent you 2 
patch files for spamassassin ?, unfair ? "

Please add more context.  I work with a lot of people and don't have a 
perfect memory.

However, I have no memory of ever saying you were not a developer and 
honestly haven't seen your patches so I don't know what you are talking 
about.  BUT as an open source project, we welcome people to submit 
patches and contribute to the project.  If any statement I made was 
taken otherwise, I apologize.  But I think perhaps you are confusing me 
with someone else?

Regards,
KAM





Re: Rules published

Posted by Benny Pedersen <me...@junc.org>.
Den 2012-08-13 17:28, Kevin A. McGrail skrev:

> I'm sorry Benny but I don't understand what you mean with either of
> these statements?

you said that i am not a developper, now i sent you 2 patch files for 
spamassassin ?, unfair ?




Re: Rules published

Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
On 8/12/2012 1:35 PM, Benny Pedersen wrote:
> Den 2012-08-12 05:39, Kevin A. McGrail skrev:
>> Woohoo! We had 20 masscheckers and met the threshold!
>
> who will run dos2unix on 72_active.cf now ?
>
> super to be an outsider :)

I'm sorry Benny but I don't understand what you mean with either of 
these statements?

Regards,
KAM

Re: Rules published

Posted by Benny Pedersen <me...@junc.org>.
Den 2012-08-12 05:39, Kevin A. McGrail skrev:
> Woohoo! We had 20 masscheckers and met the threshold!

who will run dos2unix on 72_active.cf now ?

super to be an outsider :)



Re: Rules published

Posted by Jari Fredriksson <ja...@iki.fi>.
> Woohoo! We had 20 masscheckers and met the threshold!
>
>   HAM CONTRIBUTORS FOUND: 20 (required 10)
> SPAM CONTRIBUTORS FOUND: 20 (required 10)
>
>
>   HAM: 156383 (150000 required)
> SPAM: 341595 (150000 required)
>
>
>
> Regards,
> KAM
>

Excellent!


Re: Rules published

Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
Thanks.  I will look at this.  I'm on a fairly active witch-hunt for 
bugs.  I'm guessing it's going to have to be the configuration change 
for message boundaries.

On 8/12/2012 10:38 AM, John Hardin wrote:
> On Sun, 12 Aug 2012, Kevin A. McGrail wrote:
>
>> On 8/12/2012 12:34 AM, John Hardin wrote:
>>>  It's still vastly underreporting my corpora. 
>>
>> Is this what it is reporting?
>>
>> ls -al *jhar* | grep Aug  | grep -v \~ | awk '{print $9;}' | xargs wc -l
>>      241 ham-bb-jhardin.log
>>        7 ham-bb-jhardin_fraud.log
>>      243 ham-net-bb-jhardin.log
>>        7 ham-net-bb-jhardin_fraud.log
>>      104 spam-bb-jhardin.log
>>       23 spam-bb-jhardin_fraud.log
>>       99 spam-net-bb-jhardin.log
>>       28 spam-net-bb-jhardin_fraud.log
>>      752 total
>
> Close, but not exact, and the spam corpus counts in the "set 0, broken 
> down by contributor" section differ from the counts in the "corpus 
> quality" section.
>
> "set 0":
>     ham-bb-jhardin: 235
>     ham-bb-jhardin_fraud: 1
>     ham-net-bb-jhardin: 237
>     ham-net-bb-jhardin_fraud: 1
>     spam-bb-jhardin: 65
>     spam-bb-jhardin_fraud: 17
>     spam-net-bb-jhardin: 63
>     spam-net-bb-jhardin_fraud: 22
>
> "corpus quality":
>     ham-bb-jhardin: 235
>     ham-bb-jhardin_fraud: 1
>     ham-net-bb-jhardin: 237
>     ham-net-bb-jhardin_fraud: 1
>     spam-bb-jhardin: 98
>     spam-bb-jhardin_fraud: 17
>     spam-net-bb-jhardin: 93
>     spam-net-bb-jhardin_fraud: 22
>
> Here are the message counts from the master copies of my uploaded 
> corpora mailboxes based on /^From\s/:
>
>     fraud/corpus_ham_fraud.mbox: 25
>     fraud/spam: 5628
>     public/ham: 6092
>     public/spam: 7197
>
>


-- 
*Kevin A. McGrail*
President

Peregrine Computer Consultants Corporation
3927 Old Lee Highway, Suite 102-C
Fairfax, VA 22030-2422

http://www.pccc.com/

703-359-9700 x50 / 800-823-8402 (Toll-Free)
703-359-8451 (fax)
KMcGrail@PCCC.com <ma...@pccc.com>


Re: Rules published

Posted by John Hardin <jh...@impsec.org>.
On Sun, 12 Aug 2012, Kevin A. McGrail wrote:

> On 8/12/2012 12:34 AM, John Hardin wrote:
>>  It's still vastly underreporting my corpora. 
>
> Is this what it is reporting?
>
> ls -al *jhar* | grep Aug  | grep -v \~ | awk '{print $9;}' | xargs wc -l
>      241 ham-bb-jhardin.log
>        7 ham-bb-jhardin_fraud.log
>      243 ham-net-bb-jhardin.log
>        7 ham-net-bb-jhardin_fraud.log
>      104 spam-bb-jhardin.log
>       23 spam-bb-jhardin_fraud.log
>       99 spam-net-bb-jhardin.log
>       28 spam-net-bb-jhardin_fraud.log
>      752 total

Close, but not exact, and the spam corpus counts in the "set 0, broken 
down by contributor" section differ from the counts in the "corpus 
quality" section.

"set 0":
 	ham-bb-jhardin: 235
 	ham-bb-jhardin_fraud: 1
 	ham-net-bb-jhardin: 237
 	ham-net-bb-jhardin_fraud: 1
 	spam-bb-jhardin: 65
 	spam-bb-jhardin_fraud: 17
 	spam-net-bb-jhardin: 63
 	spam-net-bb-jhardin_fraud: 22

"corpus quality":
 	ham-bb-jhardin: 235
 	ham-bb-jhardin_fraud: 1
 	ham-net-bb-jhardin: 237
 	ham-net-bb-jhardin_fraud: 1
 	spam-bb-jhardin: 98
 	spam-bb-jhardin_fraud: 17
 	spam-net-bb-jhardin: 93
 	spam-net-bb-jhardin_fraud: 22

Here are the message counts from the master copies of my uploaded corpora 
mailboxes based on /^From\s/:

 	fraud/corpus_ham_fraud.mbox: 25
 	fraud/spam: 5628
 	public/ham: 6092
 	public/spam: 7197


-- 
  John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
  jhardin@impsec.org    FALaholic #11174     pgpk -a jhardin@impsec.org
  key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  3 days until the 67th anniversary of the end of World War II

Re: Rules published

Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
On 8/12/2012 12:34 AM, John Hardin wrote:
> It's still vastly underreporting my corpora. 

Is this what it is reporting?

ls -al *jhar* | grep Aug  | grep -v \~ | awk '{print $9;}' | xargs wc -l
      241 ham-bb-jhardin.log
        7 ham-bb-jhardin_fraud.log
      243 ham-net-bb-jhardin.log
        7 ham-net-bb-jhardin_fraud.log
      104 spam-bb-jhardin.log
       23 spam-bb-jhardin_fraud.log
       99 spam-net-bb-jhardin.log
       28 spam-net-bb-jhardin_fraud.log
      752 total

Re: Rules published

Posted by Axb <ax...@gmail.com>.
On 08/12/2012 06:34 AM, John Hardin wrote:
> On Sat, 11 Aug 2012, Kevin A. McGrail wrote:
>
>> Woohoo! We had 20 masscheckers and met the threshold!
>>
>> HAM CONTRIBUTORS FOUND: 20 (required 10)
>> SPAM CONTRIBUTORS FOUND: 20 (required 10)
>>
>> HAM: 156383 (150000 required)
>> SPAM: 341595 (150000 required)
>
> It's still vastly underreporting my corpora.
>
> Jari, how do your counts look?
>

after +12hr of processing:

16M     ham-net-axb-brasil.log
1.6M    ham-net-axb-coi-bulk.log
16M     ham-net-axb-fraud.log
16M     ham-net-axb-generic.log
28M     spam-net-axb-brasil.log
4.0K    spam-net-axb-coi-bulk.log
310M    spam-net-axb-fraud.log
263M    spam-net-axb-generic.log



Re: Rules published

Posted by Jari Fredriksson <ja...@iki.fi>.
12.08.2012 07:34, John Hardin kirjoitti:
> Jari, how do your counts look?
Looks just what I would expect. Just great!

-- 

Avoid reality at all costs.



Re: Rules published

Posted by John Hardin <jh...@impsec.org>.
On Sat, 11 Aug 2012, Kevin A. McGrail wrote:

> Woohoo! We had 20 masscheckers and met the threshold!
>
> HAM CONTRIBUTORS FOUND: 20 (required 10)
> SPAM CONTRIBUTORS FOUND: 20 (required 10)
>
> HAM: 156383 (150000 required)
> SPAM: 341595 (150000 required)

It's still vastly underreporting my corpora.

Jari, how do your counts look?

-- 
  John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
  jhardin@impsec.org    FALaholic #11174     pgpk -a jhardin@impsec.org
  key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
   No representation without taxation!
-----------------------------------------------------------------------
  4 days until the 67th anniversary of the end of World War II