You are viewing a plain text version of this content. The canonical link for it is here.
Posted to ruleqa@spamassassin.apache.org by "Kevin A. McGrail" <KM...@PCCC.com> on 2012/08/12 05:39:57 UTC
Rules published
Woohoo! We had 20 masscheckers and met the threshold!
HAM CONTRIBUTORS FOUND: 20 (required 10)
SPAM CONTRIBUTORS FOUND: 20 (required 10)
HAM: 156383 (150000 required)
SPAM: 341595 (150000 required)
Regards,
KAM
Re: Rules published
Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
On 8/13/2012 1:00 PM, Benny Pedersen wrote:
> Den 2012-08-13 17:28, Kevin A. McGrail skrev:
>
>> I'm sorry Benny but I don't understand what you mean with either of
>> these statements?
>
> you said that i am not a developper, now i sent you 2 patch files for
> spamassassin ?, unfair ?
I really don't know what you are talking about with this statement either.
You wrote: "who will run dos2unix on 72_active.cf now ? "
What does this mean? Do you see an issue with 72_active.cf? Is there a
CRLF issue you think is causing issues or annoyance?
You also wrote: "you said that i am not a developper, now i sent you 2
patch files for spamassassin ?, unfair ? "
Please add more context. I work with a lot of people and don't have a
perfect memory.
However, I have no memory of ever saying you were not a developer and
honestly haven't seen your patches so I don't know what you are talking
about. BUT as an open source project, we welcome people to submit
patches and contribute to the project. If any statement I made was
taken otherwise, I apologize. But I think perhaps you are confusing me
with someone else?
Regards,
KAM
Re: Rules published
Posted by Benny Pedersen <me...@junc.org>.
Den 2012-08-13 17:28, Kevin A. McGrail skrev:
> I'm sorry Benny but I don't understand what you mean with either of
> these statements?
you said that i am not a developper, now i sent you 2 patch files for
spamassassin ?, unfair ?
Re: Rules published
Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
On 8/12/2012 1:35 PM, Benny Pedersen wrote:
> Den 2012-08-12 05:39, Kevin A. McGrail skrev:
>> Woohoo! We had 20 masscheckers and met the threshold!
>
> who will run dos2unix on 72_active.cf now ?
>
> super to be an outsider :)
I'm sorry Benny but I don't understand what you mean with either of
these statements?
Regards,
KAM
Re: Rules published
Posted by Benny Pedersen <me...@junc.org>.
Den 2012-08-12 05:39, Kevin A. McGrail skrev:
> Woohoo! We had 20 masscheckers and met the threshold!
who will run dos2unix on 72_active.cf now ?
super to be an outsider :)
Re: Rules published
Posted by Jari Fredriksson <ja...@iki.fi>.
> Woohoo! We had 20 masscheckers and met the threshold!
>
> HAM CONTRIBUTORS FOUND: 20 (required 10)
> SPAM CONTRIBUTORS FOUND: 20 (required 10)
>
>
> HAM: 156383 (150000 required)
> SPAM: 341595 (150000 required)
>
>
>
> Regards,
> KAM
>
Excellent!
Re: Rules published
Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
Thanks. I will look at this. I'm on a fairly active witch-hunt for
bugs. I'm guessing it's going to have to be the configuration change
for message boundaries.
On 8/12/2012 10:38 AM, John Hardin wrote:
> On Sun, 12 Aug 2012, Kevin A. McGrail wrote:
>
>> On 8/12/2012 12:34 AM, John Hardin wrote:
>>> It's still vastly underreporting my corpora.
>>
>> Is this what it is reporting?
>>
>> ls -al *jhar* | grep Aug | grep -v \~ | awk '{print $9;}' | xargs wc -l
>> 241 ham-bb-jhardin.log
>> 7 ham-bb-jhardin_fraud.log
>> 243 ham-net-bb-jhardin.log
>> 7 ham-net-bb-jhardin_fraud.log
>> 104 spam-bb-jhardin.log
>> 23 spam-bb-jhardin_fraud.log
>> 99 spam-net-bb-jhardin.log
>> 28 spam-net-bb-jhardin_fraud.log
>> 752 total
>
> Close, but not exact, and the spam corpus counts in the "set 0, broken
> down by contributor" section differ from the counts in the "corpus
> quality" section.
>
> "set 0":
> ham-bb-jhardin: 235
> ham-bb-jhardin_fraud: 1
> ham-net-bb-jhardin: 237
> ham-net-bb-jhardin_fraud: 1
> spam-bb-jhardin: 65
> spam-bb-jhardin_fraud: 17
> spam-net-bb-jhardin: 63
> spam-net-bb-jhardin_fraud: 22
>
> "corpus quality":
> ham-bb-jhardin: 235
> ham-bb-jhardin_fraud: 1
> ham-net-bb-jhardin: 237
> ham-net-bb-jhardin_fraud: 1
> spam-bb-jhardin: 98
> spam-bb-jhardin_fraud: 17
> spam-net-bb-jhardin: 93
> spam-net-bb-jhardin_fraud: 22
>
> Here are the message counts from the master copies of my uploaded
> corpora mailboxes based on /^From\s/:
>
> fraud/corpus_ham_fraud.mbox: 25
> fraud/spam: 5628
> public/ham: 6092
> public/spam: 7197
>
>
--
*Kevin A. McGrail*
President
Peregrine Computer Consultants Corporation
3927 Old Lee Highway, Suite 102-C
Fairfax, VA 22030-2422
http://www.pccc.com/
703-359-9700 x50 / 800-823-8402 (Toll-Free)
703-359-8451 (fax)
KMcGrail@PCCC.com <ma...@pccc.com>
Re: Rules published
Posted by John Hardin <jh...@impsec.org>.
On Sun, 12 Aug 2012, Kevin A. McGrail wrote:
> On 8/12/2012 12:34 AM, John Hardin wrote:
>> It's still vastly underreporting my corpora.
>
> Is this what it is reporting?
>
> ls -al *jhar* | grep Aug | grep -v \~ | awk '{print $9;}' | xargs wc -l
> 241 ham-bb-jhardin.log
> 7 ham-bb-jhardin_fraud.log
> 243 ham-net-bb-jhardin.log
> 7 ham-net-bb-jhardin_fraud.log
> 104 spam-bb-jhardin.log
> 23 spam-bb-jhardin_fraud.log
> 99 spam-net-bb-jhardin.log
> 28 spam-net-bb-jhardin_fraud.log
> 752 total
Close, but not exact, and the spam corpus counts in the "set 0, broken
down by contributor" section differ from the counts in the "corpus
quality" section.
"set 0":
ham-bb-jhardin: 235
ham-bb-jhardin_fraud: 1
ham-net-bb-jhardin: 237
ham-net-bb-jhardin_fraud: 1
spam-bb-jhardin: 65
spam-bb-jhardin_fraud: 17
spam-net-bb-jhardin: 63
spam-net-bb-jhardin_fraud: 22
"corpus quality":
ham-bb-jhardin: 235
ham-bb-jhardin_fraud: 1
ham-net-bb-jhardin: 237
ham-net-bb-jhardin_fraud: 1
spam-bb-jhardin: 98
spam-bb-jhardin_fraud: 17
spam-net-bb-jhardin: 93
spam-net-bb-jhardin_fraud: 22
Here are the message counts from the master copies of my uploaded corpora
mailboxes based on /^From\s/:
fraud/corpus_ham_fraud.mbox: 25
fraud/spam: 5628
public/ham: 6092
public/spam: 7197
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhardin@impsec.org FALaholic #11174 pgpk -a jhardin@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
3 days until the 67th anniversary of the end of World War II
Re: Rules published
Posted by "Kevin A. McGrail" <KM...@PCCC.com>.
On 8/12/2012 12:34 AM, John Hardin wrote:
> It's still vastly underreporting my corpora.
Is this what it is reporting?
ls -al *jhar* | grep Aug | grep -v \~ | awk '{print $9;}' | xargs wc -l
241 ham-bb-jhardin.log
7 ham-bb-jhardin_fraud.log
243 ham-net-bb-jhardin.log
7 ham-net-bb-jhardin_fraud.log
104 spam-bb-jhardin.log
23 spam-bb-jhardin_fraud.log
99 spam-net-bb-jhardin.log
28 spam-net-bb-jhardin_fraud.log
752 total
Re: Rules published
Posted by Axb <ax...@gmail.com>.
On 08/12/2012 06:34 AM, John Hardin wrote:
> On Sat, 11 Aug 2012, Kevin A. McGrail wrote:
>
>> Woohoo! We had 20 masscheckers and met the threshold!
>>
>> HAM CONTRIBUTORS FOUND: 20 (required 10)
>> SPAM CONTRIBUTORS FOUND: 20 (required 10)
>>
>> HAM: 156383 (150000 required)
>> SPAM: 341595 (150000 required)
>
> It's still vastly underreporting my corpora.
>
> Jari, how do your counts look?
>
after +12hr of processing:
16M ham-net-axb-brasil.log
1.6M ham-net-axb-coi-bulk.log
16M ham-net-axb-fraud.log
16M ham-net-axb-generic.log
28M spam-net-axb-brasil.log
4.0K spam-net-axb-coi-bulk.log
310M spam-net-axb-fraud.log
263M spam-net-axb-generic.log
Re: Rules published
Posted by Jari Fredriksson <ja...@iki.fi>.
12.08.2012 07:34, John Hardin kirjoitti:
> Jari, how do your counts look?
Looks just what I would expect. Just great!
--
Avoid reality at all costs.
Re: Rules published
Posted by John Hardin <jh...@impsec.org>.
On Sat, 11 Aug 2012, Kevin A. McGrail wrote:
> Woohoo! We had 20 masscheckers and met the threshold!
>
> HAM CONTRIBUTORS FOUND: 20 (required 10)
> SPAM CONTRIBUTORS FOUND: 20 (required 10)
>
> HAM: 156383 (150000 required)
> SPAM: 341595 (150000 required)
It's still vastly underreporting my corpora.
Jari, how do your counts look?
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhardin@impsec.org FALaholic #11174 pgpk -a jhardin@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
No representation without taxation!
-----------------------------------------------------------------------
4 days until the 67th anniversary of the end of World War II