Posted to users@spamassassin.apache.org by Rich Wales <ri...@richw.org> on 2016/11/23 23:24:41 UTC

"Complex regular subexpression recursion limit exceeded" error from sa-learn

I'm running Postfix, SpamAssassin, amavisd-new, and Dovecot on an Ubuntu
16.04 LTS server.

For some time now, I've been running my inbox and junk folder through
*sa-learn* every night, in order to educate my e-mail server about spam
messages that make it past SpamAssassin but which I subsequently mark as
spam manually.

Lately, I've been seeing a fair number of messages from *sa-learn* like
the following:

/Complex regular subexpression recursion limit (32766) exceeded at
/usr/share/perl5/Mail/SpamAssassin/HTML.pm line 745./

I've managed to identify specific individual e-mails that generate this
diagnostic, but I've looked at them and can't see anything obviously
strange.

Any thoughts?
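
One way to pin down which individual messages produce the diagnostic is to feed them to sa-learn one at a time and look for the warning text in the output. The following is only a minimal sketch under stated assumptions: each message is a separate file (as in a Maildir), `find_offenders` and `run_sa_learn` are hypothetical helper names, and the files really are spam, since `sa-learn --spam` will learn whatever it is given.

```python
import subprocess

# Start of the diagnostic text quoted above.
WARNING = "Complex regular subexpression recursion limit"


def run_sa_learn(path):
    """Run sa-learn on a single message file and return its combined output.

    Note: this really does learn the file as spam, so only point it at
    messages that are already classified as spam.
    """
    proc = subprocess.run(
        ["sa-learn", "--spam", str(path)],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # warnings are expected on stderr; merge both
        text=True,
        errors="replace",
    )
    return proc.stdout


def find_offenders(paths, runner=run_sa_learn):
    """Return the paths whose sa-learn output contains the warning."""
    return [p for p in paths if WARNING in runner(p)]
```

For example, `find_offenders(sorted(Path("Maildir/.Junk/cur").iterdir()))` would list the offending files, where the Maildir path is an assumption about the local layout and `Path` comes from `pathlib`.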
-- 
*Rich Wales*
richw@richw.org

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by Rich Wales <ri...@richw.org>.
On 11/23/16 21:13, Benny Pedersen wrote:

> and attach to this ticket the output of
> "spamassassin --lint -D >/tmp.txt 2>&1"
> so all installed plugin versions are known, in case it's already fixed

Done.

Rich Wales
richw@richw.org

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by John Hardin <jh...@impsec.org>.
On Thu, 24 Nov 2016, Benny Pedersen wrote:

> Rich Wales wrote on 2016-11-24 06:01:
>>> _OK, I can repro on trunk: . . .  It's that very long block of QP
>>> blanks right at the end. . . .  Please open a bug and attach that
>>> spample as a repro test case._
>>
>> Done.  (https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7374)
>
> Is disabling HTML postings to the mailing list still left to do? :=)
>
> And please attach to this ticket the output of
> "spamassassin --lint -D >/tmp.txt 2>&1"
> so all installed plugin versions are known, in case it's already fixed.

"I can repro on trunk" suggests it is not already fixed.

-- 
  John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
  jhardin@impsec.org    FALaholic #11174     pgpk -a jhardin@impsec.org
  key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
   A good high-school education is still essential, and
   college is where you go to get one.                -- MiddleAgedKen
-----------------------------------------------------------------------
  339 days since the first successful real return to launch site (SpaceX)

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by Benny Pedersen <me...@junc.eu>.
Rich Wales wrote on 2016-11-24 06:01:
>> _OK, I can repro on trunk: . . .  It's that very long block of QP
>> blanks right at the end. . . .  Please open a bug and attach that
>> spample as a repro test case._
> 
> Done.  (https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7374)

Is disabling HTML postings to the mailing list still left to do? :=)

And please attach to this ticket the output of
"spamassassin --lint -D >/tmp.txt 2>&1"
so all installed plugin versions are known, in case it's already fixed.
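
A note on capturing that output: in a POSIX shell, redirections are processed left to right, so "2>&1 >file" points stderr at the terminal before stdout is moved to the file, while ">file 2>&1" sends both streams to the file. As far as I know, SpamAssassin writes its -D debug output to stderr, so both streams need capturing. A minimal sketch of the same idea without shell redirection follows; `capture_all_output` and the output file name are hypothetical.

```python
import subprocess


def capture_all_output(cmd, out_path):
    """Run cmd, writing stdout and stderr together to out_path.

    Returns the command's exit code.
    """
    with open(out_path, "w") as out:
        # stderr=STDOUT merges the debug stream into the same file as stdout.
        proc = subprocess.run(cmd, stdout=out, stderr=subprocess.STDOUT)
    return proc.returncode
```

Usage would be along the lines of `capture_all_output(["spamassassin", "--lint", "-D"], "lint-debug.txt")`.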

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by Rich Wales <ri...@richw.org>.
> /OK, I can repro on trunk: . . .  It's that very long block of QP
> blanks right at the end. . . .  Please open a bug and attach that
> spample as a repro test case./

Done.  (https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7374)
-- 
*Rich Wales*
richw@richw.org

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by John Hardin <jh...@impsec.org>.
On Wed, 23 Nov 2016, Rich Wales wrote:

>> /The RE at that line looks pretty firmly anchored... Can you gzip up a
>> sample that fails for you and send it to me?/
>
> Sure.  See the attachment.

OK, I can repro on trunk:

Nov 23 19:17:00.141 [18349] dbg: message: HTML::Parser utf8_mode on (assumed UTF-8 octets)
Nov 23 19:17:00.187 [18349] warn: Complex regular subexpression recursion limit (32766) exceeded at lib/Mail/SpamAssassin/HTML.pm line 745.
Nov 23 19:17:00.193 [18349] dbg: message: spaces (octets) in HTML: 952 out of 3954

It's that very long block of QP blanks right at the end. If you edit out 
all those =20s after the </td> it stops emitting that warning.

That would be a workaround for you to make sa-learn shut up about your 
corpus until the problem is fixed. Blanks don't affect Bayes (at least, 
not until we implement multi-word tokens) so it shouldn't affect what gets 
learned.

Please open a bug and attach that spample as a repro test case. I'm not 
too familiar with that bit of the code so I don't have a fast fix.
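
The workaround described above (editing out the long run of =20s) could be scripted roughly as follows. This is only a sketch under stated assumptions: it works on a copy of the raw message bytes, treats any run of at least `min_run` consecutive `=20` sequences (optionally continued across quoted-printable soft line breaks) as removable, and collapses the run to a single `=20`. The function name and the threshold of 20 are hypothetical, and it does not check that the text is actually inside a quoted-printable part.

```python
import re


def collapse_qp_blanks(raw: bytes, min_run: int = 20) -> bytes:
    """Collapse long runs of QP-encoded blanks (=20) to a single =20.

    A run may continue across soft line breaks ("=" followed by a newline),
    which are consumed along with the blanks they follow.
    """
    pattern = re.compile(rb"(?:=20(?:=\r?\n)?){%d,}" % min_run)
    return pattern.sub(b"=20", raw)
```

Run it on a copy of the corpus rather than on the live mailbox, so the original messages stay available as test cases for the bug report.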


-- 
  John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
  jhardin@impsec.org    FALaholic #11174     pgpk -a jhardin@impsec.org
  key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  338 days since the first successful real return to launch site (SpaceX)

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by John Hardin <jh...@impsec.org>.
On Wed, 23 Nov 2016, Rich Wales wrote:

> I'm running Postfix, SpamAssassin, amavisd-new, and Dovecot on an Ubuntu
> 16.04 LTS server.
>
> For some time now, I've been running my inbox and junk folder through
> *sa-learn* every night, in order to educate my e-mail server about spam
> messages that make it past SpamAssassin but which I subsequently mark as
> spam manually.
>
> Lately, I've been seeing a fair number of messages from *sa-learn* like
> the following:
>
> /Complex regular subexpression recursion limit (32766) exceeded at
> /usr/share/perl5/Mail/SpamAssassin/HTML.pm line 745./
>
> I've managed to identify specific individual e-mails that generate this
> diagnostic, but I've looked at them and can't see anything obviously
> strange.
>
> Any thoughts?

The RE at that line looks pretty firmly anchored...

Can you gzip up a sample that fails for you and send it to me?


-- 
  John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
  jhardin@impsec.org    FALaholic #11174     pgpk -a jhardin@impsec.org
  key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
   You do not examine legislation in the light of the benefits it
   will convey if properly administered, but in the light of the
   wrongs it would do and the harms it would cause if improperly
   administered.                                  -- Lyndon B. Johnson
-----------------------------------------------------------------------
  338 days since the first successful real return to launch site (SpaceX)

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by Rich Wales <ri...@richw.org>.
> try disabling any local rules to see if it helps. . . .
> it's apparently about rules, not about mail content.

I temporarily removed my local.cf file, but I still got the warning.

Rich Wales
richw@richw.org

Re: "Complex regular subexpression recursion limit exceeded" error from sa-learn

Posted by Matus UHLAR - fantomas <uh...@fantomas.sk>.
On 23.11.16 15:24, Rich Wales wrote:
>I'm running Postfix, SpamAssassin, amavisd-new, and Dovecot on an Ubuntu
>16.04 LTS server.
>
>For some time now, I've been running my inbox and junk folder through
>*sa-learn* every night, in order to educate my e-mail server about spam
>messages that make it past SpamAssassin but which I subsequently mark as
>spam manually.
>
>Lately, I've been seeing a fair number of messages from *sa-learn* like
>the following:
>
>/Complex regular subexpression recursion limit (32766) exceeded at
>/usr/share/perl5/Mail/SpamAssassin/HTML.pm line 745./

try disabling any local rules to see if it helps.

>I've managed to identify specific individual e-mails that generate this
>diagnostic, but I've looked at them and can't see anything obviously
>strange.

it's apparently about rules, not about mail content.

-- 
Matus UHLAR - fantomas, uhlar@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Depression is merely anger without enthusiasm.