You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spamassassin.apache.org by bu...@bugzilla.spamassassin.org on 2004/12/17 16:22:49 UTC

[Bug 4040] New: 30_text_de.cf brings up error message

http://bugzilla.spamassassin.org/show_bug.cgi?id=4040

           Summary: 30_text_de.cf brings up error message
           Product: Spamassassin
           Version: 3.0.1
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P4
         Component: Rules
        AssignedTo: dev@spamassassin.apache.org
        ReportedBy: floeff@arcor.de


The 30_text_de.cf rule brings up an error message with -D --lint:

warning: description for PORN_URL_SEX is over 50 chars
(and so on for a lot of other translations)



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040





------- Additional Comments From jm@jmason.org  2004-12-17 18:39 -------
Personally, I would be very happy to see the length restrictions on descriptions
and rule names disappear.  IMO, they were a mistake:

1. they have resulted in less readable names and descriptions, even in english.
2. lint errors from this is one of the top 5 FAQs.
3. we have a hard enough time getting descriptions for other languages, without
this limitation making it harder for translators.

How hard would it be to fix the report code to just deal with it?



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040





------- Additional Comments From klaus.heinz@onlinehome.de  2004-12-17 18:18 -------
Subject: Re:  30_text_de.cf brings up error message

bugzilla-daemon@bugzilla.spamassassin.org wrote:

> Yeah, we need some translators I guess to get the descriptions down below
> the limit.

I do not agree with this conclusion. As I already commented on another
bug ([Bug 3085] TRACKER_ID rule not very useful) some languages simply
use longer words/sentences (on average) than English.
Having no short and accurate translations of many/most computer related
English terms complicates the matter.

> We have this on more than one set of translations.

IMO, this really indicates the limit of 50 characters is too low.

I count 823 German rule description (788 unique) and 242 of them have a
length of > 50 characters, with a maximum length of 70. This was the
result of cutting away as much as I could for the release of 3.0 (I
_did_ test with --lint before I submitted the descriptions).
I doubt you can bring them _all_ under the limit of 50 characters.

ciao
     Klaus





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040

duncf@debian.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|DUPLICATE                   |



------- Additional Comments From duncf@debian.org  2005-01-01 13:44 -------
Hmmm.... Duplicate but of what?



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040





------- Additional Comments From quinlan@pathname.com  2004-12-18 00:45 -------
Subject: Re:  30_text_de.cf brings up error message

I'd accept a patch to remove the restriction for lang != en (standard
disclaimers apply).

Daniel





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040





------- Additional Comments From parkerm@pobox.com  2005-01-05 08:12 -------
It is a dup of 4007, granted all of the interesting discussion about fixing the
two bug happened here, instead of there.  I'll close that one as a dup so we
don't lose the discussion.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040

parkerm@pobox.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |admin@mtp.epsi.fr



------- Additional Comments From parkerm@pobox.com  2005-01-05 08:13 -------
*** Bug 4007 has been marked as a duplicate of this bug. ***



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040

parkerm@pobox.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |DUPLICATE



------- Additional Comments From parkerm@pobox.com  2004-12-17 08:06 -------
Yeah, we need some translators I guess to get the descriptions down below the limit.

We have this on more than one set of translations.

*** This bug has been marked as a duplicate of 4007 ***



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

[Bug 4040] 30_text_de.cf brings up error message

Posted by bu...@bugzilla.spamassassin.org.
http://bugzilla.spamassassin.org/show_bug.cgi?id=4040





------- Additional Comments From lwilton@earthlink.net  2004-12-17 20:11 -------
Subject: Re:  30_text_de.cf brings up error message

>I do not agree with this conclusion. As I already commented on another
>bug ([Bug 3085] TRACKER_ID rule not very useful) some languages simply
>use longer words/sentences (on average) than English.
>Having no short and accurate translations of many/most computer related
>English terms complicates the matter.

Some standard rules for internationalizing software:

Creating Application Messages
When you create messages in your application, English text strings are usually shorter than equivalent text strings in other languages. The following table shows the additional average growth for strings, based on their initial length.

English length (in characters) Additional growth for localized strings 
1 to 4 100% 
5 to 10 80% 
11 to 20 60% 
21 to 30 40% 
31 to 50 20% 
over 50 10% 


At a glance, a 50 byte English string should be allowed to grow to at least 60 bytes in other languages.  In this case I'd personally allow it to grow to 70 characters at least.  

I'm aware of the original goal of keeping the description and score on a single 80-character line.  However, there are cases right now where that doesn't work, and SA contains code to wrap long descriptions.  I think trying to enforce an 80 byte line limit (especially on translations) is beating a horse that was never alive in the first place, and is certainly dead now.

In fact the 50 byte limit is a pain even for English descriptions in many cases.





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.