You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by bu...@apache.org on 2002/05/12 18:56:22 UTC

DO NOT REPLY [Bug 9015] New: - e-mail token in StandardTokenizer.jj does not match valid e-mail addresses

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=9015>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=9015

e-mail token in StandardTokenizer.jj does not match valid e-mail addresses

           Summary: e-mail token in StandardTokenizer.jj does not match
                    valid e-mail addresses
           Product: Lucene
           Version: CVS Nightly - Specify date in submission
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: Analysis
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: danson@germane-software.com


E-mail token in StandardTokenizer.jj does not match many valid e-mail 
addresses. See line 106: 
 
<EMAIL: <ALPHANUM> "@" <ALPHANUM> ("." <ALPHANUM>)+ > 
 
For example, neither danson@germane-software.com (because of the dash) nor 
dale.anson@germane-software.com (because of the first dot and the dash) match. 
the following is slightly better, but does not come close to meeting the 
specifications of RFC 822: 
 
<EMAIL: <ALPHANUM> ("."|"-" <ALPHANUM>)+ "@" <ALPHANUM> ("."|"-" <ALPHANUM>)+ 
> 
 
This is being reported against the May 11 nightly build (I compiled from 
source using the supplied Ant build file on RedHat Linux 7.2, jikes, javacc 
2.0, and Sun Linux JDK 1.4), however, I originally ran across this problem in 
Lucene 1.2 rc4.

--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>