Posted to solr-commits@lucene.apache.org by sh...@apache.org on 2009/06/05 12:23:48 UTC

svn commit: r781975 - in /lucene/solr/trunk: CHANGES.txt src/java/org/apache/solr/spelling/SpellingQueryConverter.java

Author: shalin
Date: Fri Jun  5 10:23:48 2009
New Revision: 781975

URL: http://svn.apache.org/viewvc?rev=781975&view=rev
Log:
SOLR-1204 -- Enhance SpellingQueryConverter to handle UTF-8 instead of ASCII only

Modified:
    lucene/solr/trunk/CHANGES.txt
    lucene/solr/trunk/src/java/org/apache/solr/spelling/SpellingQueryConverter.java

Modified: lucene/solr/trunk/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/solr/trunk/CHANGES.txt?rev=781975&r1=781974&r2=781975&view=diff
==============================================================================
--- lucene/solr/trunk/CHANGES.txt (original)
+++ lucene/solr/trunk/CHANGES.txt Fri Jun  5 10:23:48 2009
@@ -230,6 +230,8 @@
 
 57. SOLR-1152: Snapshoot on ReplicationHandler should acccept location as a request parameter (shalin)
 
+58. SOLR-1204: Enhance SpellingQueryConverter to handle UTF-8 instead of ASCII only (Michael Ludwig via shalin)
+
 Optimizations
 ----------------------
  1. SOLR-374: Use IndexReader.reopen to save resources by re-using parts of the

Modified: lucene/solr/trunk/src/java/org/apache/solr/spelling/SpellingQueryConverter.java
URL: http://svn.apache.org/viewvc/lucene/solr/trunk/src/java/org/apache/solr/spelling/SpellingQueryConverter.java?rev=781975&r1=781974&r2=781975&view=diff
==============================================================================
--- lucene/solr/trunk/src/java/org/apache/solr/spelling/SpellingQueryConverter.java (original)
+++ lucene/solr/trunk/src/java/org/apache/solr/spelling/SpellingQueryConverter.java Fri Jun  5 10:23:48 2009
@@ -37,7 +37,7 @@
  **/
 public class SpellingQueryConverter extends QueryConverter  {
 
-  protected Pattern QUERY_REGEX = Pattern.compile("(?:(?!(\\w+:|\\d+)))\\w+");
+  protected Pattern QUERY_REGEX = Pattern.compile("(?:(?!(\\p{L}+:|\\d+)))\\p{L}+");
 
   /**
    * Converts the original query string to a collection of Lucene Tokens.
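
Note on the regex change above: in Java's default regex mode, \w covers only [a-zA-Z_0-9], whereas \p{L} matches any Unicode letter. The sketch below is illustrative only and is not part of the commit. The two patterns are copied from the diff, while the class name QueryRegexDemo, the helper matches(), and the sample query are assumptions made for the demonstration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class QueryRegexDemo {
    // Patterns copied from the diff: before and after r781975.
    static final Pattern OLD_REGEX = Pattern.compile("(?:(?!(\\w+:|\\d+)))\\w+");
    static final Pattern NEW_REGEX = Pattern.compile("(?:(?!(\\p{L}+:|\\d+)))\\p{L}+");

    // Hypothetical helper (not in SpellingQueryConverter): returns every
    // substring of the input that the given pattern matches, in order.
    static List<String> matches(Pattern p, String input) {
        List<String> out = new ArrayList<String>();
        Matcher m = p.matcher(input);
        while (m.find()) {
            out.add(m.group());
        }
        return out;
    }

    public static void main(String[] args) {
        // Sample fielded query with a non-ASCII letter (U+00E4, a-umlaut).
        String query = "title:h\u00e4user";
        // The old pattern splits the word at the non-ASCII letter.
        System.out.println("old: " + matches(OLD_REGEX, query));
        // The new pattern keeps the word whole.
        System.out.println("new: " + matches(NEW_REGEX, query));
    }
}
```

With this sample input the old pattern yields the fragments "h" and "user", and the new pattern yields the whole word. Both skip the "title:" field prefix because of the negative lookahead. One side effect worth keeping in mind: \p{L} matches letters only, so digits and underscores that \w accepted are no longer part of a match.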