You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-commits@lucene.apache.org by mi...@apache.org on 2009/03/27 20:04:26 UTC

svn commit: r759307 - in /lucene/java/trunk/contrib: CHANGES.txt analyzers/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java

Author: mikemccand
Date: Fri Mar 27 19:04:25 2009
New Revision: 759307

URL: http://svn.apache.org/viewvc?rev=759307&view=rev
Log:
LUCENE-1576: fix BrazilianAnalyzer to downcase before filtering stop words

Modified:
    lucene/java/trunk/contrib/CHANGES.txt
    lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java

Modified: lucene/java/trunk/contrib/CHANGES.txt
URL: http://svn.apache.org/viewvc/lucene/java/trunk/contrib/CHANGES.txt?rev=759307&r1=759306&r2=759307&view=diff
==============================================================================
--- lucene/java/trunk/contrib/CHANGES.txt (original)
+++ lucene/java/trunk/contrib/CHANGES.txt Fri Mar 27 19:04:25 2009
@@ -32,6 +32,10 @@
     characters to only apply to the correct subset (Daniel Cheng via
     Mike McCandless)
 
+ 7. LUCENE-1576: Fix BrazilianAnalyzer to downcase tokens after
+    StandardTokenizer so that stop words with mixed case are filtered
+    out.  (Rafael Cunha de Almeida, Douglas Campos via Mike McCandless)
+
 New features
 
  1. LUCENE-1470: Added TrieRangeQuery, a much faster implementation of

Modified: lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java
URL: http://svn.apache.org/viewvc/lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java?rev=759307&r1=759306&r2=759307&view=diff
==============================================================================
--- lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java (original)
+++ lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java Fri Mar 27 19:04:25 2009
@@ -130,11 +130,10 @@
 	 */
 	public final TokenStream tokenStream(String fieldName, Reader reader) {
 		TokenStream result = new StandardTokenizer( reader );
+		result = new LowerCaseFilter( result );
 		result = new StandardFilter( result );
 		result = new StopFilter( result, stoptable );
 		result = new BrazilianStemFilter( result, excltable );
-		// Convert to lowercase after stemming!
-		result = new LowerCaseFilter( result );
 		return result;
 	}
 }