You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Nils Hoeller <ni...@arcor.de> on 2005/09/06 07:53:52 UTC

use of Luke s getHighFreqTerms

Hi,

i ve got only one little question:


I m using the class HighFreqTerms of the Luke Project to
find those terms in my index ( made by Nutch) 

Now I wanted to filter the Terms with a 
stopwordlist (junkwords).

The method getHighFreqTerms gives me the ability 
to define a Hashtable junkwords , which I suppose 
to be the filtering part. 

But how do I have to use it, since 
my first tries failed:

I ve tried something like:

Hashtable junk = new Hashtable();
String word = new String();
word = "the";
junk.put(new Integer(word.hashCode()),word);
TermInfo[] terms = getHighFreqTerms(dir, junk, new String[]{"content"});

But this did not work, which means did not filter the word "the".

What am I doing wrong?

Thanks for your help.

Nils


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org