You are viewing a plain text version of this content. The canonical link for it is here.

Posted to java-user@lucene.apache.org by Ridzwan Aminuddin <ra...@world-check.com> on 2009/05/21 08:22:08 UTC

corpus vacabulary

hi can someone point me in the direction of how i can get a string array of
the corpus/index vocabulary from the index using an indexreader?

Currently this is what i am doing:

IndexReader reader = IndexReader.open(indexdirectorypath);
termenumvar = reader.terms();

then i iterate through this termenum using the term() function while
termenum.next() still exists.

Is there a more simple refined way to do this? Thanks

Thanks.

Re: corpus vacabulary

Posted by Otis Gospodnetic <ot...@yahoo.com>.

Hello,

Perhaps the following will help:

asf-lucene/contrib$ ff HighFreq*java
./miscellaneous/src/java/org/apache/lucene/misc/HighFreqTerms.java

 Oits
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Ridzwan Aminuddin <ra...@world-check.com>
> To: java-user@lucene.apache.org
> Sent: Thursday, May 21, 2009 2:22:08 AM
> Subject: corpus vacabulary
> 
> hi can someone point me in the direction of how i can get a string array of
> the corpus/index vocabulary from the index using an indexreader?
> 
> Currently this is what i am doing:
> 
> IndexReader reader = IndexReader.open(indexdirectorypath);
> termenumvar = reader.terms();
> 
> then i iterate through this termenum using the term() function while
> termenum.next() still exists.
> 
> Is there a more simple refined way to do this? Thanks
> 
> Thanks.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org