Posted to user@mahout.apache.org by Plastic Flat <pl...@gmail.com> on 2011/06/20 03:19:14 UTC

lucene.vector SequenceFile formatted dictionary?

Hello all,

I noticed that lucene.vector can only write its dictionary file as plain
text; there is no dictType option to request, say, SequenceFile
output.

- Is there a way to convert this text dictionary to a SequenceFile using the
mahout command line?

- Or is there an easy way to read the text dictionary into a Java
HashMap using something analogous to the SequenceFile.Reader class?
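
To make the second question concrete, here is a rough sketch of the kind of
reader I have in mind. It is not a Mahout API: the class name
TextDictionaryReader and its load method are made up for illustration. It
assumes the dictionary is a tab-separated file with an optional leading count
line, an optional header line starting with '#', and then one
"term<TAB>docFreq<TAB>index" row per term. I have not confirmed that layout
against the lucene.vector source, so treat it as a guess.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Mahout.
public class TextDictionaryReader {

    // Assumed row layout: term<TAB>docFreq<TAB>index.
    // Lines with fewer than three fields (such as a leading count line)
    // and lines starting with '#' (an assumed header) are skipped, which
    // also means a term that itself begins with '#' would be dropped.
    public static Map<String, Integer> load(Reader in) throws IOException {
        Map<String, Integer> termToIndex = new HashMap<>();
        try (BufferedReader reader = new BufferedReader(in)) {
            String line;
            while ((line = reader.readLine()) != null) {
                if (line.isEmpty() || line.startsWith("#")) {
                    continue;
                }
                String[] fields = line.split("\t");
                if (fields.length < 3) {
                    continue;
                }
                termToIndex.put(fields[0], Integer.parseInt(fields[2].trim()));
            }
        }
        return termToIndex;
    }

    public static void main(String[] args) throws IOException {
        // Small in-memory sample in the assumed layout.
        String sample = "2\n#term\tdoc freq\tidx\napple\t3\t0\nbanana\t1\t1\n";
        Map<String, Integer> dict = load(new StringReader(sample));
        System.out.println(dict.get("banana"));
    }
}
```

For a real file, the same method could be handed a java.io.FileReader (or an
InputStreamReader with an explicit charset) pointing at the dictionary that
lucene.vector wrote.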

Thanks!
Jake