Posted to user@mahout.apache.org by Necati Demir <nd...@demir.web.tr> on 2012/03/20 22:04:50 UTC

TFIDF weights are greater than zero

Hello,

I am trying to use seq2sparse to compute TFIDF weights, but I think I am
making a mistake: although I pass the -lnorm parameter, words that
appear in all documents have weights greater than zero. I am using the
following command:

/usr/local/src/mahout/bin/mahout seq2sparse -lnorm -wt TFIDF -i
files-seqdir/ -o files-vectors -ow

To see the result, I use this command:

/usr/local/src/mahout/bin/mahout seqdumper -s
files-vectors/tfidf-vectors/part-r-00000
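
To make the expectation concrete, here is a minimal sketch of the
arithmetic. With the textbook definition, idf = ln(N / df), a word that
appears in every document gets a weight of exactly zero. A smoothed
variant such as 1 + ln(N / (df + 1)) stays positive in the same case. The
smoothed formula, the function names, and the numbers are assumptions
for illustration only; I have not confirmed which formula seq2sparse
actually applies.

```python
import math


def textbook_idf(num_docs, doc_freq):
    """Textbook IDF: ln(N / df). Zero when the term is in every document."""
    return math.log(num_docs / doc_freq)


def smoothed_idf(num_docs, doc_freq):
    """Assumed smoothed IDF: 1 + ln(N / (df + 1)). Illustrative only."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical corpus: 100 documents, and the term occurs in all of them.
num_docs = 100
doc_freq = 100

print("textbook idf:", textbook_idf(num_docs, doc_freq))
print("smoothed idf:", smoothed_idf(num_docs, doc_freq))
```

If the weights I see follow something like the second formula, that
would explain why they never reach zero.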

Any suggestions?

-- 
Necati DEMİR