Posted to user@mahout.apache.org by Necati Demir <nd...@demir.web.tr> on 2012/03/20 22:04:50 UTC
TFIDF weights are greater than zero
Hello,
I am trying to use seq2sparse to compute TFIDF weights, but I think I am
making a mistake: although I pass the -lnorm parameter, words that appear
in all documents have weights greater than zero. I am using the following
command:
/usr/local/src/mahout/bin/mahout seq2sparse -lnorm -wt TFIDF -i files-seqdir/ -o files-vectors -ow
To see the result, I use this command:
/usr/local/src/mahout/bin/mahout seqdumper -s files-vectors/tfidf-vectors/part-r-00000
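The expectation of a zero weight comes from the textbook TF-IDF definition,
where idf = ln(N / df) and so a word found in all N documents gets idf = 0.
Below is a minimal Python sketch of that textbook formula only. The function
name and the sample numbers are made up for illustration, and this is an
assumption about the expected behaviour, not necessarily the formula that
seq2sparse applies:

```python
import math

def textbook_tfidf(tf, df, n_docs):
    """Textbook TF-IDF: raw term frequency times ln(N / df).

    Hypothetical helper for illustration; not Mahout's implementation.
    """
    return tf * math.log(n_docs / df)

n_docs = 4  # made-up corpus size

# A word that appears in every document: ln(4 / 4) = 0, so the weight is 0.
print(textbook_tfidf(tf=3, df=4, n_docs=n_docs))

# A word that appears in only one document gets a positive weight.
print(textbook_tfidf(tf=3, df=1, n_docs=n_docs))
```

A variant that smooths the formula (for example, by adding a constant to the
document frequency or to the idf term) would give a nonzero weight even for
words that occur in every document.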
Any suggestions?
--
Necati DEMİR
--------------------