You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by newein <ka...@gmail.com> on 2014/10/16 08:45:30 UTC

Document topic mapping problem using Mahout 0.9 CVB algorithm

Hi,

I am trying to do the topics analysis on set of documents using the latest
version of Mahout.

The output for topic to term mapping is proper with each topic having list
of terms with corresponding probabilities.

But the when I tried getting the document to topic mapping , it only
displays a set of topics starting with some letter. Like in this case all
topics starting with letter a

Following is the sample code used to generate the document topic mapping:

*VectorDumper.main(new String[] 
                { 
                "-i" , inputDocTopicsDir
                , "-o", oututDocTopicsDir
                , "-d", inputDictionaryDir
                , "-dt", "sequencefile"
                , "-sort", "true"
                , "-vs", "10" });*

Sample output:
{2d:0.019996671414880783,3d:0.019994853350969108,4d:0.02000171234917903,5d:0.019994290328033588,a.config:0.01999309367417373,a.k.a:0.02000227944902019,a.system:0.01999771644223781,aaa:0.020003361639812457,aam:0.019990182999365072,aapm:0.020012465032122083,aapv:0.01999879522431889,aar:0.019995543474585993,aas:0.019995157547471696,aav:0.02000267326012652,ab:0.020025978185034182,aba:0.01999553819903237,abandon:0.020013355238553677,abandoned:0.01999559962237951,abandonment:0.019994194616256,abandons:0.02001433184497984,abatement:0.01997728075793184,abberationa:0.020001189392395737}



--
View this message in context: http://lucene.472066.n3.nabble.com/Document-topic-mapping-problem-using-Mahout-0-9-CVB-algorithm-tp4164471.html
Sent from the Mahout User List mailing list archive at Nabble.com.