You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by Jake Mannix <ja...@gmail.com> on 2011/12/01 04:44:25 UTC

Re: Review Request: New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2944/
-----------------------------------------------------------

(Updated 2011-12-01 03:44:25.140987)


Review request for mahout and Ted Dunning.


Changes
-------

VectorDumper becomes a "top-terms" dumper as well.


Summary
-------

See MAHOUT-897


This addresses bug MAHOUT-897.
    https://issues.apache.org/jira/browse/MAHOUT-897


Diffs (updated)
-----

  trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDADriver.java 1208933 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDASampler.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0DocInferenceMapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0Driver.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0TopicTermVectorNormalizerMapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CachingCVB0Mapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CachingCVB0PerplexityMapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/InMemoryCollapsedVariationalBayes0.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/ModelTrainer.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/TopicModel.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/common/MemoryUtil.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/common/Pair.java 1208933 
  trunk/core/src/main/java/org/apache/mahout/math/DistributedRowMatrixWriter.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/math/MatrixUtils.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/math/stats/Sampler.java PRE-CREATION 
  trunk/core/src/test/java/org/apache/mahout/clustering/ClusteringTestUtils.java 1208933 
  trunk/core/src/test/java/org/apache/mahout/clustering/lda/TestMapReduce.java 1208933 
  trunk/core/src/test/java/org/apache/mahout/clustering/lda/cvb/TestCVBModelTrainer.java PRE-CREATION 
  trunk/core/src/test/java/org/apache/mahout/math/stats/SamplerTest.java PRE-CREATION 
  trunk/integration/src/main/java/org/apache/mahout/utils/vectors/VectorDumper.java 1208933 
  trunk/integration/src/main/java/org/apache/mahout/utils/vectors/VectorHelper.java 1208933 
  trunk/math/src/main/java/org/apache/mahout/math/AbstractVector.java 1208933 
  trunk/math/src/main/java/org/apache/mahout/math/NamedVector.java 1208933 
  trunk/src/conf/driver.classes.props 1208933 

Diff: https://reviews.apache.org/r/2944/diff


Testing
-------

mvn clean test


Thanks,

Jake


Re: Review Request: New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching

Posted by Jake Mannix <ja...@gmail.com>.
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2944/
-----------------------------------------------------------

(Updated 2011-12-02 20:49:52.055735)


Review request for mahout and Ted Dunning.


Changes
-------

Updates to VectorDumper and VectorHelper


Summary
-------

See MAHOUT-897


This addresses bug MAHOUT-897.
    https://issues.apache.org/jira/browse/MAHOUT-897


Diffs (updated)
-----

  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/InMemoryCollapsedVariationalBayes0.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CachingCVB0PerplexityMapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0TopicTermVectorNormalizerMapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CachingCVB0Mapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0Driver.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0DocInferenceMapper.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDASampler.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDADriver.java 1209684 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/ModelTrainer.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/TopicModel.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/common/MemoryUtil.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/common/Pair.java 1209684 
  trunk/core/src/main/java/org/apache/mahout/math/DistributedRowMatrixWriter.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/math/MatrixUtils.java PRE-CREATION 
  trunk/core/src/main/java/org/apache/mahout/math/stats/Sampler.java PRE-CREATION 
  trunk/core/src/test/java/org/apache/mahout/clustering/ClusteringTestUtils.java 1209684 
  trunk/core/src/test/java/org/apache/mahout/clustering/lda/TestMapReduce.java 1209684 
  trunk/core/src/test/java/org/apache/mahout/clustering/lda/cvb/TestCVBModelTrainer.java PRE-CREATION 
  trunk/core/src/test/java/org/apache/mahout/math/stats/SamplerTest.java PRE-CREATION 
  trunk/integration/src/main/java/org/apache/mahout/utils/vectors/VectorDumper.java 1209684 
  trunk/integration/src/main/java/org/apache/mahout/utils/vectors/VectorHelper.java 1209684 
  trunk/integration/src/test/java/org/apache/mahout/utils/vectors/VectorHelperTest.java PRE-CREATION 
  trunk/math/src/main/java/org/apache/mahout/math/AbstractVector.java 1209684 
  trunk/math/src/main/java/org/apache/mahout/math/NamedVector.java 1209684 
  trunk/src/conf/driver.classes.props 1209684 

Diff: https://reviews.apache.org/r/2944/diff


Testing
-------

mvn clean test


Thanks,

Jake