You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Grant Ingersoll (JIRA)" <ji...@apache.org> on 2012/06/03 20:26:23 UTC
[jira] [Assigned] (MAHOUT-1009) Remove old LDA implementation from
codebase
[ https://issues.apache.org/jira/browse/MAHOUT-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Grant Ingersoll reassigned MAHOUT-1009:
---------------------------------------
Assignee: Sebastian Schelter (was: Jake Mannix)
Sebastian is working on this
> Remove old LDA implementation from codebase
> -------------------------------------------
>
> Key: MAHOUT-1009
> URL: https://issues.apache.org/jira/browse/MAHOUT-1009
> Project: Mahout
> Issue Type: Improvement
> Components: Clustering
> Affects Versions: 0.7
> Reporter: Jake Mannix
> Assignee: Sebastian Schelter
> Priority: Minor
> Fix For: 0.7
>
>
> The old LDA is unmaintained and unsupported. We already (since 0.6) have a newer, faster version in the o.a.m.clustering.lda.cvb package, which I'm actively working on and using in production at Twitter. We should delete the old o.a.m.clustering.lda codebase.
> Normally, I'd say that we should at the same time promote o.a.m.clustering.lda.cvb up a package-level, but that would cause some serious merge conflicts on my GitHub branch (with updates/improvements/new features targetted for 0.8), so we can get users on this new code by simply changing the driver.classes.props to have "lda" point to CVB0Driver as the main().
> One thing which goes away entirely, is the LDAPrintTopics class, but it's replaced by simply doing VectorDumper with the -sort option on the model files, which is more standard anyways.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira