You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2016/03/03 00:56:18 UTC

[jira] [Commented] (SPARK-13161) Extend MLlib LDA to include options for Author Topic Modeling

    [ https://issues.apache.org/jira/browse/SPARK-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15176747#comment-15176747 ] 

Joseph K. Bradley commented on SPARK-13161:
-------------------------------------------

There are many generalizations of LDA, so it would be valuable to know about people's use cases and needs.  Do you have a use case you could describe for this?

It would be great to have this feature as a Spark package in the meantime.

> Extend MLlib LDA to include options for Author Topic Modeling
> -------------------------------------------------------------
>
>                 Key: SPARK-13161
>                 URL: https://issues.apache.org/jira/browse/SPARK-13161
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 1.6.0
>            Reporter: John Hogue
>
> The author-topic model, a generative model for documents that extends Latent Dirichlet Allocation.
> By modeling the interests of authors, we can answer a range of important queries about the content of document collections. With an appropriate author model, we can establish which subjects an author writes about, which authors are likely to have written documents similar to an observed document, and which authors produce similar work.
> Full whitepaper here.
> http://mimno.infosci.cornell.edu/info6150/readings/398.pdf



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org