You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2015/09/14 21:04:45 UTC

[jira] [Updated] (SPARK-10595) Various ML programming guide cleanups post 1.5

     [ https://issues.apache.org/jira/browse/SPARK-10595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xiangrui Meng updated SPARK-10595:
----------------------------------
    Shepherd: Feynman Liang

> Various ML programming guide cleanups post 1.5
> ----------------------------------------------
>
>                 Key: SPARK-10595
>                 URL: https://issues.apache.org/jira/browse/SPARK-10595
>             Project: Spark
>          Issue Type: Documentation
>          Components: Documentation, ML, MLlib
>    Affects Versions: 1.5.0
>            Reporter: Joseph K. Bradley
>            Assignee: Joseph K. Bradley
>            Priority: Minor
>
> Various ML guide cleanups.
> * ml-guide.md: Make it easier to access the algorithm-specific guides.
> * LDA user guide: EM often begins with useless topics, but running longer generally improves them dramatically.  E.g., 10 iterations on a Wikipedia dataset produces useless topics, but 50 iterations produces very meaningful topics.
> * mllib-feature-extraction.html#elementwiseproduct: “w” parameter should be “scalingVec”
> * Clean up Binarizer user guide a little.
> * Document in Pipeline that users should not put an instance into the Pipeline in more than 1 place.
> * spark.ml Word2Vec user guide: clean up grammar/writing
> * Chi Sq Feature Selector docs: Improve text in doc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org