You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by td...@apache.org on 2014/05/20 06:29:41 UTC
[4/4] git commit: [SPARK-1874][MLLIB] Clean up MLlib sample data
[SPARK-1874][MLLIB] Clean up MLlib sample data
1. Added synthetic datasets for `MovieLensALS`, `LinearRegression`, `BinaryClassification`.
2. Embedded instructions in the help message of those example apps.
Per discussion with Matei on the JIRA page, new example data is under `data/mllib`.
Author: Xiangrui Meng <me...@databricks.com>
Closes #833 from mengxr/mllib-sample-data and squashes the following commits:
59f0a18 [Xiangrui Meng] add sample binary classification data
3c2f92f [Xiangrui Meng] add linear regression data
050f1ca [Xiangrui Meng] add a sample dataset for MovieLensALS example
Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/bcb9dce6
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/bcb9dce6
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/bcb9dce6
Branch: refs/heads/master
Commit: bcb9dce6f444a977c714117811bce0c54b417650
Parents: b0ce22e
Author: Xiangrui Meng <me...@databricks.com>
Authored: Mon May 19 21:29:33 2014 -0700
Committer: Tathagata Das <ta...@gmail.com>
Committed: Mon May 19 21:29:33 2014 -0700
----------------------------------------------------------------------
.../mllib/sample_binary_classification_data.txt | 100 ++
data/mllib/sample_linear_regression_data.txt | 501 ++++++
data/mllib/sample_movielens_data.txt | 1501 ++++++++++++++++++
.../examples/mllib/BinaryClassification.scala | 12 +-
.../spark/examples/mllib/LinearRegression.scala | 11 +-
.../spark/examples/mllib/MovieLensALS.scala | 15 +
6 files changed, 2138 insertions(+), 2 deletions(-)
----------------------------------------------------------------------