Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2017/01/09 11:31:58 UTC
[jira] [Updated] (SPARK-19134) Fix several Python mllib and status api examples not working
[ https://issues.apache.org/jira/browse/SPARK-19134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hyukjin Kwon updated SPARK-19134:
---------------------------------
Description:
*binary_classification_metrics_example.py*
{code}
./bin/spark-submit examples/src/main/python/mllib/binary_classification_metrics_example.py
{code}
{code}
  File ".../spark/examples/src/main/python/mllib/binary_classification_metrics_example.py", line 39, in <lambda>
    .rdd.map(lambda row: LabeledPoint(row[0], row[1]))
  File ".../spark/python/pyspark/mllib/regression.py", line 54, in __init__
    self.features = _convert_to_vector(features)
  File ".../spark/python/pyspark/mllib/linalg/__init__.py", line 80, in _convert_to_vector
    raise TypeError("Cannot convert type %s into Vector" % type(l))
TypeError: Cannot convert type <class 'pyspark.ml.linalg.SparseVector'> into Vector
{code}
*status_api_demo.py*
{code}
PYSPARK_PYTHON=python3 ./bin/spark-submit examples/src/main/python/status_api_demo.py
{code}
{code}
Traceback (most recent call last):
  File ".../spark/examples/src/main/python/status_api_demo.py", line 22, in <module>
    import Queue
ImportError: No module named 'Queue'
{code}
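The {{ImportError}} above arises because the Python 2 module {{Queue}} was renamed to {{queue}} in Python 3. A minimal sketch of an import that works on both versions follows; the names {{result_queue}} and {{message}} are hypothetical and not taken from {{status_api_demo.py}}, and this is one possible approach rather than necessarily the change made for this issue.

```python
# Import the thread-safe queue module under a single name on Python 2 and 3.
try:
    import queue            # Python 3 module name
except ImportError:
    import Queue as queue   # Python 2 module name

# Hypothetical usage: pass one result between producer and consumer.
result_queue = queue.Queue()
result_queue.put("job finished")
message = result_queue.get()
```

Code that refers to {{queue.Queue}} after this import runs unchanged under either interpreter.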
*bisecting_k_means_example.py*
{code}
./bin/spark-submit examples/src/main/python/mllib/bisecting_k_means_example.py
{code}
{code}
Traceback (most recent call last):
  File "/Users/hyukjinkwon/Desktop/workspace/repos/forked/spark/examples/src/main/python/mllib/bisecting_k_means_example.py", line 46, in <module>
    model.save(sc, path)
AttributeError: 'BisectingKMeansModel' object has no attribute 'save'
{code}
*elementwise_product_example.py*
{code}
./bin/spark-submit examples/src/main/python/mllib/elementwise_product_example.py
{code}
{code}
Traceback (most recent call last):
  File "/Users/hyukjinkwon/Desktop/workspace/repos/forked/spark/examples/src/main/python/mllib/elementwise_product_example.py", line 48, in <module>
    for each in transformedData2.collect():
  File "/Users/hyukjinkwon/Desktop/workspace/repos/forked/spark/python/pyspark/mllib/linalg/__init__.py", line 478, in __getattr__
    return getattr(self.array, item)
AttributeError: 'numpy.ndarray' object has no attribute 'collect'
{code}
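The last traceback names {{numpy.ndarray}} even though {{collect()}} was called on {{transformedData2}}. The frame at {{__getattr__}} suggests that the vector object forwards unknown attributes to its underlying array, so {{transformedData2}} is presumably a single vector rather than an RDD. The sketch below reproduces that delegation pattern in plain Python; {{ArrayBackedVector}} is a hypothetical stand-in that wraps a {{list}}, not the actual pyspark class.

```python
class ArrayBackedVector(object):
    """Hypothetical stand-in for a vector class that wraps an array."""

    def __init__(self, values):
        self.array = list(values)

    def __getattr__(self, item):
        # Invoked only when normal attribute lookup fails;
        # the request is forwarded to the wrapped array.
        return getattr(self.array, item)


vec = ArrayBackedVector([0.0, 2.0, 6.0])

try:
    vec.collect()          # neither the wrapper nor the list defines collect()
    error_message = None
except AttributeError as exc:
    error_message = str(exc)

print(error_message)
```

The message reports the wrapped type, not the wrapper, which matches the shape of the error in the traceback above.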
was:
{code}
./bin/spark-submit examples/src/main/python/mllib/binary_classification_metrics_example.py
{code}
{code}
  File ".../spark/examples/src/main/python/mllib/binary_classification_metrics_example.py", line 39, in <lambda>
    .rdd.map(lambda row: LabeledPoint(row[0], row[1]))
  File ".../spark/python/pyspark/mllib/regression.py", line 54, in __init__
    self.features = _convert_to_vector(features)
  File ".../spark/python/pyspark/mllib/linalg/__init__.py", line 80, in _convert_to_vector
    raise TypeError("Cannot convert type %s into Vector" % type(l))
TypeError: Cannot convert type <class 'pyspark.ml.linalg.SparseVector'> into Vector
{code}
{code}
PYSPARK_PYTHON=python3 ./bin/spark-submit examples/src/main/python/status_api_demo.py
{code}
{code}
Traceback (most recent call last):
  File ".../spark/examples/src/main/python/status_api_demo.py", line 22, in <module>
    import Queue
ImportError: No module named 'Queue'
{code}
{code}
./bin/spark-submit examples/src/main/python/mllib/bisecting_k_means_example.py
{code}
{code}
Traceback (most recent call last):
  File "/Users/hyukjinkwon/Desktop/workspace/repos/forked/spark/examples/src/main/python/mllib/bisecting_k_means_example.py", line 46, in <module>
    model.save(sc, path)
AttributeError: 'BisectingKMeansModel' object has no attribute 'save'
{code}
{code}
./bin/spark-submit examples/src/main/python/mllib/elementwise_product_example.py
{code}
{code}
Traceback (most recent call last):
  File "/Users/hyukjinkwon/Desktop/workspace/repos/forked/spark/examples/src/main/python/mllib/elementwise_product_example.py", line 48, in <module>
    for each in transformedData2.collect():
  File "/Users/hyukjinkwon/Desktop/workspace/repos/forked/spark/python/pyspark/mllib/linalg/__init__.py", line 478, in __getattr__
    return getattr(self.array, item)
AttributeError: 'numpy.ndarray' object has no attribute 'collect'
{code}
> Fix several Python mllib and status api examples not working
> ------------------------------------------------------------
>
> Key: SPARK-19134
> URL: https://issues.apache.org/jira/browse/SPARK-19134
> Project: Spark
> Issue Type: Bug
> Components: MLlib, PySpark
> Reporter: Hyukjin Kwon
> Priority: Minor
>