You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Egor Pakhomov (JIRA)" <ji...@apache.org> on 2014/10/09 14:20:34 UTC

[jira] [Created] (SPARK-3878) Benchmarks and common tests for mllib algorithm

Egor Pakhomov created SPARK-3878:
------------------------------------

             Summary: Benchmarks and common tests for mllib algorithm
                 Key: SPARK-3878
                 URL: https://issues.apache.org/jira/browse/SPARK-3878
             Project: Spark
          Issue Type: New Feature
          Components: MLlib
            Reporter: Egor Pakhomov


There is no common practice among MLlib for testing algorithms: every model generates it's own random test data. There is no easy extractable test cases applible to another algorithm. There is no benchmarks for comparing algorithms. After implementing new algorithm it's very hard to understand how it should be tested. 
Lack of serialization testing: MLlib algorithms don't contain tests which test that model work after serialization. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org