You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Ievgen Prokhorenko (JIRA)" <ji...@apache.org> on 2019/07/18 03:50:00 UTC

[jira] [Created] (SPARK-28434) Decision Tree model isn't equal after save and load

Ievgen Prokhorenko created SPARK-28434:
------------------------------------------

             Summary: Decision Tree model isn't equal after save and load
                 Key: SPARK-28434
                 URL: https://issues.apache.org/jira/browse/SPARK-28434
             Project: Spark
          Issue Type: Bug
          Components: MLlib
    Affects Versions: 2.4.3
         Environment: spark from master
            Reporter: Ievgen Prokhorenko


The file `mllib/src/test/scala/org/apache/spark/mllib/tree/DecisionTreeSuite.scala` on the line no. 628 has a TODO saying:

 
{code:java}
// TODO: Check other fields besides the information gain.
{code}
If, in addition to the existing check of InformationGainStats' gain value I add another check, for instance, impurity – the test fails because the values are different in the saved model and the one restored from disk.

 

See PR with an example.

 

The tests are executed with this command:
{code:java}
build/mvn -e -Dtest=none -DwildcardSuites=org.apache.spark.mllib.tree.DecisionTreeSuite test{code}
 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org