You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Ievgen Prokhorenko (JIRA)" <ji...@apache.org> on 2019/07/18 03:50:00 UTC
[jira] [Created] (SPARK-28434) Decision Tree model isn't equal
after save and load
Ievgen Prokhorenko created SPARK-28434:
------------------------------------------
Summary: Decision Tree model isn't equal after save and load
Key: SPARK-28434
URL: https://issues.apache.org/jira/browse/SPARK-28434
Project: Spark
Issue Type: Bug
Components: MLlib
Affects Versions: 2.4.3
Environment: spark from master
Reporter: Ievgen Prokhorenko
The file `mllib/src/test/scala/org/apache/spark/mllib/tree/DecisionTreeSuite.scala` on the line no. 628 has a TODO saying:
{code:java}
// TODO: Check other fields besides the information gain.
{code}
If, in addition to the existing check of InformationGainStats' gain value I add another check, for instance, impurity – the test fails because the values are different in the saved model and the one restored from disk.
See PR with an example.
The tests are executed with this command:
{code:java}
build/mvn -e -Dtest=none -DwildcardSuites=org.apache.spark.mllib.tree.DecisionTreeSuite test{code}
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org