You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Matthew Bedford (Jira)" <ji...@apache.org> on 2019/09/24 20:04:00 UTC

[jira] [Created] (SPARK-29235) CrossValidatorModel.avgMetrics disappears after model is written/read again

Matthew Bedford created SPARK-29235:
---------------------------------------

             Summary: CrossValidatorModel.avgMetrics disappears after model is written/read again
                 Key: SPARK-29235
                 URL: https://issues.apache.org/jira/browse/SPARK-29235
             Project: Spark
          Issue Type: Bug
          Components: ML
    Affects Versions: 2.4.1
         Environment: Databricks cluster:

{
    "num_workers": 4,
    "cluster_name": "mabedfor-test-classfix",
    "spark_version": "5.3.x-cpu-ml-scala2.11",
    "spark_conf": {
        "spark.databricks.delta.preview.enabled": "true"
    },
    "node_type_id": "Standard_DS12_v2",
    "driver_node_type_id": "Standard_DS12_v2",
    "ssh_public_keys": [],
    "custom_tags": {},
    "spark_env_vars": {
        "PYSPARK_PYTHON": "/databricks/python3/bin/python3"
    },
    "autotermination_minutes": 120,
    "enable_elastic_disk": true,
    "cluster_source": "UI",
    "init_scripts": [],
    "cluster_id": "0722-165622-calls746"
}
            Reporter: Matthew Bedford


 
Right after a CrossValidatorModel is trained, it has avgMetrics.  After the model is written to disk and read later, it no longer has avgMetrics.  To reproduce:
{{from pyspark.ml.tuning import CrossValidator, CrossValidatorModel
}}{{}}

{{cv = CrossValidator(...) #fill with params
}}{{}}

{{cvModel = cv.fit(trainDF) #given dataframe with training}}

{{data}}{{print(cvModel.avgMetrics) #prints a nonempty list as expected}}

{{cvModel.write().save({color:#172b4d}"/tmp/model"{color})}}

{{cvModel2 = CrossValidatorModel.read().load({color:#172b4d}{color:#172b4d}"/tmp/model"{color}{color})}}{{{color:#172b4d}print(cvModel2.avgMetrics) #BUG - prints an empty list{color}}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org