Posted to issues@spark.apache.org by "Vincenzo Selvaggio (JIRA)" <ji...@apache.org> on 2014/12/14 12:47:13 UTC

[jira] [Commented] (SPARK-1406) PMML model evaluation support via MLib

    [ https://issues.apache.org/jira/browse/SPARK-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14245893#comment-14245893 ] 

Vincenzo Selvaggio commented on SPARK-1406:
-------------------------------------------

A simple validator project is available at
https://github.com/selvinsource/spark-pmml-exporter-validator
It shows that the predictions made by Apache Spark and by JPMML Evaluator (which loads the PMML exported from Spark) are comparable, and therefore that the PMML export from Apache Spark works as expected.
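
As a rough illustration of the kind of check described above (this is not the validator project's actual code), the sketch below compares two lists of predictions pairwise within a tolerance. The function name, the tolerance values, and the sample numbers are all hypothetical.

```python
import math


def predictions_match(spark_preds, pmml_preds, rel_tol=1e-9, abs_tol=1e-9):
    """Return True if both lists have the same length and agree pairwise
    within the given tolerances."""
    if len(spark_preds) != len(pmml_preds):
        return False
    return all(
        math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol)
        for a, b in zip(spark_preds, pmml_preds)
    )


# Hypothetical predictions: one list as if produced by the Spark model,
# the other as if produced by a PMML evaluator loading the exported model.
spark_preds = [0.0, 1.0, 2.5000000001]
pmml_preds = [0.0, 1.0, 2.5]

print(predictions_match(spark_preds, pmml_preds, abs_tol=1e-6))
```

A tolerance is used instead of strict equality because two independent implementations of the same model can differ in the last few floating-point digits.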

> PMML model evaluation support via MLib
> --------------------------------------
>
>                 Key: SPARK-1406
>                 URL: https://issues.apache.org/jira/browse/SPARK-1406
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: Thomas Darimont
>            Assignee: Vincenzo Selvaggio
>         Attachments: MyJPMMLEval.java, SPARK-1406.pdf, SPARK-1406_v2.pdf, kmeans.xml
>
>
> It would be useful if Spark provided support for the evaluation of PMML models (http://www.dmg.org/v4-2/GeneralStructure.html).
> This would make it possible to use analytical models created with a statistical modeling tool such as R, SAS, or SPSS with Spark (MLlib), which would perform the actual model evaluation for a given input tuple. The PMML model would then just contain the "parameterization" of an analytical model.
> Other projects like JPMML-Evaluator do a similar thing.
> https://github.com/jpmml/jpmml/tree/master/pmml-evaluator
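
To make the idea of a PMML file as the "parameterization" of a model concrete, here is a minimal sketch of evaluating a k-means model for one input tuple. It assumes a heavily trimmed PMML 4.2 ClusteringModel fragment written for this example (not the attached kmeans.xml, and not schema-complete: the DataDictionary, MiningSchema, and ComparisonMeasure elements are omitted) and assumes Euclidean distance. The helper names are hypothetical, and this is neither Spark's nor JPMML's API.

```python
import math
import xml.etree.ElementTree as ET

# Namespace used by PMML 4.2 documents.
PMML_NS = "{http://www.dmg.org/PMML-4_2}"

# Illustrative fragment only: a real PMML file carries more required elements.
PMML_DOC = """<PMML xmlns="http://www.dmg.org/PMML-4_2" version="4.2">
  <ClusteringModel modelName="k-means" functionName="clustering"
                   modelClass="centerBased" numberOfClusters="2">
    <Cluster name="cluster_0"><Array n="2" type="real">0.0 0.0</Array></Cluster>
    <Cluster name="cluster_1"><Array n="2" type="real">10.0 10.0</Array></Cluster>
  </ClusteringModel>
</PMML>"""


def load_centers(pmml_text):
    """Read the cluster centers (the model's parameters) from the PMML text."""
    root = ET.fromstring(pmml_text)
    centers = {}
    for cluster in root.iter(PMML_NS + "Cluster"):
        values = cluster.find(PMML_NS + "Array").text.split()
        centers[cluster.get("name")] = [float(v) for v in values]
    return centers


def nearest_cluster(centers, point):
    """Evaluate the model: return the name of the center closest to the point."""
    return min(centers, key=lambda name: math.dist(centers[name], point))


centers = load_centers(PMML_DOC)
print(nearest_cluster(centers, [1.0, 2.0]))
```

The PMML text holds only the cluster centers; the evaluation logic (nearest center) lives in the evaluator, which is the split of responsibilities the description asks Spark to support.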


