Posted to dev@spark.apache.org by Hossein <fa...@gmail.com> on 2018/05/22 22:23:10 UTC
ML Pipelines in R
Hi all,
SparkR supports calling MLlib functionality with an R-friendly API. Since
Spark 1.5, the (new) SparkML API, which is based on pipelines and parameters,
has matured significantly. It allows users to build and maintain complicated
machine learning pipelines. A lot of this functionality is difficult to
expose using the simple formula-based API in SparkR.
I just submitted a SPIP <https://issues.apache.org/jira/browse/SPARK-21190>
to propose a new R package, SparkML, to be distributed along with SparkR as
part of Apache Spark. Please view the JIRA ticket and provide feedback &
comments.
Thanks,
--Hossein
Re: ML Pipelines in R
Posted by Hossein <fa...@gmail.com>.
Correction: the SPIP is https://issues.apache.org/jira/browse/SPARK-24359
--Hossein
On Tue, May 22, 2018 at 6:23 PM, Hossein <fa...@gmail.com> wrote:
> Hi all,
>
> SparkR supports calling MLlib functionality with an R-friendly API. Since
> Spark 1.5, the (new) SparkML API, which is based on pipelines and parameters,
> has matured significantly. It allows users to build and maintain complicated
> machine learning pipelines. A lot of this functionality is difficult to
> expose using the simple formula-based API in SparkR.
>
> I just submitted a SPIP
> <https://issues.apache.org/jira/browse/SPARK-21190> to propose a new R
> package, SparkML, to be distributed along with SparkR as part of Apache
> Spark. Please view the JIRA ticket and provide feedback & comments.
>
> Thanks,
> --Hossein
>