Posted to user@spark.apache.org by jinhong lu <lu...@gmail.com> on 2016/07/11 03:35:51 UTC

mllib based on dataset or dataframe

Hi,
    Since Dataset will be the major API in Spark 2.0, why will MLlib be DataFrame-based, with 'future development will focus on the DataFrame-based API'?

   Is there any plan to change MLlib from DataFrame-based to Dataset-based?


=============
Thanks,
lujinhong


---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org

Re: mllib based on dataset or dataframe

Posted by Yanbo Liang <yb...@gmail.com>.

DataFrame is a special case of Dataset (in Spark 2.0's Scala API it is a type
alias for Dataset[Row]), so in this context they mean the same thing.
In fact, the ML pipeline API accepts Dataset[_] instead of DataFrame in
Spark 2.0.
More accurately, we can say that MLlib will focus on the Dataset-based API for
further development.
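
To make the point above concrete, here is a minimal sketch, assuming Spark 2.0 or
later with spark-sql and spark-mllib on the classpath. The Doc case class, the
DatasetMlSketch object, and the column names are hypothetical and chosen only for
illustration; Tokenizer stands in for any ML Transformer. It shows a DataFrame
being assigned to a Dataset[Row] without conversion, and the same transformer
accepting both a typed Dataset and a DataFrame.

```scala
import org.apache.spark.ml.feature.Tokenizer
import org.apache.spark.sql.{DataFrame, Dataset, Row, SparkSession}

// Hypothetical record type, used only for this illustration.
case class Doc(id: Long, text: String)

object DatasetMlSketch {
  // Transformer.transform is declared against Dataset[_] in Spark 2.0,
  // so a typed Dataset[Doc] can be passed directly. The result is a DataFrame.
  def tokenize(docs: Dataset[Doc]): DataFrame =
    new Tokenizer()
      .setInputCol("text")
      .setOutputCol("words")
      .transform(docs)

  def main(args: Array[String]): Unit = {
    // Local session for the sketch; a real job would get its master elsewhere.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("dataset-ml-sketch")
      .getOrCreate()
    import spark.implicits._

    val docs: Dataset[Doc] = Seq(Doc(0L, "mllib accepts datasets")).toDS()

    // DataFrame is a type alias for Dataset[Row], so no conversion is needed here.
    val df: DataFrame = docs.toDF()
    val rows: Dataset[Row] = df

    // Typed Dataset in, DataFrame out.
    tokenize(docs).show(false)

    // The untyped DataFrame goes through the same Dataset[_] signature.
    new Tokenizer()
      .setInputCol("text")
      .setOutputCol("words")
      .transform(rows)
      .show(false)

    spark.stop()
  }
}
```

The typed view is useful for compile-time checks on your own code, while the ML
stages themselves still work with named columns and return a DataFrame.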

Thanks
Yanbo

2016-07-10 20:35 GMT-07:00 jinhong lu <lu...@gmail.com>:

> Hi,
>     Since Dataset will be the major API in Spark 2.0, why will MLlib be
> DataFrame-based, with 'future development will focus on the
> DataFrame-based API'?
>
>    Is there any plan to change MLlib from DataFrame-based to Dataset-based?
>
>
> =============
> Thanks,
> lujinhong
>
>
> ---------------------------------------------------------------------
> To unsubscribe e-mail: user-unsubscribe@spark.apache.org
>
>
