Posted to reviews@spark.apache.org by MLnick <gi...@git.apache.org> on 2016/06/27 13:22:25 UTC

[GitHub] spark pull request #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes ...

GitHub user MLnick opened a pull request:

    https://github.com/apache/spark/pull/13924

    [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML migration guide

    This PR adds the breaking changes from [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) to the migration guide.
    
    ## How was this patch tested?
    
    Built docs locally.
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/MLnick/spark SPARK-15643-migration-guide

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/13924.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #13924
    
----
commit 28e041211fc335e5a8cab0616c54d4881cd17331
Author: Nick Pentreath <ni...@za.ibm.com>
Date:   2016-06-27T13:19:42Z

    Add breaking changes to ML migration guide

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML m...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Will be merged once #13378 is merged.


[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r68866018
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,87 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `spark-mllib-local` 
    --- End diff --
    
    Should be "mllib-local" (no "spark-")


[GitHub] spark issue #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML m...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61303 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61303/consoleFull)** for PR 13924 at commit [`28e0412`](https://github.com/apache/spark/commit/28e041211fc335e5a8cab0616c54d4881cd17331).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r68920293
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,87 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `spark-mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were moved to a new package, `spark.ml.linalg`. 
    +The DataFrame-based APIs in `spark.ml` now depend on the `spark.ml.linalg` classes, 
    +leading to a few breaking changes, predominantly in various model classes 
    +(see [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) for a full list).
    +
    +**Note:** the RDD-based APIs in `spark.mllib` continue to depend on the previous package `spark.mllib.linalg`.
    +
    +_Converting vectors and matrices_
    --- End diff --
    
    Good point - though I only see `asML` not `fromML`?
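
For reference, a minimal Scala sketch of the single-instance round trip discussed in this thread, assuming Spark 2.0 or later with the `spark-mllib` artifact on the classpath. The object name `LinalgRoundTrip` and the sample values are illustrative only. `asML` is an instance method on the `spark.mllib.linalg` types, whereas `fromML` sits on the `Vectors` and `Matrices` companion objects, which may be why it is easy to miss when looking at the classes themselves.

```scala
import org.apache.spark.ml.{linalg => ml}
import org.apache.spark.mllib.{linalg => mllib}

// Hypothetical wrapper object; no SparkSession is needed for local linalg types.
object LinalgRoundTrip {
  def main(args: Array[String]): Unit = {
    // Start from the RDD-based (spark.mllib) types.
    val oldVec: mllib.Vector = mllib.Vectors.dense(1.0, 2.0, 3.0)
    val oldMat: mllib.Matrix = mllib.Matrices.dense(2, 2, Array(1.0, 2.0, 3.0, 4.0))

    // mllib -> ml: asML is called on the instance.
    val newVec: ml.Vector = oldVec.asML
    val newMat: ml.Matrix = oldMat.asML

    // ml -> mllib: fromML is called on the companion objects.
    val backVec: mllib.Vector = mllib.Vectors.fromML(newVec)
    val backMat: mllib.Matrix = mllib.Matrices.fromML(newMat)

    // The round trip should preserve the underlying values.
    assert(backVec.toArray.sameElements(oldVec.toArray))
    assert(backMat.toArray.sameElements(oldMat.toArray))
  }
}
```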


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61464/
    Test PASSed.


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    LGTM
    Merging with master and branch-2.0 now that https://github.com/apache/spark/pull/13997 has been merged
    Thank you!


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Merged build finished. Test PASSed.


[GitHub] spark issue #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML m...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    I just merged https://github.com/apache/spark/pull/13378


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    The changes look good, so just the Python item remains.  Thanks!


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61464 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61464/consoleFull)** for PR 13924 at commit [`ac49f31`](https://github.com/apache/spark/commit/ac49f31cd83aa2755e0f1948f981beee66a16527).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Merged build finished. Test PASSed.


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61408 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61408/consoleFull)** for PR 13924 at commit [`6ef09a3`](https://github.com/apache/spark/commit/6ef09a31c4e8808277357359ac6d048d866ce9f0).


[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r68866022
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,87 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `spark-mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were moved to a new package, `spark.ml.linalg`. 
    --- End diff --
    
    "moved" --> "copied"


[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r69024587
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,102 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were copied to a new package, `spark.ml.linalg`. 
    +The DataFrame-based APIs in `spark.ml` now depend on the `spark.ml.linalg` classes, 
    +leading to a few breaking changes, predominantly in various model classes 
    +(see [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) for a full list).
    +
    +**Note:** the RDD-based APIs in `spark.mllib` continue to depend on the previous package `spark.mllib.linalg`.
    +
    +_Converting vectors and matrices_
    +
    +While most pipeline components support backward compatibility for loading, 
    +some existing `DataFrames` and pipelines in Spark versions prior to 2.0, that contain vector or matrix 
    +columns, may need to be migrated to the new `spark.ml` vector and matrix types. 
    +Utilities for converting `DataFrame` columns from `spark.mllib.linalg` to `spark.ml.linalg` types
    +(and vice versa) can be found in `spark.mllib.util.MLUtils`.
    +
    +There are also utility methods available for converting single instances of 
    +vectors and matrices. Use the `asML` method on a `mllib.linalg.Vector` / `mllib.linalg.Matrix`
    +for converting to `ml.linalg` types, and 
    +`mllib.linalg.Vectors.fromML` / `mllib.linalg.Matrices.fromML` 
    +for converting to `mllib.linalg` types.
    +
    +<div class="codetabs">
    +<div data-lang="scala"  markdown="1">
    +
    +{% highlight scala %}
    +import org.apache.spark.mllib.util.MLUtils
    +
    +// convert DataFrame columns
    +val convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +val convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    +// convert a single vector or matrix
    +val mlVec: org.apache.spark.ml.linalg.Vector = mllibVec.asML
    +val mlMat: org.apache.spark.ml.linalg.Matrix = mllibMat.asML
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Scala docs](api/scala/index.html#org.apache.spark.mllib.util.MLUtils$) for further detail.
    +</div>
    +
    +<div data-lang="java" markdown="1">
    +
    +{% highlight java %}
    +import org.apache.spark.mllib.util.MLUtils;
    +import org.apache.spark.sql.Dataset;
    +
    +// convert DataFrame columns
    +Dataset<Row> convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF);
    +Dataset<Row> convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF);
    +// convert a single vector or matrix
    +org.apache.spark.ml.linalg.Vector mlVec = mllibVec.asML
    +org.apache.spark.ml.linalg.Matrix mlMat = mllibMat.asML
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Java docs](api/java/org/apache/spark/mllib/util/MLUtils.html) for further detail.
    +</div>
    +
    +<div data-lang="python"  markdown="1">
    +
    +{% highlight python %}
    +from pyspark.mllib.util import MLUtils
    +
    +# convert DataFrame columns
    +convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +convertedMatrxDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    --- End diff --
    
    That may have just been overlooked, but that's a good point that there is already a decent way to do the conversion.  Could you please just note that way here?


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61408 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61408/consoleFull)** for PR 13924 at commit [`6ef09a3`](https://github.com/apache/spark/commit/6ef09a31c4e8808277357359ac6d048d866ce9f0).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61546/
    Test PASSed.


[GitHub] spark issue #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML m...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61303 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61303/consoleFull)** for PR 13924 at commit [`28e0412`](https://github.com/apache/spark/commit/28e041211fc335e5a8cab0616c54d4881cd17331).


[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/13924


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61546 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61546/consoleFull)** for PR 13924 at commit [`919bfe9`](https://github.com/apache/spark/commit/919bfe9c73fca485ff33a528feffa5b59b0b7e86).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61545 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61545/consoleFull)** for PR 13924 at commit [`c2ce7cd`](https://github.com/apache/spark/commit/c2ce7cd9659484cec325ed137271c9b6ed52923d).


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    @yanboliang @jkbradley @mengxr updated.


[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61545 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61545/consoleFull)** for PR 13924 at commit [`c2ce7cd`](https://github.com/apache/spark/commit/c2ce7cd9659484cec325ed137271c9b6ed52923d).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r68926461
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,102 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were copied to a new package, `spark.ml.linalg`. 
    +The DataFrame-based APIs in `spark.ml` now depend on the `spark.ml.linalg` classes, 
    +leading to a few breaking changes, predominantly in various model classes 
    +(see [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) for a full list).
    +
    +**Note:** the RDD-based APIs in `spark.mllib` continue to depend on the previous package `spark.mllib.linalg`.
    +
    +_Converting vectors and matrices_
    +
    +While most pipeline components support backward compatibility for loading, 
    +some existing `DataFrames` and pipelines in Spark versions prior to 2.0, that contain vector or matrix 
    +columns, may need to be migrated to the new `spark.ml` vector and matrix types. 
    +Utilities for converting `DataFrame` columns from `spark.mllib.linalg` to `spark.ml.linalg` types
    +(and vice versa) can be found in `spark.mllib.util.MLUtils`.
    +
    +There are also utility methods available for converting single instances of 
    +vectors and matrices. Use the `asML` method on a `mllib.linalg.Vector` / `mllib.linalg.Matrix`
    +for converting to `ml.linalg` types, and 
    +`mllib.linalg.Vectors.fromML` / `mllib.linalg.Matrices.fromML` 
    +for converting to `mllib.linalg` types.
    +
    +<div class="codetabs">
    +<div data-lang="scala"  markdown="1">
    +
    +{% highlight scala %}
    +import org.apache.spark.mllib.util.MLUtils
    +
    +// convert DataFrame columns
    +val convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +val convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    +// convert a single vector or matrix
    +val mlVec: org.apache.spark.ml.linalg.Vector = mllibVec.asML
    +val mlMat: org.apache.spark.ml.linalg.Matrix = mllibMat.asML
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Scala docs](api/scala/index.html#org.apache.spark.mllib.util.MLUtils$) for further detail.
    +</div>
    +
    +<div data-lang="java" markdown="1">
    +
    +{% highlight java %}
    +import org.apache.spark.mllib.util.MLUtils;
    +import org.apache.spark.sql.Dataset;
    +
    +// convert DataFrame columns
    +Dataset<Row> convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF);
    +Dataset<Row> convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF);
    +// convert a single vector or matrix
    +org.apache.spark.ml.linalg.Vector mlVec = mllibVec.asML
    +org.apache.spark.ml.linalg.Matrix mlMat = mllibMat.asML
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Java docs](api/java/org/apache/spark/mllib/util/MLUtils.html) for further detail.
    +</div>
    +
    +<div data-lang="python"  markdown="1">
    +
    +{% highlight python %}
    +from pyspark.mllib.util import MLUtils
    +
    +# convert DataFrame columns
    +convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    --- End diff --
    
    Note: it looks like the Python linalg classes don't have the single-instance conversion methods `asML` / `fromML` (I commented on [SPARK-15944](https://issues.apache.org/jira/browse/SPARK-15944?focusedCommentId=15355014&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15355014)).
    
    Not sure whether this is intended or we just missed them. One can do `newVec = Vectors.dense(oldVec)` in Python directly, so if that is the expected way to do things I can add that here.




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Merged build finished. Test PASSed.




[GitHub] spark issue #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML m...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61303/
    Test PASSed.




[GitHub] spark issue #13924: [WIP][SPARK-15643][DOC][ML] Add breaking changes to ML m...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Merged build finished. Test PASSed.




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Merged build finished. Test PASSed.




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    @jkbradley I updated the Python example on the assumption that #13997 will get merged; see https://github.com/apache/spark/pull/13924#discussion_r69083602.




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61545/
    Test PASSed.




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61546 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61546/consoleFull)** for PR 13924 at commit [`919bfe9`](https://github.com/apache/spark/commit/919bfe9c73fca485ff33a528feffa5b59b0b7e86).




[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r69083602
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,102 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were copied to a new package, `spark.ml.linalg`. 
    +The DataFrame-based APIs in `spark.ml` now depend on the `spark.ml.linalg` classes, 
    +leading to a few breaking changes, predominantly in various model classes 
    +(see [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) for a full list).
    +
    +**Note:** the RDD-based APIs in `spark.mllib` continue to depend on the previous package `spark.mllib.linalg`.
    +
    +_Converting vectors and matrices_
    +
    +While most pipeline components support backward compatibility for loading, 
    +some existing `DataFrames` and pipelines in Spark versions prior to 2.0, that contain vector or matrix 
    +columns, may need to be migrated to the new `spark.ml` vector and matrix types. 
    +Utilities for converting `DataFrame` columns from `spark.mllib.linalg` to `spark.ml.linalg` types
    +(and vice versa) can be found in `spark.mllib.util.MLUtils`.
    +
    +There are also utility methods available for converting single instances of 
    +vectors and matrices. Use the `asML` method on a `mllib.linalg.Vector` / `mllib.linalg.Matrix`
    +for converting to `ml.linalg` types, and 
    +`mllib.linalg.Vectors.fromML` / `mllib.linalg.Matrices.fromML` 
    +for converting to `mllib.linalg` types.
    +
    +<div class="codetabs">
    +<div data-lang="scala"  markdown="1">
    +
    +{% highlight scala %}
    +import org.apache.spark.mllib.util.MLUtils
    +
    +// convert DataFrame columns
    +val convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +val convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    +// convert a single vector or matrix
    +val mlVec: org.apache.spark.ml.linalg.Vector = mllibVec.asML
    +val mlMat: org.apache.spark.ml.linalg.Matrix = mllibMat.asML
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Scala docs](api/scala/index.html#org.apache.spark.mllib.util.MLUtils$) for further detail.
    +</div>
    +
    +<div data-lang="java" markdown="1">
    +
    +{% highlight java %}
    +import org.apache.spark.mllib.util.MLUtils;
    +import org.apache.spark.sql.Dataset;
    +import org.apache.spark.sql.Row;
    +
    +// convert DataFrame columns
    +Dataset<Row> convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF);
    +Dataset<Row> convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF);
    +// convert a single vector or matrix
    +org.apache.spark.ml.linalg.Vector mlVec = mllibVec.asML();
    +org.apache.spark.ml.linalg.Matrix mlMat = mllibMat.asML();
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Java docs](api/java/org/apache/spark/mllib/util/MLUtils.html) for further detail.
    +</div>
    +
    +<div data-lang="python"  markdown="1">
    +
    +{% highlight python %}
    +from pyspark.mllib.util import MLUtils
    +
    +# convert DataFrame columns
    +convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    --- End diff --
    
    @jkbradley Ah sorry - I misspoke. `Vectors.dense(oldVec)` happens to work for dense vectors because it effectively calls `np.array(DenseVector)`, but it does not work for sparse vectors. The workaround is fairly ugly: `mlSV = NewVectors.sparse(mllibSV.size, zip(mllibSV.indices, mllibSV.values))`, or something similar.
    
    I'd say we should have some convenience methods like in Scala/Java?
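
For illustration only, the workaround above could be wrapped in a small helper along the following lines. This is a minimal sketch, not code from the PR: the name `mllib_to_ml` and its `factory` parameter are hypothetical, and it assumes the old vector exposes `size`, `indices` and `values` (sparse) or `toArray()` (dense), as the `pyspark.mllib.linalg` vector classes do.

```python
def mllib_to_ml(vec, factory=None):
    """Convert a single mllib-style vector to its ml.linalg counterpart.

    `factory` is a hypothetical parameter: it defaults to
    pyspark.ml.linalg.Vectors and exists only so the helper can be
    exercised without a Spark installation.
    """
    if factory is None:
        # Imported lazily, so defining the helper does not require PySpark.
        from pyspark.ml.linalg import Vectors as factory
    if hasattr(vec, "indices") and hasattr(vec, "values"):
        # Sparse case: rebuild the vector from (index, value) pairs.
        pairs = [(int(i), float(v)) for i, v in zip(vec.indices, vec.values)]
        return factory.sparse(vec.size, pairs)
    # Dense case: hand the underlying values to the dense constructor.
    return factory.dense(vec.toArray())
```

With PySpark available, `mllib_to_ml(mllibSV)` would then stand in for the inline `zip` expression. Built-in conversion methods on the Python linalg classes, as in Scala/Java, would make such a helper unnecessary.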
    





[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61408/
    Test PASSed.




[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r68866028
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,87 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `spark-mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were moved to a new package, `spark.ml.linalg`. 
    +The DataFrame-based APIs in `spark.ml` now depend on the `spark.ml.linalg` classes, 
    +leading to a few breaking changes, predominantly in various model classes 
    +(see [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) for a full list).
    +
    +**Note:** the RDD-based APIs in `spark.mllib` continue to depend on the previous package `spark.mllib.linalg`.
    +
    +_Converting vectors and matrices_
    --- End diff --
    
    Could you also please note the public conversion methods `asML` and `fromML` within the `spark.mllib.linalg` Vector and Matrix types and subtypes, for converting single instances?




[GitHub] spark pull request #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13924#discussion_r69134094
  
    --- Diff: docs/mllib-guide.md ---
    @@ -104,9 +104,102 @@ and the migration guide below will explain all changes between releases.
     
     ## From 1.6 to 2.0
     
    -The deprecations and changes of behavior in the `spark.mllib` or `spark.ml` packages include:
    +### Breaking changes
     
    -Deprecations:
    +There were several breaking changes in Spark 2.0, which are outlined below.
    +
    +**Linear algebra classes for DataFrame-based APIs**
    +
    +Spark's linear algebra dependencies were moved to a new project, `mllib-local` 
    +(see [SPARK-13944](https://issues.apache.org/jira/browse/SPARK-13944)). 
    +As part of this change, the linear algebra classes were copied to a new package, `spark.ml.linalg`. 
    +The DataFrame-based APIs in `spark.ml` now depend on the `spark.ml.linalg` classes, 
    +leading to a few breaking changes, predominantly in various model classes 
    +(see [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) for a full list).
    +
    +**Note:** the RDD-based APIs in `spark.mllib` continue to depend on the previous package `spark.mllib.linalg`.
    +
    +_Converting vectors and matrices_
    +
    +While most pipeline components support backward compatibility for loading, 
    +some existing `DataFrames` and pipelines in Spark versions prior to 2.0, that contain vector or matrix 
    +columns, may need to be migrated to the new `spark.ml` vector and matrix types. 
    +Utilities for converting `DataFrame` columns from `spark.mllib.linalg` to `spark.ml.linalg` types
    +(and vice versa) can be found in `spark.mllib.util.MLUtils`.
    +
    +There are also utility methods available for converting single instances of 
    +vectors and matrices. Use the `asML` method on a `mllib.linalg.Vector` / `mllib.linalg.Matrix`
    +for converting to `ml.linalg` types, and 
    +`mllib.linalg.Vectors.fromML` / `mllib.linalg.Matrices.fromML` 
    +for converting to `mllib.linalg` types.
    +
    +<div class="codetabs">
    +<div data-lang="scala"  markdown="1">
    +
    +{% highlight scala %}
    +import org.apache.spark.mllib.util.MLUtils
    +
    +// convert DataFrame columns
    +val convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +val convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    +// convert a single vector or matrix
    +val mlVec: org.apache.spark.ml.linalg.Vector = mllibVec.asML
    +val mlMat: org.apache.spark.ml.linalg.Matrix = mllibMat.asML
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Scala docs](api/scala/index.html#org.apache.spark.mllib.util.MLUtils$) for further detail.
    +</div>
    +
    +<div data-lang="java" markdown="1">
    +
    +{% highlight java %}
    +import org.apache.spark.mllib.util.MLUtils;
    +import org.apache.spark.sql.Dataset;
    +import org.apache.spark.sql.Row;
    +
    +// convert DataFrame columns
    +Dataset<Row> convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF);
    +Dataset<Row> convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF);
    +// convert a single vector or matrix
    +org.apache.spark.ml.linalg.Vector mlVec = mllibVec.asML();
    +org.apache.spark.ml.linalg.Matrix mlMat = mllibMat.asML();
    +{% endhighlight %}
    +
    +Refer to the [`MLUtils` Java docs](api/java/org/apache/spark/mllib/util/MLUtils.html) for further detail.
    +</div>
    +
    +<div data-lang="python"  markdown="1">
    +
    +{% highlight python %}
    +from pyspark.mllib.util import MLUtils
    +
    +# convert DataFrame columns
    +convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
    +convertedMatrixDF = MLUtils.convertMatrixColumnsToML(matrixDF)
    --- End diff --
    
    Created [SPARK-16328](https://issues.apache.org/jira/browse/SPARK-16328).




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by jkbradley <gi...@git.apache.org>.
Github user jkbradley commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    Done with review pass.  Thanks for the PR!




[GitHub] spark issue #13924: [SPARK-15643][DOC][ML] Add breaking changes to ML migrat...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/13924
  
    **[Test build #61464 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61464/consoleFull)** for PR 13924 at commit [`ac49f31`](https://github.com/apache/spark/commit/ac49f31cd83aa2755e0f1948f981beee66a16527).

