You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by zhengruifeng <gi...@git.apache.org> on 2016/05/11 13:32:27 UTC

[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

GitHub user zhengruifeng opened a pull request:

    https://github.com/apache/spark/pull/13050

    [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use SparkSession and update indent in examples

    ## What changes were proposed in this pull request?
    1, use `SparkSession` according to [SPARK-15031](https://issues.apache.org/jira/browse/SPARK-15031)
    2, Update indent for `SparkContext` according to [SPARK-15134](https://issues.apache.org/jira/browse/SPARK-15134)
    3, BTW, remove some duplicate space and add missing '.'
    
    
    ## How was this patch tested?
    manual tests

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zhengruifeng/spark use_sparksession

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/13050.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #13050
    
----
commit f1fbc6932391b4046ad4f940e1c05e64f2e2ad2f
Author: Zheng RuiFeng <ru...@foxmail.com>
Date:   2016-05-11T11:09:39Z

    create pr

commit 3cbca5bfe120f51d162683bbc4fde2ed621eb5a4
Author: Zheng RuiFeng <ru...@foxmail.com>
Date:   2016-05-11T13:16:07Z

    update

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218642130
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58431/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by andrewor14 <gi...@git.apache.org>.
Github user andrewor14 commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218660156
  
    LGTM, but let's retest this please just in case. There have been a lot of build breaks related to changes like these lately.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218463656
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by zhengruifeng <gi...@git.apache.org>.
Github user zhengruifeng commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218460211
  
    cc @andrewor14 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218461269
  
    **[Test build #58370 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58370/consoleFull)** for PR 13050 at commit [`3cbca5b`](https://github.com/apache/spark/commit/3cbca5bfe120f51d162683bbc4fde2ed621eb5a4).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218660436
  
    **[Test build #58446 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58446/consoleFull)** for PR 13050 at commit [`f8d51b9`](https://github.com/apache/spark/commit/f8d51b93ccd8362879aaf44b852ae3df4c9f798b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by zhengruifeng <gi...@git.apache.org>.
Github user zhengruifeng commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13050#discussion_r62955620
  
    --- Diff: examples/src/main/python/ml/simple_params_example.py ---
    @@ -18,36 +18,30 @@
     from __future__ import print_function
     
     import pprint
    -import sys
     
    -from pyspark import SparkContext
     from pyspark.ml.classification import LogisticRegression
     from pyspark.mllib.linalg import DenseVector
    -from pyspark.mllib.regression import LabeledPoint
    -from pyspark.sql import SQLContext
    +from pyspark.sql import SparkSession
     
     """
    -A simple example demonstrating ways to specify parameters for Estimators and Transformers.
    +An example demonstrating ways to specify parameters for Estimators and Transformers.
     Run with:
       bin/spark-submit examples/src/main/python/ml/simple_params_example.py
     """
     
     if __name__ == "__main__":
    -    if len(sys.argv) > 1:
    -        print("Usage: simple_params_example", file=sys.stderr)
    -        exit(1)
    -    sc = SparkContext(appName="PythonSimpleParamsExample")
    -    sqlContext = SQLContext(sc)
    +    spark = SparkSession \
    --- End diff --
    
    You are right. `model1.extractParamMap()` and `model2.extractParamMap()` are always empty.
    And it fails with 
    ```
    16/05/12 09:58:48 WARN TaskSetManager: Lost task 2.0 in stage 40.0 (TID 152, localhost): java.lang.IllegalArgumentException: requirement failed: Logistic Regression getThreshold found inconsistent values for threshold (0.5) and thresholds (equivalent to 0.55)
            at scala.Predef$.require(Predef.scala:224)
            at org.apache.spark.ml.classification.LogisticRegressionParams$class.checkThresholdConsistency(LogisticRegression.scala:143)
    ```
    I will revert this change. Thanks!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218661211
  
    **[Test build #58446 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58446/consoleFull)** for PR 13050 at commit [`f8d51b9`](https://github.com/apache/spark/commit/f8d51b93ccd8362879aaf44b852ae3df4c9f798b).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218463503
  
    **[Test build #58370 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58370/consoleFull)** for PR 13050 at commit [`3cbca5b`](https://github.com/apache/spark/commit/3cbca5bfe120f51d162683bbc4fde2ed621eb5a4).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218642073
  
    **[Test build #58431 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58431/consoleFull)** for PR 13050 at commit [`f8d51b9`](https://github.com/apache/spark/commit/f8d51b93ccd8362879aaf44b852ae3df4c9f798b).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/13050


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218641043
  
    **[Test build #58431 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58431/consoleFull)** for PR 13050 at commit [`f8d51b9`](https://github.com/apache/spark/commit/f8d51b93ccd8362879aaf44b852ae3df4c9f798b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218642128
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13050#discussion_r62861788
  
    --- Diff: examples/src/main/python/ml/simple_params_example.py ---
    @@ -18,36 +18,30 @@
     from __future__ import print_function
     
     import pprint
    -import sys
     
    -from pyspark import SparkContext
     from pyspark.ml.classification import LogisticRegression
     from pyspark.mllib.linalg import DenseVector
    -from pyspark.mllib.regression import LabeledPoint
    -from pyspark.sql import SQLContext
    +from pyspark.sql import SparkSession
     
     """
    -A simple example demonstrating ways to specify parameters for Estimators and Transformers.
    +An example demonstrating ways to specify parameters for Estimators and Transformers.
     Run with:
       bin/spark-submit examples/src/main/python/ml/simple_params_example.py
     """
     
     if __name__ == "__main__":
    -    if len(sys.argv) > 1:
    -        print("Usage: simple_params_example", file=sys.stderr)
    -        exit(1)
    -    sc = SparkContext(appName="PythonSimpleParamsExample")
    -    sqlContext = SQLContext(sc)
    +    spark = SparkSession \
    --- End diff --
    
    I wonder this rxample works. I remember this was not fixed in https://github.com/apache/spark/pull/12809 because it does not work.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by andrewor14 <gi...@git.apache.org>.
Github user andrewor14 commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218665542
  
    Merging into master 2.0.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218661260
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58446/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218463658
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58370/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-15031][SPARK-15134][EXAMPLE][DOC] Use S...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13050#issuecomment-218661259
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org