You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by guoxiaolongzte <gi...@git.apache.org> on 2017/03/31 13:59:55 UTC

[GitHub] spark pull request #17498: [SPARK-20177]Document about compression way has s...

GitHub user guoxiaolongzte opened a pull request:

    https://github.com/apache/spark/pull/17498

    [SPARK-20177]Document about compression way has some little detail ch\u2026

    \u2026anges.
    
    ## What changes were proposed in this pull request?
    
    Document compression way little detail changes.
    1.spark.eventLog.compress add 'Compression will use spark.io.compression.codec.'
    2.spark.broadcast.compress add 'Compression will use spark.io.compression.codec.'
    3,spark.rdd.compress add 'Compression will use spark.io.compression.codec.'
    4.spark.io.compression.codec add 'event log describe'
    
    ## How was this patch tested?
    
    manual tests
    
    Please review http://spark.apache.org/contributing.html before opening a pull request.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/guoxiaolongzte/spark SPARK-20177

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17498.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17498
    
----
commit d383efba12c66addb17006dea107bb0421d50bc3
Author: \u90ed\u5c0f\u9f99 10207633 <gu...@zte.com.cn>
Date:   2017-03-31T13:57:09Z

    [SPARK-20177]Document about compression way has some little detail changes.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by guoxiaolongzte <gi...@git.apache.org>.
Github user guoxiaolongzte commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    @jerryshao 
    This is just an optimization suggestion.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17498: [SPARK-20177]Document about compression way has s...

Posted by guoxiaolongzte <gi...@git.apache.org>.
Github user guoxiaolongzte commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17498#discussion_r109172977
  
    --- Diff: docs/configuration.md ---
    @@ -773,14 +774,15 @@ Apart from these, the following properties are also available, and may be useful
       <td>true</td>
       <td>
         Whether to compress broadcast variables before sending them. Generally a good idea.
    +    Compression will use <code>spark.io.compression.codec</code>.
       </td>
     </tr>
     <tr>
       <td><code>spark.io.compression.codec</code></td>
       <td>lz4</td>
       <td>
    -    The codec used to compress internal data such as RDD partitions, broadcast variables and
    -    shuffle outputs. By default, Spark provides three codecs: <code>lz4</code>, <code>lzf</code>,
    +    The codec used to compress internal data such as RDD partitions,event log, broadcast variables
    --- End diff --
    
    I asked a few people, they all don't know, the event log compression methods is spark.io.com pression. Codec.And a few other places it is necessary to modify, more friendly and clear.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by guoxiaolongzte <gi...@git.apache.org>.
Github user guoxiaolongzte commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    @srowen @jerryshao 
    If a spark application developer, using event compress, from the spark official document, will not see the use of spark.io.compression.codec is specified compression description.
    The current problem I think is how to change the document.
    I mentioned above to amend the point, which can be a few changes to merge to the master


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by jerryshao <gi...@git.apache.org>.
Github user jerryshao commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    IMHO I thought this is still a unnecessary fix. I would doubt if user really get confused without your fix? You can always correct me since I stand on the of developers :).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    Merged to master


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    **[Test build #3632 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3632/testReport)** for PR 17498 at commit [`3059013`](https://github.com/apache/spark/commit/3059013e9d2aec76def14eb314b6761bea0e7ca0).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by guoxiaolongzte <gi...@git.apache.org>.
Github user guoxiaolongzte commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    @srowen  i add a space


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17498: [SPARK-20177]Document about compression way has s...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17498#discussion_r109167234
  
    --- Diff: docs/configuration.md ---
    @@ -773,14 +774,15 @@ Apart from these, the following properties are also available, and may be useful
       <td>true</td>
       <td>
         Whether to compress broadcast variables before sending them. Generally a good idea.
    +    Compression will use <code>spark.io.compression.codec</code>.
       </td>
     </tr>
     <tr>
       <td><code>spark.io.compression.codec</code></td>
       <td>lz4</td>
       <td>
    -    The codec used to compress internal data such as RDD partitions, broadcast variables and
    -    shuffle outputs. By default, Spark provides three codecs: <code>lz4</code>, <code>lzf</code>,
    +    The codec used to compress internal data such as RDD partitions,event log, broadcast variables
    --- End diff --
    
    (missing a space)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #17498: [SPARK-20177]Document about compression way has some lit...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/17498
  
    **[Test build #3632 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3632/testReport)** for PR 17498 at commit [`3059013`](https://github.com/apache/spark/commit/3059013e9d2aec76def14eb314b6761bea0e7ca0).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17498: [SPARK-20177]Document about compression way has s...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17498#discussion_r109174272
  
    --- Diff: docs/configuration.md ---
    @@ -773,14 +774,15 @@ Apart from these, the following properties are also available, and may be useful
       <td>true</td>
       <td>
         Whether to compress broadcast variables before sending them. Generally a good idea.
    +    Compression will use <code>spark.io.compression.codec</code>.
       </td>
     </tr>
     <tr>
       <td><code>spark.io.compression.codec</code></td>
       <td>lz4</td>
       <td>
    -    The codec used to compress internal data such as RDD partitions, broadcast variables and
    -    shuffle outputs. By default, Spark provides three codecs: <code>lz4</code>, <code>lzf</code>,
    +    The codec used to compress internal data such as RDD partitions,event log, broadcast variables
    --- End diff --
    
    I'm not sure what you're replying to, but I'm asking you to add the missing space in "partitions,event"


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #17498: [SPARK-20177]Document about compression way has s...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/17498


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org