Posted to reviews@spark.apache.org by zsxwing <gi...@git.apache.org> on 2015/12/23 20:14:04 UTC

[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Update Strea...

GitHub user zsxwing opened a pull request:

    https://github.com/apache/spark/pull/10453

    [SPARK-12507][Streaming][Document]Update Streaming configurations for 1.6

    /cc @tdas @brkyvz 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zsxwing/spark streaming-conf

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/10453.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #10453
    
----
commit 42951371259cc1ef1dd39f1e6a2ebb5867326704
Author: Shixiong Zhu <sh...@databricks.com>
Date:   2015-12-23T19:11:56Z

    Update Streaming configurations for 1.6

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48559015
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,33 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in receivers. Because S3
    --- End diff --
    
    same thing here: `on the receivers` instead of `in receivers`
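
For context, a minimal Scala sketch of how the two properties shown in this diff would be set when checkpointing to S3; it is not part of the patch, and the application name, master, and checkpoint path are placeholders:

    import org.apache.spark.SparkConf
    import org.apache.spark.streaming.{Seconds, StreamingContext}

    // Close the WAL file after every write on both the driver and the receivers,
    // as the documentation above recommends for S3-backed checkpointing.
    val conf = new SparkConf()
      .setAppName("S3CheckpointedApp")   // placeholder name
      .setMaster("local[2]")             // placeholder master; receivers need >= 2 cores
      .set("spark.streaming.driver.writeAheadLog.closeFileAfterWrite", "true")
      .set("spark.streaming.receiver.writeAheadLog.closeFileAfterWrite", "true")

    val ssc = new StreamingContext(conf, Seconds(10))
    ssc.checkpoint("s3a://my-bucket/checkpoints")   // placeholder bucket/path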



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169837009
  
    Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169817911
  
    **[Test build #48971 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48971/consoleFull)** for PR 10453 at commit [`4d55b03`](https://github.com/apache/spark/commit/4d55b03af0c6cfb73833c8fe86fb7bf97f7c2c38).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by zsxwing <gi...@git.apache.org>.
Github user zsxwing commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48559444
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,33 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in receivers. Because S3
    +    doesn't support flushing of data, when using S3 for checkpointing, you should enable it to
    +    achieve read after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.allowBatching</code></td>
    +  <td>false</td>
    --- End diff --
    
    for me: the default value is `true`.
    
    That's why I want to expose this one since the behavior is different from 1.5.0.
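
As a side note (not part of the patch), an application that wants the pre-1.6 write path back would have to turn batching off explicitly, since the shipped default is `true` as noted above; a minimal Scala sketch:

    import org.apache.spark.SparkConf

    // Opt out of driver-side WAL batching, restoring the 1.5.x behavior.
    val conf = new SparkConf()
      .set("spark.streaming.driver.writeAheadLog.allowBatching", "false")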



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48559275
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,33 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in receivers. Because S3
    +    doesn't support flushing of data, when using S3 for checkpointing, you should enable it to
    +    achieve read after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.allowBatching</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to batch write ahead logs in driver to write. When using S3 for checkpointing, write
    +    operations in driver usually take too long. Enable batching write ahead logs will improve
    +    the performance of writing.
    --- End diff --
    
    I'd say `will improve the performance of write operations`



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169836865
  
    **[Test build #48980 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48980/consoleFull)** for PR 10453 at commit [`28a750d`](https://github.com/apache/spark/commit/28a750d61c058e537a8ca44babb3ff0f4b54f3b3).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-168094523
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48516/
    Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Update Strea...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167752921
  
    Let's improve the title of items like this. "Update x" is never descriptive



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r49127139
  
    --- Diff: docs/streaming-programming-guide.md ---
    @@ -2029,6 +2029,11 @@ If the data is being received by the receivers faster than what can be processed
     you can limit the rate by setting the [configuration parameter](configuration.html#spark-streaming)
    --- End diff --
    
    Can you remove this section completely? 
    - Remove the rate stuff as it has already been covered earlier. 
    - Put the closeFileAfterWrite stuff in the WAL sections above. Keeps things in context.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48559339
  
    --- Diff: docs/streaming-programming-guide.md ---
    @@ -2029,6 +2029,11 @@ If the data is being received by the receivers faster than what can be processed
     you can limit the rate by setting the [configuration parameter](configuration.html#spark-streaming)
     `spark.streaming.receiver.maxRate`.
     
    +If using S3 for checkpointing, please remember to enable `spark.streaming.driver.writeAheadLog.closeFileAfterWrite`
    +and `spark.streaming.receiver.writeAheadLog.closeFileAfterWrite`. You can also enable
    +`spark.streaming.driver.writeAheadLog.allowBatching` to improve the performance of writing write
    +ahead logs in driver. See [Spark Streaming Configuration](configuration.html#spark-streaming) or more details.
    --- End diff --
    
    `on the driver` and `for more details`



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by brkyvz <gi...@git.apache.org>.
Github user brkyvz commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167973270
  
    LGTM



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r49137355
  
    --- Diff: docs/streaming-programming-guide.md ---
    @@ -1985,7 +1985,11 @@ To run a Spark Streaming applications, you need to have the following.
       to increase aggregate throughput. Additionally, it is recommended that the replication of the
       received data within Spark be disabled when the write ahead log is enabled as the log is already
       stored in a replicated storage system. This can be done by setting the storage level for the
    -  input stream to `StorageLevel.MEMORY_AND_DISK_SER`.
    +  input stream to `StorageLevel.MEMORY_AND_DISK_SER`. While using S3 (or any file system that
    +  does not support flushing) for Write Ahead Logs, please remember to enable
    --- End diff --
    
    nit: Write Ahead Logs is not in caps in this text. so please be consistent.
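
For context, a minimal Scala sketch of the setup described in this diff (write ahead log enabled, replication of received data turned off via a non-replicated storage level); it is not part of the patch, and the host, port, and application name are placeholders:

    import org.apache.spark.SparkConf
    import org.apache.spark.storage.StorageLevel
    import org.apache.spark.streaming.{Seconds, StreamingContext}

    val conf = new SparkConf()
      .setAppName("WALExample")   // placeholder name
      .set("spark.streaming.receiver.writeAheadLog.enable", "true")
      // On S3, or any other file system that does not support flushing,
      // also enable the closeFileAfterWrite properties discussed in this PR.
      .set("spark.streaming.receiver.writeAheadLog.closeFileAfterWrite", "true")

    val ssc = new StreamingContext(conf, Seconds(10))

    // MEMORY_AND_DISK_SER keeps a single serialized copy; durability comes
    // from the write ahead log stored in the replicated file system.
    val lines = ssc.socketTextStream("localhost", 9999, StorageLevel.MEMORY_AND_DISK_SER)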



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169818271
  
    Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r49125309
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,34 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record on the driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record on the receivers. Because S3
    --- End diff --
    
    Because file systems like S3



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-168092271
  
    **[Test build #48516 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48516/consoleFull)** for PR 10453 at commit [`7d9b038`](https://github.com/apache/spark/commit/7d9b0389ddc2b03b259f9f2fa6b657b93cd5f3ea).



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r49127420
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,34 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record on the driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record on the receivers. Because S3
    +    doesn't support flushing of data, when using S3 for checkpointing, you should enable it to
    +    achieve read after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.allowBatching</code></td>
    +  <td>true</td>
    +  <td>
    +    Whether to batch write ahead logs on the driver to write. When using S3 for checkpointing, write
    +    operations on the driver usually take too long. Enabling batching write ahead logs will improve
    --- End diff --
    
    Whether to batch writes to the metadata WAL in the driver. This is useful for improving performance for file systems like S3 where the write latency is high, and/or in scenarios with a large number of receivers.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169837010
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48980/
    Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169831376
  
    just one more comment. then LGTM.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169818273
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48971/
    Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167903860
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48436/
    Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/10453



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by zsxwing <gi...@git.apache.org>.
Github user zsxwing commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167902303
  
    @BenFradet Addressed. Thanks for your review.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169813237
  
    **[Test build #48971 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48971/consoleFull)** for PR 10453 at commit [`4d55b03`](https://github.com/apache/spark/commit/4d55b03af0c6cfb73833c8fe86fb7bf97f7c2c38).



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167903857
  
    Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by brkyvz <gi...@git.apache.org>.
Github user brkyvz commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167973342
  
    Maybe we can also include that `allowBatching` is not just helpful when `closeFileAfterWrite` is enabled, but is also very helpful to scale to a large number of receivers 50+ for example.
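
A hedged Scala sketch of the many-receiver scenario mentioned here; the stream count, socket source, and port range are illustrative only:

    import org.apache.spark.SparkConf
    import org.apache.spark.streaming.{Seconds, StreamingContext}
    import org.apache.spark.streaming.dstream.DStream

    val conf = new SparkConf()
      .setAppName("ManyReceivers")   // placeholder name
      .set("spark.streaming.receiver.writeAheadLog.enable", "true")
    val ssc = new StreamingContext(conf, Seconds(10))
    ssc.checkpoint("s3a://my-bucket/checkpoints")   // placeholder; driver-side log lives under the checkpoint dir

    // With the driver WAL enabled, metadata for every received block is logged
    // on the driver; batching those writes (allowBatching, default true in 1.6)
    // helps the driver keep up when there are many receivers.
    val numReceivers = 50
    val streams = (0 until numReceivers).map(i => ssc.socketTextStream("localhost", 9000 + i))
    val unified: DStream[String] = ssc.union(streams)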



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r49125847
  
    --- Diff: docs/streaming-programming-guide.md ---
    @@ -2029,6 +2029,11 @@ If the data is being received by the receivers faster than what can be processed
     you can limit the rate by setting the [configuration parameter](configuration.html#spark-streaming)
     `spark.streaming.receiver.maxRate`.
     
    +If using S3 for checkpointing, please remember to enable `spark.streaming.driver.writeAheadLog.closeFileAfterWrite`
    --- End diff --
    
    "If" --> "while"
    checkpointing --> Write Ahead Logs



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48558980
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,33 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
    --- End diff --
    
    I'd say `on the driver` instead of `in driver`.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-168094522
  
    Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167848381
  
    I have a few comments on phrasing but otherwise it lgtm



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48559086
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,33 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in receivers. Because S3
    +    doesn't support flushing of data, when using S3 for checkpointing, you should enable it to
    +    achieve read after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.allowBatching</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to batch write ahead logs in driver to write. When using S3 for checkpointing, write
    --- End diff --
    
    Here, I'd say `on the driver` instead of `in driver to write`.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169862067
  
    LGTM. Merging this to master and 1.6. Thanks!



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by brkyvz <gi...@git.apache.org>.
Github user brkyvz commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-168194624
  
    @zsxwing Thanks! LGTM



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by zsxwing <gi...@git.apache.org>.
Github user zsxwing commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-168091405
  
    > Maybe we can also include that allowBatching is not just helpful when closeFileAfterWrite is enabled, but is also very helpful to scale to a large number of receivers 50+ for example.
    
    I guess `enabled` should be `disabled`?



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Update Strea...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-166976157
  
    **[Test build #48251 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48251/consoleFull)** for PR 10453 at commit [`4295137`](https://github.com/apache/spark/commit/42951371259cc1ef1dd39f1e6a2ebb5867326704).



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Update Strea...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-166978301
  
    **[Test build #48251 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48251/consoleFull)** for PR 10453 at commit [`4295137`](https://github.com/apache/spark/commit/42951371259cc1ef1dd39f1e6a2ebb5867326704).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-169833334
  
    **[Test build #48980 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48980/consoleFull)** for PR 10453 at commit [`28a750d`](https://github.com/apache/spark/commit/28a750d61c058e537a8ca44babb3ff0f4b54f3b3).



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167902755
  
    **[Test build #48436 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48436/consoleFull)** for PR 10453 at commit [`bce7a29`](https://github.com/apache/spark/commit/bce7a29de2966024103258031eeecb369e6d45b4).



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by tdas <gi...@git.apache.org>.
Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r49125736
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,34 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record on the driver. Because S3 doesn't
    --- End diff --
    
    Rewrite similar to below, replacing driver with receiver.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by BenFradet <gi...@git.apache.org>.
Github user BenFradet commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10453#discussion_r48559209
  
    --- Diff: docs/configuration.md ---
    @@ -1600,6 +1600,33 @@ Apart from these, the following properties are also available, and may be useful
         How many batches the Spark Streaming UI and status APIs remember before garbage collecting.
       </td>
     </tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
    +    support flushing of data, when using S3 for checkpointing, you should enable it to achieve read
    +    after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to close the file after writing a write ahead log record in receivers. Because S3
    +    doesn't support flushing of data, when using S3 for checkpointing, you should enable it to
    +    achieve read after write consistency.
    +  </td>
    +</tr>
    +<tr>
    +  <td><code>spark.streaming.driver.writeAheadLog.allowBatching</code></td>
    +  <td>false</td>
    +  <td>
    +    Whether to batch write ahead logs in driver to write. When using S3 for checkpointing, write
    +    operations in driver usually take too long. Enable batching write ahead logs will improve
    --- End diff --
    
    same: `on the`
    and `Enabling` instead of `Enable`



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-167903798
  
    **[Test build #48436 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48436/consoleFull)** for PR 10453 at commit [`bce7a29`](https://github.com/apache/spark/commit/bce7a29de2966024103258031eeecb369e6d45b4).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Update Strea...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-166978434
  
    Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Update Strea...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-166978436
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48251/
    Test PASSed.



[GitHub] spark pull request: [SPARK-12507][Streaming][Document]Expose close...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10453#issuecomment-168094430
  
    **[Test build #48516 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48516/consoleFull)** for PR 10453 at commit [`7d9b038`](https://github.com/apache/spark/commit/7d9b0389ddc2b03b259f9f2fa6b657b93cd5f3ea).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.

