Posted to reviews@spark.apache.org by zouzias <gi...@git.apache.org> on 2018/11/21 11:03:33 UTC

[GitHub] spark pull request #23103: [SPARK-26121] init commit

GitHub user zouzias opened a pull request:

    https://github.com/apache/spark/pull/23103

    [SPARK-26121] init commit

    ## What changes were proposed in this pull request?
    
    Allow Spark Structured Streaming users to specify the prefix of the consumer group (group.id), instead of forcing consumer group ids of the form `spark-kafka-source-*`.
    
    ## How was this patch tested?
    
    Unit tests provided by Spark (backwards-compatible change, i.e., users can optionally use the functionality).
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/Sqooba/spark SPARK-26121

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/23103.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #23103
    
----
commit 4afd3a4dd0794348be1462b896f20de4e344e5ce
Author: Anastasios Zouzias <an...@...>
Date:   2018-11-21T10:24:53Z

    [SPARK-26121] init commit

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99124/
    Test FAILed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99167 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99167/testReport)** for PR 23103 at commit [`39424dd`](https://github.com/apache/spark/commit/39424dd4ed796394f535e93186681fbd8f03a6f4).
     * This patch **fails due to an unknown error code, -9**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by zouzias <gi...@git.apache.org>.
Github user zouzias commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235478048
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    +                                     metadataPath: String): String = {
    +    val groupIdPrefix = parameters
    +      .getOrElse("group.id.prefix", "spark-kafka-source")
    --- End diff --
    
    I named the property `kafka.groupIdPrefix` for now. Let me know if you have a better name suggestion.
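
For readers following the diff above, here is a minimal sketch of how a prefix-based group id could be assembled. The quoted hunk is cut off before the return value, so the suffix used here (a random UUID plus the hash of the metadata path) is an assumption, as is the `GroupIdSketch` wrapper object. The option key is the one shown in the diff; its final name was still under discussion at this point.

```scala
import java.util.UUID

object GroupIdSketch {
  // Option key as it appears in the quoted diff; the name was still being discussed.
  val GroupIdPrefixKey = "group.id.prefix"
  // Default prefix from the diff, which keeps the previous behaviour when the option is unset.
  val DefaultPrefix = "spark-kafka-source"

  def streamingUniqueGroupId(
      parameters: Map[String, String],
      metadataPath: String): String = {
    val groupIdPrefix = parameters.getOrElse(GroupIdPrefixKey, DefaultPrefix)
    // Assumed suffix (not visible in the quoted hunk): a random UUID and the
    // metadata path hash keep each generated id distinct.
    s"$groupIdPrefix-${UUID.randomUUID}-${metadataPath.hashCode}"
  }
}
```

With `Map("group.id.prefix" -> "my-app")` the sketch yields an id starting with `my-app-`; with no option set, ids keep the `spark-kafka-source-` form, which is what makes the change backwards compatible.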


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5267/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5282/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99134/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Can one of the admins verify this patch?


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99134 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99134/testReport)** for PR 23103 at commit [`9d9c944`](https://github.com/apache/spark/commit/9d9c944d894eb142a0b78211fd6cb4bd1ac07774).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by tgravescs <gi...@git.apache.org>.
Github user tgravescs commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    ok to test


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99125 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99125/testReport)** for PR 23103 at commit [`9a09e74`](https://github.com/apache/spark/commit/9a09e749ccdc262e7582d97114a89441bf133ee9).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99183 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99183/testReport)** for PR 23103 at commit [`f79ba9f`](https://github.com/apache/spark/commit/f79ba9f87cd5f50f9df76af9e486527b781223a7).


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/23103


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99167 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99167/testReport)** for PR 23103 at commit [`39424dd`](https://github.com/apache/spark/commit/39424dd4ed796394f535e93186681fbd8f03a6f4).


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by zouzias <gi...@git.apache.org>.
Github user zouzias commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    @koeninger, I will make the doc changes asap. FYI, I plan to change the file `structured-streaming-kafka-integration.md`, which seems the most relevant doc for this diff.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99183 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99183/testReport)** for PR 23103 at commit [`f79ba9f`](https://github.com/apache/spark/commit/f79ba9f87cd5f50f9df76af9e486527b781223a7).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99126 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99126/testReport)** for PR 23103 at commit [`cac7976`](https://github.com/apache/spark/commit/cac7976d807fc55e94098c60d185c81ae3f3cf49).


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99219/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99125/
    Test FAILed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99167/
    Test FAILed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test FAILed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5241/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99126/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test FAILed.


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by koeninger <gi...@git.apache.org>.
Github user koeninger commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235479872
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    +                                     metadataPath: String): String = {
    +    val groupIdPrefix = parameters
    +      .getOrElse("group.id.prefix", "spark-kafka-source")
    --- End diff --
    
    kafka.* is reserved for the existing kafka project's client configs, see e.g. line 86.  I'd just go with groupIdPrefix
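
The namespace rule described here can be sketched as follows, assuming (this is not shown in the thread) that options starting with `kafka.` are stripped of that prefix and handed to the Kafka client, while everything else stays a source-level option. `OptionNamespaceSketch` and both method names are hypothetical.

```scala
import java.util.Locale

object OptionNamespaceSketch {
  private val KafkaPrefix = "kafka."

  private def isKafkaKey(key: String): Boolean =
    key.toLowerCase(Locale.ROOT).startsWith(KafkaPrefix)

  // Options in the kafka.* namespace, with the prefix removed, as they would
  // be passed on to the Kafka client.
  def kafkaClientParams(options: Map[String, String]): Map[String, String] =
    options.collect {
      case (key, value) if isKafkaKey(key) => key.drop(KafkaPrefix.length) -> value
    }

  // Everything else is interpreted by the source itself, e.g. groupIdPrefix.
  def sourceOptions(options: Map[String, String]): Map[String, String] =
    options.filterNot { case (key, _) => isKafkaKey(key) }
}
```

Under this split, a property named `kafka.groupIdPrefix` would land among the client configs as `groupIdPrefix`, a key the Kafka client does not define, which is one way to read the suggestion to drop the `kafka.` part.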


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by koeninger <gi...@git.apache.org>.
Github user koeninger commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235462394
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    +                                     metadataPath: String): String = {
    +    val groupIdPrefix = parameters
    +      .getOrElse("group.id.prefix", "spark-kafka-source")
    --- End diff --
    
    It seems like convention has been to mostly use camelcase for streaming options that aren't from the existing kafka.blah.whatever configuration namespace... e.g. subscribePattern, startingOffsets, maxOffsetsPerTrigger


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by koeninger <gi...@git.apache.org>.
Github user koeninger commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    @zouzias can you add the new option to docs/structured-streaming-kafka-integration.md as part of this PR?  Instructions for building docs are in docs/README.md , ping me if you need a hand.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test FAILed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by HeartSaVioR <gi...@git.apache.org>.
Github user HeartSaVioR commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    LGTM. Btw, IMHO, the TODOs @zouzias described would be better addressed at once, since documentation is easily forgotten.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5232/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99219 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99219/testReport)** for PR 23103 at commit [`9196623`](https://github.com/apache/spark/commit/9196623d2b1fa29522dcc400e27edccf8fea946a).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99124 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99124/testReport)** for PR 23103 at commit [`4afd3a4`](https://github.com/apache/spark/commit/4afd3a4dd0794348be1462b896f20de4e344e5ce).


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5230/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by tgravescs <gi...@git.apache.org>.
Github user tgravescs commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Lgtm 


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by koeninger <gi...@git.apache.org>.
Github user koeninger commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235480695
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    --- End diff --
    
    The first arg should be on its own line as well.


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by zouzias <gi...@git.apache.org>.
Github user zouzias commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235627877
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    +                                     metadataPath: String): String = {
    +    val groupIdPrefix = parameters
    +      .getOrElse("group.id.prefix", "spark-kafka-source")
    --- End diff --
    
    Makes sense, fixed in 39424dd 


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by koeninger <gi...@git.apache.org>.
Github user koeninger commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    merging to master, thanks @zouzias 


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5231/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by zouzias <gi...@git.apache.org>.
Github user zouzias commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    cc @koeninger 
    
    Preliminary PR, todos:
    
    * Decide on the name of the input parameter for the consumer group prefix, for example `streaming.group_id.prefix`.
    * Document that parameter in the Spark Structured Streaming docs.


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by koeninger <gi...@git.apache.org>.
Github user koeninger commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235461374
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    --- End diff --
    
    Sorry, there isn't an automatic formatter for this, but please use 4-space indentation for multi-line argument lists:
    
    https://github.com/databricks/scala-style-guide#spacing-and-indentation
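
For illustration, a minimal sketch of how the helper from the diff might look with the 4-space indentation applied to its multi-line parameter list. The wrapper object `GroupIdSketch`, the constant `DefaultGroupIdPrefix`, and the UUID-plus-hash suffix are assumptions made for this example and are not taken from the patch; the option key `group.id.prefix` is the one visible in the diff and was still under discussion in this thread.

```scala
import java.util.UUID

// Hypothetical wrapper object, used only so the sketch is self-contained.
object GroupIdSketch {
  // Default prefix, matching the `spark-kafka-source-*` form mentioned in the PR description.
  val DefaultGroupIdPrefix = "spark-kafka-source"

  // Multi-line parameter list: one parameter per line, indented 4 spaces.
  // The method body keeps the usual 2-space indentation.
  def streamingUniqueGroupId(
      parameters: Map[String, String],
      metadataPath: String): String = {
    // Use the user-supplied prefix if present, otherwise fall back to the default.
    val groupIdPrefix = parameters.getOrElse("group.id.prefix", DefaultGroupIdPrefix)
    // Assumption: uniqueness comes from a random UUID plus a hash of the metadata path.
    s"$groupIdPrefix-${UUID.randomUUID}-${metadataPath.hashCode}"
  }
}
```

Calling `GroupIdSketch.streamingUniqueGroupId(Map("group.id.prefix" -> "my-app"), "/tmp/checkpoint")` would yield an id that starts with `my-app-`, while an empty map falls back to the `spark-kafka-source-` prefix.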


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99219 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99219/testReport)** for PR 23103 at commit [`9196623`](https://github.com/apache/spark/commit/9196623d2b1fa29522dcc400e27edccf8fea946a).


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99126 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99126/testReport)** for PR 23103 at commit [`cac7976`](https://github.com/apache/spark/commit/cac7976d807fc55e94098c60d185c81ae3f3cf49).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Can one of the admins verify this patch?


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/99183/
    Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99134 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99134/testReport)** for PR 23103 at commit [`9d9c944`](https://github.com/apache/spark/commit/9d9c944d894eb142a0b78211fd6cb4bd1ac07774).


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99124 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99124/testReport)** for PR 23103 at commit [`4afd3a4`](https://github.com/apache/spark/commit/4afd3a4dd0794348be1462b896f20de4e344e5ce).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    **[Test build #99125 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/99125/testReport)** for PR 23103 at commit [`9a09e74`](https://github.com/apache/spark/commit/9a09e749ccdc262e7582d97114a89441bf133ee9).


---



[GitHub] spark issue #23103: [SPARK-26121] [Structured Streaming] Allow users to defi...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/23103
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/5311/
    Test PASSed.


---



[GitHub] spark pull request #23103: [SPARK-26121] [Structured Streaming] Allow users ...

Posted by zouzias <gi...@git.apache.org>.
Github user zouzias commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23103#discussion_r235478266
  
    --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceProvider.scala ---
    @@ -538,6 +538,17 @@ private[kafka010] object KafkaSourceProvider extends Logging {
           .setIfUnset(ConsumerConfig.RECEIVE_BUFFER_CONFIG, 65536: java.lang.Integer)
           .build()
     
    +  /**
    +   * Returns a unique consumer group (group.id), allowing the user to set the prefix of
    +   * the consumer group
    +   */
    +  private def streamingUniqueGroupId(parameters: Map[String, String],
    --- End diff --
    
    No worries, done.


---
