Posted to reviews@spark.apache.org by squito <gi...@git.apache.org> on 2018/04/17 14:03:46 UTC

[GitHub] spark pull request #21085: [SPARK-23948] Trigger mapstage's job listener in ...

GitHub user squito opened a pull request:

    https://github.com/apache/spark/pull/21085

    [SPARK-23948] Trigger mapstage's job listener in submitMissingTasks

    ## What changes were proposed in this pull request?
    
    When `SparkContext` submits a map stage to `DAGScheduler` through `submitMapStage`,
    `markMapStageJobAsFinished` is called in only two places (https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala#L933 and https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala#L1314).
    
    But consider the following scenario:
    1. stage0 and stage1 are both `ShuffleMapStage`s, and stage1 depends on stage0;
    2. We submit stage1 via `submitMapStage`;
    3. While stage1 is running, a `FetchFailed` occurs, and stage0 and stage1 are resubmitted as stage0_1 and stage1_1;
    4. While stage0_1 is running, speculative tasks from the old stage1 attempt complete successfully, but stage1 is not in `runningStages`. So even though all splits in stage1 (including those run by the speculative tasks) have succeeded, the job listener for stage1 is not called;
    5. stage0_1 finishes and stage1_1 starts running. In `submitMissingTasks` there are no missing tasks, but in the current code the job listener is not triggered.
    
    We should call the job listener for the map stage in step `5`.
    
    ## How was this patch tested?
    
    Not added yet.
    
    Author: jinxing <ji...@126.com>
    
    Closes #21019 from jinxing64/SPARK-23948.
    
    (cherry picked from commit 3990daaf3b6ca2c5a9f7790030096262efb12cb2)
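
The five-step scenario in the description can be replayed with a small toy model. This is only a sketch under stated assumptions: the class and method names (`MapStageJob`, `ToyMapStage`, `replay`, `fix_enabled`) are hypothetical, it is written in Python rather than Scala, and it does not reproduce Spark's `DAGScheduler` code. It models just one idea from the description: a task success notifies the job listener only while the stage is considered running, so a resubmission that finds no missing tasks has to notify the listener itself.

```python
from dataclasses import dataclass, field


@dataclass
class MapStageJob:
    """Hypothetical stand-in for the job listener of a map-stage job."""
    finished: bool = False

    def mark_finished(self) -> None:
        self.finished = True


@dataclass
class ToyMapStage:
    """Toy model of a single shuffle map stage; not Spark's DAGScheduler."""
    num_partitions: int
    job: MapStageJob
    fix_enabled: bool  # hypothetical switch: notify the listener in step 5
    running: bool = False
    done: set = field(default_factory=set)

    def missing(self) -> list:
        # Partitions that have no registered output yet.
        return [p for p in range(self.num_partitions) if p not in self.done]

    def task_succeeded(self, partition: int) -> None:
        # Output is registered even when the task is a late speculative one.
        self.done.add(partition)
        # The listener is notified only while the stage counts as running.
        if self.running and not self.missing():
            self.job.mark_finished()
            self.running = False

    def fetch_failed(self) -> None:
        # The stage leaves the running set until it is resubmitted.
        self.running = False

    def submit_missing_tasks(self) -> list:
        self.running = True
        tasks = self.missing()
        if not tasks:
            # Nothing left to run, so the stage is complete immediately.
            self.running = False
            if self.fix_enabled:
                self.job.mark_finished()
        return tasks


def replay(fix_enabled: bool) -> bool:
    """Replay steps 2 to 5 and report whether the listener was notified."""
    job = MapStageJob()
    stage1 = ToyMapStage(num_partitions=2, job=job, fix_enabled=fix_enabled)
    stage1.submit_missing_tasks()  # step 2: stage1 is submitted
    stage1.task_succeeded(0)       # one partition finishes normally
    stage1.fetch_failed()          # step 3: FetchFailed, stage1 stops running
    stage1.task_succeeded(1)       # step 4: late speculative task succeeds
    stage1.submit_missing_tasks()  # step 5: resubmission finds nothing missing
    return job.finished
```

In this model, `replay(fix_enabled=False)` leaves the listener unnotified even though every partition has output, while `replay(fix_enabled=True)` notifies it during the resubmission in step 5.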

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/squito/spark cp

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21085.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21085
    
----
commit 35e349f402ffd83a4eae31ffb848cd400595d9f7
Author: jinxing <ji...@...>
Date:   2018-04-17T13:55:01Z

    [SPARK-23948] Trigger mapstage's job listener in submitMissingTasks

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    **[Test build #89457 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89457/testReport)** for PR 21085 at commit [`35e349f`](https://github.com/apache/spark/commit/35e349f402ffd83a4eae31ffb848cd400595d9f7).


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by squito <gi...@git.apache.org>.
Github user squito commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    known flaky test https://issues.apache.org/jira/browse/SPARK-23894
    
    merging to branch 2.3
    



---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by jiangxb1987 <gi...@git.apache.org>.
Github user jiangxb1987 commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    Should we manually close this PR? @squito 


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by jiangxb1987 <gi...@git.apache.org>.
Github user jiangxb1987 commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    LGTM!


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/89457/
    Test FAILed.


---



[GitHub] spark pull request #21085: [SPARK-23948] Trigger mapstage's job listener in ...

Posted by squito <gi...@git.apache.org>.
Github user squito closed the pull request at:

    https://github.com/apache/spark/pull/21085


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by squito <gi...@git.apache.org>.
Github user squito commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    clean cherry-pick of https://github.com/apache/spark/pull/21019, I just forgot to merge back to 2.3


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/2393/
    Test PASSed.


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    Merged build finished. Test FAILed.


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    **[Test build #89457 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89457/testReport)** for PR 21085 at commit [`35e349f`](https://github.com/apache/spark/commit/35e349f402ffd83a4eae31ffb848cd400595d9f7).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #21085: [SPARK-23948] Trigger mapstage's job listener in submitM...

Posted by squito <gi...@git.apache.org>.
Github user squito commented on the issue:

    https://github.com/apache/spark/pull/21085
  
    whoops, thanks for the reminder @jiangxb1987 


---
