Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2021/02/02 12:24:59 UTC

[GitHub] [spark] AngersZhuuuu opened a new pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

AngersZhuuuu opened a new pull request #31437:
URL: https://github.com/apache/spark/pull/31437


   ### What changes were proposed in this pull request?
   We hit a case where, because of the YARN queue's settings, an app's container is preempted by a higher-priority request from the scheduling framework. YARN then throws `ApplicationAttemptNotFoundException`. An old PR, `https://github.com/apache/spark/pull/10129`, added the behavior that we always stop the app directly, without a retry.
   
   But for some issues, such as queue contention caused by peak usage, we can retry.
   
   Not every case that throws `ApplicationAttemptNotFoundException` rules out a retry, so IMO we should not just catch this exception and stop.
   
   ### Why are the changes needed?
   Make jobs more tolerant of container issues.
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   
   ### How was this patch tested?
   Not needed.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823829023


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/137717/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823816368


   **[Test build #137717 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/137717/testReport)** for PR 31437 at commit [`69abde6`](https://github.com/apache/spark/commit/69abde655c00ff7dc4502bb601d6f695791e5ec0).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771616224


   **[Test build #134779 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134779/testReport)** for PR 31437 at commit [`1aed1ea`](https://github.com/apache/spark/commit/1aed1ea82dac8694dabaf9ad12c7ee4e856bcabb).




[GitHub] [spark] github-actions[bot] closed pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
github-actions[bot] closed pull request #31437:
URL: https://github.com/apache/spark/pull/31437


   




[GitHub] [spark] github-actions[bot] commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-910934317


   We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
   If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!




[GitHub] [spark] AngersZhuuuu edited a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu edited a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-772604648


   > can you provide more detail here.
   > what mode is this running in when you hit this (client or cluster - unmanaged am or managed)?
   
   Our case is client mode; this error is thrown on the driver side.
   So we can change this to:
   
   1. Client mode: for this error, we can retry.
   2. Cluster mode: since the ApplicationMaster runs in the AM container, we can just stop?
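
A minimal sketch of the two-way split proposed in the list above, assuming a bounded retry budget. `DeployMode`, `Action`, `AttemptNotFoundPolicy`, and `maxRetries` are hypothetical names used only for illustration; they are not Spark or YARN APIs, and this is not the PR's actual diff.

```java
// Hypothetical sketch: none of these types exist in Spark or YARN.
enum DeployMode { CLIENT, CLUSTER }

enum Action { RETRY, STOP }

final class AttemptNotFoundPolicy {
    private AttemptNotFoundPolicy() {}

    /**
     * Decides what to do when the RM reports that the application attempt
     * cannot be found. Cluster mode always stops; client mode retries until
     * the (assumed) retry budget is used up.
     */
    static Action onAttemptNotFound(DeployMode mode, int retriesSoFar, int maxRetries) {
        if (mode == DeployMode.CLUSTER) {
            return Action.STOP;
        }
        return retriesSoFar < maxRetries ? Action.RETRY : Action.STOP;
    }

    public static void main(String[] args) {
        System.out.println(onAttemptNotFound(DeployMode.CLIENT, 0, 3));
        System.out.println(onAttemptNotFound(DeployMode.CLUSTER, 0, 3));
    }
}
```

The retry budget is an assumption added here so that a persistent failure still ends the app; the thread itself does not specify one.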
   
   > When you say its preempted, I assume that you mean this application was killed by the RM due to higher priority? If it was killed how can it stay alive.
   
   I mean the client-mode AM's container was preempted. The RM then can't find this attempt, and the client driver backend receives `ApplicationAttemptNotFoundException`.




[GitHub] [spark] tgravescs commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
tgravescs commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773366575


   OK, please update the description with those details.
   Maybe cluster mode doesn't matter here because the application master would be killed anyway.
   I believe the original change was made before we could handle the application master being killed and restarted. I think that is handled OK now. So just to verify: with this change, the application master gets preempted and killed, the application master gets restarted, and the driver process continues, correct?
   
   The only thing better would be if YARN told us this was a preempt case. Did you look at that at all? It's been a while since I looked into the YARN code.




[GitHub] [spark] AngersZhuuuu edited a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu edited a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823763714


   The RM cleans up the preempted container's info, and then this code in `ClientRMService` throws `ApplicationAttemptNotFoundException`:
   
   ```
       RMAppAttempt appAttempt = application.getAppAttempts().get(attemptId);
       if (appAttempt == null) {
         throw new ApplicationAttemptNotFoundException(
             "ApplicationAttempt with id '" + attemptId + "' doesn't exist in RM.");
       }
   ```
   
   `org.apache.spark.deploy.yarn.Client` will receive this exception.
   
   In yarn-client mode, `Client` runs on the client driver side. When it receives `ApplicationAttemptNotFoundException`, it only means the AM's container was lost. In this situation, retrying and starting a new AM attempt is OK.
   In cluster mode, `Client` is wrapped in `YarnClusterApplication` and started on the user's client side.
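
A minimal sketch of the retry idea described above, written so it compiles without Hadoop on the classpath. `AttemptNotFound` is a stand-in for YARN's `ApplicationAttemptNotFoundException`, and `AttemptMonitor`, `pollWithRetry`, and `maxNotFound` are hypothetical names; Spark's real `Client` monitoring loop is not shown here.

```java
import java.util.function.Supplier;

// Stand-in for YARN's ApplicationAttemptNotFoundException (assumption:
// used only so this sketch is self-contained).
final class AttemptNotFound extends RuntimeException {
    AttemptNotFound(String message) {
        super(message);
    }
}

// Hypothetical helper, not part of Spark.
final class AttemptMonitor {
    private AttemptMonitor() {}

    /**
     * Calls fetchState until it returns a value. Up to maxNotFound
     * AttemptNotFound failures are tolerated; the next one is rethrown
     * so the caller can stop the app.
     */
    static String pollWithRetry(Supplier<String> fetchState, int maxNotFound) {
        int failures = 0;
        while (true) {
            try {
                return fetchState.get();
            } catch (AttemptNotFound e) {
                failures++;
                if (failures > maxNotFound) {
                    throw e;
                }
                // A real loop would wait here before polling the RM again.
            }
        }
    }

    public static void main(String[] args) {
        int[] calls = {0};
        // Simulates an attempt that is missing once, then visible again.
        String state = pollWithRetry(() -> {
            if (calls[0]++ < 1) {
                throw new AttemptNotFound("attempt missing");
            }
            return "RUNNING";
        }, 2);
        System.out.println(state);
    }
}
```

Rethrowing after the budget is exhausted keeps the old stop-the-app behavior as the fallback, which matches the concern in this thread that not every occurrence of the exception is recoverable.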
   
   
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823784240


   **[Test build #137715 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/137715/testReport)** for PR 31437 at commit [`69abde6`](https://github.com/apache/spark/commit/69abde655c00ff7dc4502bb601d6f695791e5ec0).




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823830136








[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771690641


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39365/
   




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773369627


   > ok, please update the description with those details.
   > maybe cluster mode doesn't matter here because application master would be killed anyway.
   > The original change I believe was before we could handle application master being killed and restarted. I think that is handled ok now. so just to verify with this change, the application master gets preempted and killed and application master gets restarted and the driver process continue, correct?
   > 
   > The only thing better would be if yarn told us this was a preempt case, did you look at that at all? its been a while since I looked into the yarn code.
   
   The error stack only tells us that the attempt can't be found. I found the root cause in YARN's log. Since we can't change YARN's code from the Spark side, all we can do here is retry. I am looking into YARN's code these days.




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773765682


   retest this please




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771690641


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39365/
   




[GitHub] [spark] AngersZhuuuu edited a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu edited a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773369627


   > ok, please update the description with those details.
   
   Done
   
   > maybe cluster mode doesn't matter here because application master would be killed anyway.
   
   Yea, and the original change also applied to yarn-client mode, since this part of the code is only used in yarn-client mode.
   
   > The original change I believe was before we could handle application master being killed and restarted.
   >  I think that is handled ok now. so just to verify with this change, the application master gets preempted and killed and application master gets restarted and the driver process continue, correct?
   
   yea
   
   > The only thing better would be if yarn told us this was a preempt case, did you look at that at all? its been a while since I looked into the yarn code.
   
   The error stack only tells us that the attempt can't be found. I found the root cause in YARN's log. Since we can't change YARN's code from the Spark side, all we can do here is retry. I am looking into YARN's code these days.




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773742464


   **[Test build #134901 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134901/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).




[GitHub] [spark] AngersZhuuuu edited a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu edited a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773369627








[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773395117


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134883/
   




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773752116


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134901/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782557538


   **[Test build #135292 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135292/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823804684


   **[Test build #137717 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/137717/testReport)** for PR 31437 at commit [`69abde6`](https://github.com/apache/spark/commit/69abde655c00ff7dc4502bb601d6f695791e5ec0).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823800992








[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773468964


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39469/
   




[GitHub] [spark] AngersZhuuuu removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782568625


   ping @tgravescs 




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782561702


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135292/
   




[GitHub] [spark] tgravescs commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
tgravescs commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-788039596


   Sorry, I was out of office for a bit. Did you look at the test failure?
   
   >> The only thing better would be if yarn told us this was a preempt case, did you look at that at all? its been a while since I looked into the yarn code.
   > From the error stack, it only tell us the attempt can't be found. I got root cause in yarn's log. Since we can't change yarn's code in spark side, what we can do here is just retry. I am looking into yarn's code these days.
   
   Did you have a chance to look at the YARN code here? I thought YARN was supposed to give an indication that it was going to preempt you, but it's been a while since I did anything in YARN.




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773813066


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39490/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782561702


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/135292/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773468589


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39469/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782560299


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39871/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773395117








[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-846944838


   ping @tgravescs 




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773393996


   **[Test build #134883 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134883/testReport)** for PR 31437 at commit [`b36cf61`](https://github.com/apache/spark/commit/b36cf618f0d4222443d8a91bf11112e961adb94d).




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782563590


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39871/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771649648


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134779/
   




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823784240


   **[Test build #137715 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/137715/testReport)** for PR 31437 at commit [`69abde6`](https://github.com/apache/spark/commit/69abde655c00ff7dc4502bb601d6f695791e5ec0).




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823804684


   **[Test build #137717 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/137717/testReport)** for PR 31437 at commit [`69abde6`](https://github.com/apache/spark/commit/69abde655c00ff7dc4502bb601d6f695791e5ec0).




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823763714


   
   The RM cleans up the preempted container's info, and then the code in `ClientRMService` throws `ApplicationAttemptNotFoundException`:
   
   ```
       RMAppAttempt appAttempt = application.getAppAttempts().get(attemptId);
       if (appAttempt == null) {
         throw new ApplicationAttemptNotFoundException(
             "ApplicationAttempt with id '" + attemptId + "' doesn't exist in RM.");
       }
   ```
   
   `org.apache.spark.deploy.yarn.Client` will receive this exception.
   
   In yarn-client
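
To make the failure path described in this comment concrete, here is a minimal, self-contained sketch of the same null-check pattern. It is not YARN code: `AttemptRegistry`, `cleanUp`, and `SketchAttemptNotFoundException` are hypothetical stand-ins so the example compiles without Hadoop on the classpath, and the stand-in exception is unchecked purely for brevity.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for YARN's ApplicationAttemptNotFoundException.
// Unchecked here only to keep the sketch short.
class SketchAttemptNotFoundException extends RuntimeException {
    SketchAttemptNotFoundException(String message) {
        super(message);
    }
}

// Hypothetical, heavily simplified model of the RM-side attempt bookkeeping.
class AttemptRegistry {
    private final Map<String, String> attempts = new HashMap<>();

    // Records an attempt and its state.
    void register(String attemptId, String state) {
        attempts.put(attemptId, state);
    }

    // Models the RM dropping its record of an attempt, e.g. after preemption.
    void cleanUp(String attemptId) {
        attempts.remove(attemptId);
    }

    // Same shape as the quoted check: a missing attempt becomes an exception.
    String getAttemptState(String attemptId) {
        String state = attempts.get(attemptId);
        if (state == null) {
            throw new SketchAttemptNotFoundException(
                "ApplicationAttempt with id '" + attemptId + "' doesn't exist in RM.");
        }
        return state;
    }
}
```

In this model, a caller that looks up an attempt after `cleanUp` has run gets the exception. Nothing in the exception says why the record is gone, which matches the point that the error stack only says the attempt cannot be found.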




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782557538


   **[Test build #135292 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135292/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771616224


   **[Test build #134779 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134779/testReport)** for PR 31437 at commit [`1aed1ea`](https://github.com/apache/spark/commit/1aed1ea82dac8694dabaf9ad12c7ee4e856bcabb).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773790362


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134908/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782563594


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39871/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771627867


   **[Test build #134779 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134779/testReport)** for PR 31437 at commit [`1aed1ea`](https://github.com/apache/spark/commit/1aed1ea82dac8694dabaf9ad12c7ee4e856bcabb).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782563594


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39871/
   




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-772604648


   > can you provide more detail here.
   > what mode is this running in when you hit this (client or cluster - unmanaged am or managed)?
   > When you say its preempted, I assume that you mean this application was killed by the RM due to higher priority? If it was killed how can it stay alive
   
   Our case is client mode. This error is thrown on the driver side.
   So we can change the behavior to:
   
   1. Client mode: for this error we can retry.
   2. Cluster mode: since the ApplicationMaster is running in the AM container, we can just stop?
   3. 
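
A minimal sketch of the mode-dependent handling proposed in the list above, under stated assumptions: `AttemptNotFoundPolicy`, `AttemptNotFoundStub`, and the `maxRetries` parameter are hypothetical names used for illustration and are not taken from the Spark code base or from this patch. The stub exception is unchecked only to keep the example short, and any backoff between retries is left out.

```java
import java.util.function.Supplier;

// Hypothetical stand-in for YARN's ApplicationAttemptNotFoundException.
class AttemptNotFoundStub extends RuntimeException {
    AttemptNotFoundStub(String message) {
        super(message);
    }
}

// Hypothetical policy: retry in client mode, stop right away in cluster mode.
class AttemptNotFoundPolicy {
    enum DeployMode { CLIENT, CLUSTER }
    enum Action { RETRY, STOP }

    // Decides what to do after `failures` consecutive attempt-not-found errors.
    static Action decide(DeployMode mode, int failures, int maxRetries) {
        if (mode == DeployMode.CLIENT && failures <= maxRetries) {
            return Action.RETRY;
        }
        return Action.STOP;
    }

    // Runs `call`, retrying as long as the policy allows; otherwise rethrows.
    static <T> T callWithPolicy(Supplier<T> call, DeployMode mode, int maxRetries) {
        int failures = 0;
        while (true) {
            try {
                return call.get();
            } catch (AttemptNotFoundStub e) {
                failures++;
                if (decide(mode, failures, maxRetries) == Action.STOP) {
                    throw e;
                }
            }
        }
    }
}
```

With this shape, the existing stop-immediately behavior is the special case `maxRetries = 0`, so one possible design is a bounded retry count that applies only in client mode.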




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823799862








[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823829023


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/137717/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771616224


   **[Test build #134779 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134779/testReport)** for PR 31437 at commit [`1aed1ea`](https://github.com/apache/spark/commit/1aed1ea82dac8694dabaf9ad12c7ee4e856bcabb).




[GitHub] [spark] tgravescs commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
tgravescs commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-772559171


   Can you provide more detail here?
   What mode is this running in when you hit this (client or cluster, unmanaged AM or managed)?
   When you say it's preempted, I assume you mean this application was killed by the RM due to a higher-priority request? If it was killed, how can it stay alive?




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773468964


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39469/
   




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782568625


   ping @tgravescs 




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823830178


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42245/
   




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823803517


   > Sorry was out of office for a bit. did you look at the test failure?
   
   It seems to be a flaky test.




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823830178


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42245/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773395094


   **[Test build #134883 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134883/testReport)** for PR 31437 at commit [`b36cf61`](https://github.com/apache/spark/commit/b36cf618f0d4222443d8a91bf11112e961adb94d).
    * This patch **fails Scala style tests**.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-787466707


   Gentle ping @tgravescs. Any more suggestions?




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773393996








[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773393996








[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782559847


   **[Test build #135292 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/135292/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773790362


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134908/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773448309


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39469/
   




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771649648


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134779/
   




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773748896


   **[Test build #134901 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134901/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).
    * This patch **fails Spark unit tests**.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773393996


   **[Test build #134883 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134883/testReport)** for PR 31437 at commit [`b36cf61`](https://github.com/apache/spark/commit/b36cf618f0d4222443d8a91bf11112e961adb94d).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773395117


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134883/
   




[GitHub] [spark] AngersZhuuuu edited a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu edited a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-772604648


   > can you provide more detail here.
   > what mode is this running in when you hit this (client or cluster - unmanaged am or managed)?
   
   Our case is client mode. This error is thrown on the driver side, and the logic I removed only applies in yarn-client mode.
   
   > When you say it's preempted, I assume that you mean this application was killed by the RM due to higher priority? If it was killed, how can it stay alive?
   
   I mean that the client-mode AM's container was preempted. The RM then can't find this attempt, so the client-side driver backend receives an `ApplicationAttemptNotFoundException`.




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773742464


   **[Test build #134901 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134901/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).






[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823803304


   retest this please






[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773777203


   **[Test build #134908 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134908/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771616224








[GitHub] [spark] HyukjinKwon commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-772154089


   cc @tgravescs and @jerryshao FYI




[GitHub] [spark] AngersZhuuuu edited a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu edited a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-772604648


   > can you provide more detail here.
   > what mode is this running in when you hit this (client or cluster - unmanaged am or managed)?
   
   Our case is client mode. This error is thrown on the driver side.
   
   > When you say it's preempted, I assume that you mean this application was killed by the RM due to higher priority? If it was killed, how can it stay alive?
   
   I mean that the client-mode AM's container was preempted. The RM then can't find this attempt, so the client-side driver backend receives an `ApplicationAttemptNotFoundException`.




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823800992








[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-823795031


   **[Test build #137715 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/137715/testReport)** for PR 31437 at commit [`69abde6`](https://github.com/apache/spark/commit/69abde655c00ff7dc4502bb601d6f695791e5ec0).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] SparkQA commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773784010


   **[Test build #134908 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134908/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).
    * This patch **fails Spark unit tests**.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773813066


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39490/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773752116


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134901/
   




[GitHub] [spark] SparkQA removed a comment on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773777203


   **[Test build #134908 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134908/testReport)** for PR 31437 at commit [`db37f5e`](https://github.com/apache/spark/commit/db37f5e791315263d4c83f7ee93d2d4a1b356b16).




[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-782550133


   retest this please




[GitHub] [spark] AmplabJenkins commented on pull request #31437: [SPARK-34329][SQL] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-771649648








[GitHub] [spark] AngersZhuuuu commented on pull request #31437: [SPARK-34329][YARN] When hit ApplicationAttemptNotFoundException, we can't just stop app for all case

Posted by GitBox <gi...@apache.org>.
AngersZhuuuu commented on pull request #31437:
URL: https://github.com/apache/spark/pull/31437#issuecomment-773369627





