Posted to dev@zeppelin.apache.org by GitBox <gi...@apache.org> on 2020/03/20 14:14:29 UTC

[GitHub] [zeppelin] xiejiajun opened a new pull request #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

xiejiajun opened a new pull request #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695
 
 
   ### What is this PR for?
   - Adds a timeout when getting a Thrift client, so that the interpreter can still be restarted after the interpreter process exits unexpectedly.
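
As a rough illustration of the idea (not the code in this PR), the sketch below shows a bounded pool whose borrow call gives up after a timeout instead of waiting forever. `TimedClientPool`, `borrow`, and `giveBack` are hypothetical names, and plain strings stand in for Thrift clients; Zeppelin's real `ClientPool` may be structured quite differently.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;

/** Hypothetical stand-in for a pool of Thrift clients; not Zeppelin's ClientPool. */
final class TimedClientPool {
    // Idle clients wait here; strings stand in for real Thrift clients.
    private final BlockingQueue<String> idle;

    TimedClientPool(int maxTotal) {
        idle = new ArrayBlockingQueue<>(maxTotal);
        for (int i = 0; i < maxTotal; i++) {
            idle.add("client-" + i);
        }
    }

    /** Waits at most timeoutMillis for an idle client, then fails instead of blocking. */
    String borrow(long timeoutMillis) {
        try {
            String client = idle.poll(timeoutMillis, TimeUnit.MILLISECONDS);
            if (client == null) {
                throw new IllegalStateException(
                        "No client available within " + timeoutMillis + " ms");
            }
            return client;
        } catch (InterruptedException e) {
            // Preserve the interrupt flag and surface the failure to the caller.
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for a client", e);
        }
    }

    /** Returns a client to the pool so that other callers can borrow it. */
    void giveBack(String client) {
        idle.offer(client);
    }
}
```

With an untimed `take()` in place of the timed `poll(...)`, a caller would block until some other thread returned a client, which never happens if every borrower is stuck on a dead process. Failing fast lets the restart path proceed to cleanup.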
   
   
   ### What type of PR is it?
   - Bug Fix
   
   ### What is the Jira issue?
   - https://issues.apache.org/jira/browse/ZEPPELIN-4691
   
   ### Questions:
   * Do the license files need an update? No
   * Are there breaking changes for older versions? No
   * Does this need documentation? No
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

[GitHub] [zeppelin] zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-602007560
 
 
   Which interpreter do you use? Are you on the latest master branch? A related issue was resolved recently.


[GitHub] [zeppelin] xiejiajun commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
xiejiajun commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-602042565
 
 
   > Which interpreter do you use? Are you on the latest master branch? A related issue was resolved recently.
   
   @zjffdu Sorry, I have not been able to find a related issue. I am using the 0.8.2 branch, and the Spark interpreter often hits this problem. As far as I can see, the related code in the 0.9 branch has not changed, so the problem should exist there as well. Can you point me to the issue that was resolved recently? I'll check whether it is related to this one.


[GitHub] [zeppelin] asfgit closed pull request #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
asfgit closed pull request #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695
 
 
   


[GitHub] [zeppelin] zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-601760088
 
 
   Thanks for the contribution, @xiejiajun. Could you let others know what kind of problem you are trying to resolve and how to reproduce this issue?


[GitHub] [zeppelin] xiejiajun commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
xiejiajun commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-601773051
 
 
   > Thanks for the contribution, @xiejiajun. Could you let others know what kind of problem you are trying to resolve and how to reproduce this issue?
   
   @zjffdu When I manually kill the interpreter process and then try to restart it with the interpreter restart button on the web page, the interpreter occasionally fails to restart, and the ZeppelinServer service must be restarted to recover. I traced the source code and found that the call chain for closing the interpreter blocks in the RemoteInterpreterProcess.getClient method.
   ![203605_f7e9310b_1936253](https://user-images.githubusercontent.com/26395958/77181346-213f6500-6b06-11ea-9b47-18e26cf53745.png)
   


[GitHub] [zeppelin] zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-602054524
 
 
   @xiejiajun Sorry, I forgot to paste the issue: [ZEPPELIN-4600](https://issues.apache.org/jira/browse/ZEPPELIN-4600)
   BTW, is this issue hard to reproduce? You mentioned that it happens occasionally, so I am wondering how often it can be reproduced. Which mode do you use, yarn-client or yarn-cluster?


[GitHub] [zeppelin] xiejiajun commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
xiejiajun commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-602093882
 
 
   > @xiejiajun Sorry, I forgot to paste the issue: [ZEPPELIN-4600](https://issues.apache.org/jira/browse/ZEPPELIN-4600)
   > BTW, is this issue hard to reproduce? You mentioned that it happens occasionally, so I am wondering how often it can be reproduced. Which mode do you use, yarn-client or yarn-cluster?
   
   I am using yarn-client mode. From analyzing the call chain: after using kill -9 to terminate the local Spark driver process, if we restart the Spark interpreter while the number of active Thrift clients in the ClientPool for that interpreter is greater than or equal to the default maxTotal value (8), the restart waits indefinitely for an available Thrift client.
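
The exhaustion condition described above can be modeled with a small simulation. This is only a sketch under stated assumptions: a `Semaphore` with 8 permits stands in for the pool's `maxTotal` limit, and `PoolExhaustionDemo` and `restartCanGetClient` are hypothetical names, not Zeppelin code. Clients held by callers stuck on the killed process are modeled as permits that are never released.

```java
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;

/** Hypothetical model of the scenario; permits stand in for pooled Thrift clients. */
final class PoolExhaustionDemo {
    // Default maxTotal cited in the comment above.
    static final int MAX_TOTAL = 8;

    /** Returns true if the restart path obtains a client within the timeout. */
    static boolean restartCanGetClient(int activeClients, long timeoutMillis) {
        Semaphore permits = new Semaphore(MAX_TOTAL);
        // Clients borrowed before the process was killed are never returned.
        permits.tryAcquire(Math.min(activeClients, MAX_TOTAL));
        try {
            // An untimed acquire() here would block forever once the pool is exhausted.
            return permits.tryAcquire(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(restartCanGetClient(7, 100)); // prints true: one client is still free
        System.out.println(restartCanGetClient(8, 100)); // prints false: pool exhausted, wait times out
    }
}
```

This also suggests why the problem shows up only occasionally: the restart hangs only when all 8 clients happen to be in use at the moment the process is killed.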


[GitHub] [zeppelin] zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly

Posted by GitBox <gi...@apache.org>.
zjffdu commented on issue #3695: [ZEPPELIN-4691]restart interpreter blocked after interpreter process exited unexpectedly
URL: https://github.com/apache/zeppelin/pull/3695#issuecomment-602323442
 
 
   Thanks for the detailed explanation. PR LGTM.
