Posted to dev@ambari.apache.org by Andrew Onischuk <ao...@hortonworks.com> on 2014/08/01 16:23:40 UTC

Re: Review Request 22204: During install, retry on repo timeout and show better error message if retries fail

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/22204/
-----------------------------------------------------------

(Updated Aug. 1, 2014, 2:23 p.m.)


Review request for Ambari and Dmytro Sen.


Bugs: AMBARI-1562
    https://issues.apache.org/jira/browse/AMBARI-1562


Repository: ambari


Description
-------

During Agent bootstrap + registration (the Confirm Hosts step), and during
cluster Install/Start/Test, there is a chance that the Hortonworks public
repositories or the EPEL repository will time out, producing the error "no
more mirrors".

This causes the host to be marked "fail", which can be concerning to the user.

1) We should trap this timeout error specifically and auto-retry. The timeout
may be only temporary, in which case a retry can succeed.
2) After a certain number of retries, we should produce a specific "fail"
message for the end user to help them troubleshoot.
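
The two points above can be sketched roughly as follows. This is only an
illustration of the intended behaviour, not the actual change to
setupAgent.py: the names run_with_retry and _run_command, the retry count,
the delay, and the wording of the final message are all assumptions. Only the
"no more mirrors" text comes from the description.

```python
import subprocess
import time

# Hypothetical values; the review request does not specify them.
MAX_RETRIES = 3
RETRY_DELAY_SECONDS = 5
# Error text quoted in the description, matched case-insensitively.
TIMEOUT_MARKER = "no more mirrors"


def _run_command(command):
    """Run a command and return (exit_code, combined stdout/stderr text)."""
    proc = subprocess.Popen(command, stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT)
    out, _ = proc.communicate()
    return proc.returncode, out.decode("utf-8", "replace")


def run_with_retry(command, run=_run_command, sleep=time.sleep):
    """Retry `command` only when its output looks like a repo timeout.

    `run` and `sleep` are injectable so the logic can be tested without
    touching a real package manager.
    """
    code, output = 1, ""
    for attempt in range(1, MAX_RETRIES + 1):
        code, output = run(command)
        if code == 0:
            return code, output
        if TIMEOUT_MARKER not in output.lower():
            # Some other failure: retrying is unlikely to help.
            return code, output
        if attempt < MAX_RETRIES:
            sleep(RETRY_DELAY_SECONDS)
    # All attempts timed out: prepend a specific message for the user.
    message = ("Repository timed out after %d attempts. Check network "
               "access to the configured repositories and try again.\n"
               % MAX_RETRIES)
    return code, message + output
```

Any failure that does not contain the timeout text is returned immediately, so
only the transient case is retried.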


Diffs (updated)
-----

  ambari-server/src/main/python/setupAgent.py 2bb7b53 

Diff: https://reviews.apache.org/r/22204/diff/


Testing
-------

mvn clean test


Thanks,

Andrew Onischuk


Re: Review Request 22204: During install, retry on repo timeout and show better error message if retries fail

Posted by Dmytro Sen <ds...@hortonworks.com>.
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/22204/#review49342
-----------------------------------------------------------

Ship it!


- Dmytro Sen

