When one of the node is not answering but it is still up (for example extremely overloaded) and I click on the 'grid' tab, the whole QB UI is not responding.
I can see following in the logs:
2011-07-04 16:44:36,640 [Thread-3534754] WARN com.pmease.quickbuild.DefaultBuildEngine - Error caching build status in grid node '{agent-name:8811}'.
com.caucho.hessian.client.HessianRuntimeException: Can not connect to '
http://10.18.120.30:8811/service/node'.
at com.caucho.hessian.client.HessianProxy.sendRequest(HessianProxy.java:321)
....
Caused by: java.net.SocketTimeoutException: connect timed out
at java.net.PlainSocketImpl.socketConnect(Native Method)
After timeout (few minutes) I can see the grid again the the problematic node is shown like this:
agent-name:8811 build agent 10.18.120.30 false 10 unknown 10000