<< Back to previous view |
[QB-2380] QuickBuild Node Issues
|
|
Status: | Closed |
Project: | QuickBuild |
Component/s: | None |
Affects Version/s: | 6.0.9 |
Fix Version/s: | None |
Type: | Bug | Priority: | Major |
Reporter: | Phong Trinh | Assigned To: | Robin Shen |
Resolution: | Fixed | Votes: | 0 |
Remaining Estimate: | Unknown | Time Spent: | Unknown |
Original Estimate: | Unknown | ||
Environment: | QuickBuild 6.0.9 |
Description |
Hi,
I am having two issue with the QuickBuild nodes as the follows: 1) I received many alerts with similar message, "Build Agent '<AgentName>' is removed from active list as server has not hear from it for 12 seconds" although the build machines were still up and running fine. Please advise me on this issue. 2) When a node is offline, all of the jobs which are assigned to that node are automatically cancelled by QuickBuild. I think the jobs are supposed to wait for the node until it is back online and then resume the operations. Please advise me on this issue too. Thanks, -Phong |
Comments |
Comment by Robin Shen [ 21/Mar/15 12:58 AM ] |
For issue1, the agent is actually removed but then it has been added back automatically when server hears from it again. You may edit system setting to set a large value for agent timeout so that server does not complain about this.
For issue2, QB has to terminate the build as long as it finds the agent offline as it does not know when the agent will come back. If it waits eternally, other jobs can be blocked. |
Comment by Phong Trinh [ 21/Mar/15 04:28 AM ] |
For issue 1: Regarding editing system setting to set a large value for agent timeout, Is it the wrapper.ping.timeout=0 in the wrapper.conf on QuickBuild server?
For issue 2: I think when a node is offline, QB can cancel the build which is running on the node. But it should not cancel all of builds in queue which are waiting for the node. |
Comment by Robin Shen [ 22/Mar/15 12:18 AM ] |
For issue 1: this setting can be found in "Administration / System Setting" via QB web UI.
For issue 2: to have queueing builds waiting instead of failed, please configure master step of them to use resource instead of matching a particular node, for instance, you may define a resource say "node1", which matches node1 via its resource setting, then configure the master step to use that resource. |
Comment by Phong Trinh [ 24/Mar/15 06:50 PM ] |
Thank you very much for responding promptly.
The solution for issue 1 fixed my system. Thank you. I have an agent which dedicates to one of our builds. The build run for hours, and I schedule it run twice day. Per your suggestion, I think I create a resource which has only one node and configure master step to use that resource. My follow-up question is that will the builds be queued or wait if the node is offline? Thank you in advance, ptrinh |
Comment by Robin Shen [ 24/Mar/15 11:13 PM ] |
Yes it will. Please give it a try. |