[QB-2710] allocateResource got stuck during a build although the node selection is "running on same as parent".
Status: | Closed |
Project: | QuickBuild |
Component/s: | None |
Affects Version/s: | 6.1.13 |
Fix Version/s: | None |
Type: | Bug | Priority: | Critical |
Reporter: | AlSt | Assigned To: | Robin Shen |
Resolution: | Cannot Reproduce | Votes: | 0 |
Remaining Estimate: | Unknown | Time Spent: | Unknown |
Original Estimate: | Unknown |
Description |
Hi.
We had a build that ran into a timeout after 12 hours because the allocateResource method got stuck.

Build log:

16:43:14,845 [master@lnz-dt-ub12-qb03v:8811] INFO - Executing pre-execute action...
16:43:14,903 [master@lnz-dt-ub12-qb03v:8811] INFO - Running step...
16:43:14,935 [master>init@lnz-dt-ub12-qb03v:8811] INFO - Checking step execute condition...
16:43:14,936 [master>init@lnz-dt-ub12-qb03v:8811] INFO - Step execute condition satisfied, executing...
04:43:16,812 [master@lnz-dt-ub12-qb03v:8811] INFO - Executing post-execute action...
04:43:16,855 [master@lnz-dt-ub12-qb03v:8811] ERROR - Step 'master' is failed.
java.lang.RuntimeException: Error executing step process job.
    at com.pmease.quickbuild.stepsupport.StepProcessTask.reduce(StepProcessTask.java:126)
    at com.pmease.quickbuild.stepsupport.StepProcessTask.reduce(StepProcessTask.java:19)
    at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:102)
    at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:106)
    at com.pmease.quickbuild.stepsupport.SequentialStep.triggerChildren(SequentialStep.java:36)
    at com.pmease.quickbuild.stepsupport.CompositeStep.run(CompositeStep.java:117)
    at com.pmease.quickbuild.stepsupport.Step.execute(Step.java:548)
    at com.pmease.quickbuild.stepsupport.StepExecutionJob.executeStepAwareJob(StepExecutionJob.java:31)
    at com.pmease.quickbuild.stepsupport.StepAwareJob.executeBuildAwareJob(StepAwareJob.java:45)
    at com.pmease.quickbuild.BuildAwareJob.execute(BuildAwareJob.java:60)
    at com.pmease.quickbuild.grid.GridJob.run(GridJob.java:106)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.RuntimeException: Error executing resource allocate job.
    at com.pmease.quickbuild.resource.ResourceAllocateTask.reduce(ResourceAllocateTask.java:27)
    at com.pmease.quickbuild.resource.ResourceAllocateTask.reduce(ResourceAllocateTask.java:13)
    at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:102)
    at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:106)
    at com.pmease.quickbuild.stepsupport.StepExecutionTask.map(StepExecutionTask.java:86)
    at com.pmease.quickbuild.stepsupport.StepExecutionTask.map(StepExecutionTask.java:19)
    at com.pmease.quickbuild.grid.GridImpl.execute(GridImpl.java:83)
    at com.pmease.quickbuild.stepsupport.StepProcessJob.executeStepAwareJob(StepProcessJob.java:38)
    ... 8 more
Caused by: java.lang.RuntimeException: java.lang.InterruptedException: sleep interrupted
    at com.pmease.quickbuild.DefaultBuildEngine.allocateResource(DefaultBuildEngine.java:2282)
    at com.pmease.quickbuild.resource.ResourceAllocateJob.execute(ResourceAllocateJob.java:36)
    at com.pmease.quickbuild.grid.GridJob.run(GridJob.java:106)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    ... 1 more
Caused by: java.lang.InterruptedException: sleep interrupted
    at java.lang.Thread.sleep(Native Method)
    at com.pmease.quickbuild.DefaultBuildEngine.allocateResource(DefaultBuildEngine.java:2280)
    ... 7 more
04:43:17,810 [@ld-ub-bm:8810] ERROR - Build is timed out.
java.util.concurrent.TimeoutException: Timed out waiting for task completion.
    at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:82)
    at com.pmease.quickbuild.DefaultBuildEngine.run(DefaultBuildEngine.java:526)
    at com.pmease.quickbuild.DefaultBuildEngine.process(DefaultBuildEngine.java:394)
    at com.pmease.quickbuild.DefaultBuildEngine.access$000(DefaultBuildEngine.java:139)
    at com.pmease.quickbuild.DefaultBuildEngine$2.run(DefaultBuildEngine.java:1136)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745) |
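For readers trying to interpret the innermost cause in the trace above: an InterruptedException thrown from Thread.sleep and rethrown wrapped in a RuntimeException is what a wait loop that sleeps between checks would typically produce when its thread is interrupted. The sketch below is only an illustration of that general pattern, not QuickBuild's actual code. The class AllocatePollSketch, both method names, the polling interval, and the always-false availability check are hypothetical, and the short delay merely stands in for the 12-hour build timeout.

```java
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.BooleanSupplier;

class AllocatePollSketch {

    // Hypothetical wait loop: sleeps between checks until the resource is free.
    // If the resource never becomes free, only an interrupt ends the loop.
    static void allocateResource(BooleanSupplier resourceFree, long pollMillis) {
        while (!resourceFree.getAsBoolean()) {
            try {
                Thread.sleep(pollMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // keep the interrupt status visible
                throw new RuntimeException(e);      // wrapped, as in the reported trace
            }
        }
    }

    // Starts a worker that can never allocate, interrupts it after waitMillis
    // (a stand-in for the build timeout), and returns what the worker threw.
    static Throwable interruptStuckAllocation(long waitMillis) {
        AtomicReference<Throwable> failure = new AtomicReference<>();
        Thread worker = new Thread(() -> {
            try {
                allocateResource(() -> false, 50);
            } catch (RuntimeException e) {
                failure.set(e);
            }
        });
        worker.start();
        try {
            Thread.sleep(waitMillis);
            worker.interrupt();
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("demo itself was interrupted", e);
        }
        return failure.get();
    }

    public static void main(String[] args) {
        Throwable t = interruptStuckAllocation(200);
        System.out.println(t);
        System.out.println("Caused by: " + t.getCause());
    }
}
```

If the real loop resembles this, the exception is only a symptom of the timeout. The open question is still why the allocation condition never became true for 12 hours, which is what the server log would help answer.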
Comments |
Comment by Robin Shen [ 05/May/16 12:50 AM ] |
Are there any errors printed in the server log about this? And are you able to reproduce this issue? |
Comment by AlSt [ 09/May/16 11:53 AM ] |
Sadly, I did not capture the log at the time because we noticed the error quite late, and it has not happened a second time. Very weird. I'll let you know if it occurs again. |
Comment by Robin Shen [ 01/Dec/16 08:21 AM ] |
Reopen it if there are more clues.