History | Log In     View a printable version of the current page.  
Issue Details (XML | Word | Printable)

Key: QB-2710
Type: Bug Bug
Status: Closed Closed
Resolution: Cannot Reproduce
Priority: Critical Critical
Assignee: Robin Shen
Reporter: AlSt
Votes: 0
Watchers: 0
Operations

If you were logged in you would be able to see more operations.
QuickBuild

allocateResource got stuck during a build although the node selection is "running on same as parent".

Created: 04/May/16 12:13 PM   Updated: 01/Dec/16 08:21 AM
Component/s: None
Affects Version/s: 6.1.13
Fix Version/s: None

Original Estimate: Unknown Remaining Estimate: Unknown Time Spent: Unknown


 Description  « Hide
Hi.

We got a build which ran into timeout after 12 hours because the allocateResource method got stuck. build log:

16:43:14,845 [master@lnz-dt-ub12-qb03v:8811] INFO - Executing pre-execute action...
16:43:14,903 [master@lnz-dt-ub12-qb03v:8811] INFO - Running step...
16:43:14,935 [master>init@lnz-dt-ub12-qb03v:8811] INFO - Checking step execute condition...
16:43:14,936 [master>init@lnz-dt-ub12-qb03v:8811] INFO - Step execute condition satisfied, executing...
04:43:16,812 [master@lnz-dt-ub12-qb03v:8811] INFO - Executing post-execute action...
04:43:16,855 [master@lnz-dt-ub12-qb03v:8811] ERROR - Step 'master' is failed.
    java.lang.RuntimeException: Error executing step process job.
        at com.pmease.quickbuild.stepsupport.StepProcessTask.reduce(StepProcessTask.java:126)
        at com.pmease.quickbuild.stepsupport.StepProcessTask.reduce(StepProcessTask.java:19)
        at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:102)
        at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:106)
        at com.pmease.quickbuild.stepsupport.SequentialStep.triggerChildren(SequentialStep.java:36)
        at com.pmease.quickbuild.stepsupport.CompositeStep.run(CompositeStep.java:117)
        at com.pmease.quickbuild.stepsupport.Step.execute(Step.java:548)
        at com.pmease.quickbuild.stepsupport.StepExecutionJob.executeStepAwareJob(StepExecutionJob.java:31)
        at com.pmease.quickbuild.stepsupport.StepAwareJob.executeBuildAwareJob(StepAwareJob.java:45)
        at com.pmease.quickbuild.BuildAwareJob.execute(BuildAwareJob.java:60)
        at com.pmease.quickbuild.grid.GridJob.run(GridJob.java:106)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
    Caused by: java.lang.RuntimeException: Error executing resource allocate job.
        at com.pmease.quickbuild.resource.ResourceAllocateTask.reduce(ResourceAllocateTask.java:27)
        at com.pmease.quickbuild.resource.ResourceAllocateTask.reduce(ResourceAllocateTask.java:13)
        at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:102)
        at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:106)
        at com.pmease.quickbuild.stepsupport.StepExecutionTask.map(StepExecutionTask.java:86)
        at com.pmease.quickbuild.stepsupport.StepExecutionTask.map(StepExecutionTask.java:19)
        at com.pmease.quickbuild.grid.GridImpl.execute(GridImpl.java:83)
        at com.pmease.quickbuild.stepsupport.StepProcessJob.executeStepAwareJob(StepProcessJob.java:38)
        ... 8 more
    Caused by: java.lang.RuntimeException: java.lang.InterruptedException: sleep interrupted
        at com.pmease.quickbuild.DefaultBuildEngine.allocateResource(DefaultBuildEngine.java:2282)
        at com.pmease.quickbuild.resource.ResourceAllocateJob.execute(ResourceAllocateJob.java:36)
        at com.pmease.quickbuild.grid.GridJob.run(GridJob.java:106)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        ... 1 more
    Caused by: java.lang.InterruptedException: sleep interrupted
        at java.lang.Thread.sleep(Native Method)
        at com.pmease.quickbuild.DefaultBuildEngine.allocateResource(DefaultBuildEngine.java:2280)
        ... 7 more
04:43:17,810 [@ld-ub-bm:8810] ERROR - Build is timed out.
    java.util.concurrent.TimeoutException: Timed out waiting for task completion.
        at com.pmease.quickbuild.grid.GridTaskFuture.get(GridTaskFuture.java:82)
        at com.pmease.quickbuild.DefaultBuildEngine.run(DefaultBuildEngine.java:526)
        at com.pmease.quickbuild.DefaultBuildEngine.process(DefaultBuildEngine.java:394)
        at com.pmease.quickbuild.DefaultBuildEngine.access$000(DefaultBuildEngine.java:139)
        at com.pmease.quickbuild.DefaultBuildEngine$2.run(DefaultBuildEngine.java:1136)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        at java.lang.Thread.run(Thread.java:745)


 All   Comments   Work Log   Change History      Sort Order:
Robin Shen [05/May/16 12:50 AM]
Are there any errors printed in server log about this? And are you able to reproduce this issue?

AlSt [09/May/16 11:53 AM]
Sadly I did not capture the log at this time because we saw the error pretty late. And it did not happen a second time. Very weird. I'll inform you when it occurs again.

Robin Shen [01/Dec/16 08:21 AM]
Reopen it if there is more clue