During a routine job run this morning, 2 of our agents became unresponsive to the server (they disappeared from the grid). The error was "Can not find any node matching specified criteria." After further checking and reruns, one agent had recovered but the other had not. I managed to do a "hard restart" (running the stop agent command twice) on the agent that had not recovered, and then started it again.
I would like to know why both agents became unresponsive in the first place, and why one of them did not recover on its own.
I am attaching logs for both the agents and the server.