V
Vivek N
Guest
Hi
We are trying to write a customized job DRMS for our private cluster and we are having issues getting it to work.
We have followed the template in the SKILL file, but there are some unexpected results.
For example we see it launching the interface job and then that times out with:
\"Querying health for interface job 2, current health: unknown, new health: unknown\"
This repeats a few times till the job policy timeout ends.
After that our interface is called and the job status etc are OK - it looks like we have something wrong
Is there anyone who has successfully got this working?
I will describe the issues in more detail if someone has.
Thanks in advance
V
We are trying to write a customized job DRMS for our private cluster and we are having issues getting it to work.
We have followed the template in the SKILL file, but there are some unexpected results.
For example we see it launching the interface job and then that times out with:
\"Querying health for interface job 2, current health: unknown, new health: unknown\"
This repeats a few times till the job policy timeout ends.
After that our interface is called and the job status etc are OK - it looks like we have something wrong
Is there anyone who has successfully got this working?
I will describe the issues in more detail if someone has.
Thanks in advance
V