On Feb 23, 2010, at 8:45 AM, charlieR wrote:
Hi,
I'm having trouble canceling jobs from my director on the CLI console.
In the console, I do 'status director'. It returns something like this....
Running Jobs:
Console connected at 23-Feb-10 08:29
JobId Level Name Status
======================================================================
3098 Full serv01.2010-02-22_23.05.00_13 is waiting for Client serv01 to connect to Storage storage.domain
3099 Full serv02.2010-02-22_23.05.00_14 is waiting on max Storage jobs
3105 Increme serv03.2010-02-22_23.05.00_20 is waiting on max Storage jobs
3106 Increme serv04.2010-02-22_23.05.00_21 is waiting on max Storage jobs
Okay I know what the problem is, so I want to fix it after the other jobs run. So I do 'cancel' and then pick the job from the list, and then it returns....
3904 Job serv01.2010-02-22_23.05.00_13 not found.
Can anyone tell me why it can't find this or where I should start looking?
# Extra stuff that may help
Here's how my setup looks.
We have a main director server.
We have 2 storage servers located at remote datacenters.
We have all our clients.
We use the TLS tunnels on every client to the director / storage and vice versa.
Thanks for the help.
CharlieR
I think I found out what it is....
It mentions that if it's waiting on storage you may have to do a mount. In my case this is because this host can't connect to the storage due to me not having my ssl certs in place.
I have also found that I can cancel other jobs, just not the one that is running and waiting on that storage. I can't do a mount as that storage is always mounted.
I would think this should time out and cancel if it can't connect after a certain period. Is there any config options I can use to do this? Or at least move on to the next job.
Regards,
CharlieR