-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SLURM jobs tracking issue #14
Comments
Hi! Thanks! Thanks for the detailed bug report, and sorry it isn't working. I've tried to reproduce on our cluster by running The code is getting stuck here. To help debug this, could you please:
here.
Thanks! |
Thank you for the response!
the script running node - 1973245. |
Thanks. As the job finished before 23:57, but the output of Is this right? I assume the completed job remains in the queue for a while with At the moment, I'm naively using
to read all jobs in the slurm queue, which are assumed to be running. I'll come up with a more robust way of doing this which accounts for completed jobs. |
If this is this case, it should be fixed in this branch: https://github.com/michellab/a3fe/tree/bugfix-robust-slurm-queue-read . I've simply updated to
If this is the case, could you please pull the latest changes, check out the branch Thanks! |
Yes, it is right, all finished jobs in our system stay for some time in the queue. |
Brilliant, glad it works! No problem. |
(Reopening so I can formally close my merging in the |
Hello!
Thank you for the great tool!
I have an issue with tracking jobs submitted via SLURM, I created the input dir, copied all provided example input files from a3fe/a3fe/data/example_run_dir into it and saved the code from the documentation into calc.py:
I run the script inside the SLURM Interactive Job node (maybe it is the source of the issue, but I cannot run it otherwise).
And here is a problem, Slurm job with 1970700 ID has been finished, but the script is still waiting for it and doesn't stop until I forcefully stop it.
The output of the SLURM 1970700 JOB:
Could you help me please?
The text was updated successfully, but these errors were encountered: