-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nomad allocation stuck in pending state after docker "error reading image pull progress" #24955
Comments
I was able to reproduce the problem as it is explained below. Nomad version: Deployed below job, which uses an image with 7+GB size.
deployed the job and, after some time, updated the Allocation status is stuck in pending state . Please refer below Screenshots and logs Nomad logs:
Docker logs:
|
Heya @sijo13, thanks for another report! I am able to reproduce this one by inducing an error in the image pull code. We did a big refactor of the docker driver in Nomad I think we can patch this one up to go out in Nomad |
Nomad allocation stuck in pending state for over 2 days in multiple Nomad clients.
Nomad version
Docker version:
During the deployment of a job, one of the allocations went into a pending state as it failed while pulling the images from the artifactory.
Docker logs:
Nomad Logs:
Allocation was in a pending state for 2 days. The issue was fixed only post-restart of Nomad client service.
Please note that the same issue was observed in another cluster where the Nomad version is
v1.7.3
. all the pending allocations were self-healed, which was not the case with nomad clients with versionv1.9.3
Nomad job file:
The text was updated successfully, but these errors were encountered: