Not enough Inodes #961
Comments
I see this issue was raised in 2018 to expose the values used for the inode checks and the clean-up time. I'm not sure whether this is something to revisit, but I wanted to highlight it.
Thank you for opening this issue @darrenwhighamfd. To confirm my understanding: once these instances’ disks have filled up to this point, they fail any builds they are assigned until they are replaced? In the medium to long term I would like to move this health check out of the job lifecycle and into the agent lifecycle, so that agents whose host is unhealthy do not accept jobs. In the short term, to get this working for you again, would you be able to append a value for the
Thanks @keithduncan. That's correct about the issue. We have set DOCKER_PRUNE_UNTIL to a lower value for now to see if it helps mitigate the issue.
Good to hear @darrenwhighamfd. I’m going to close this in the short term, as we have hopefully mitigated the acute issue and have the long-term fix tracked in buildkite/agent#1111. If this recurs for you because our mitigation is insufficient, please don’t hesitate to re-open or leave a comment and we’ll work on a new solution. 🙇
Hi,
We are starting to see this error more and more often on our more intensive queues used for building.
I see the script that performs the check here: https://github.com/buildkite/elastic-ci-stack-for-aws/blob/master/packer/linux/conf/bin/bk-check-disk-space.sh and the relevant line here:
elastic-ci-stack-for-aws/packer/linux/conf/buildkite-agent/hooks/environment
Line 34 in 272916a
However, our agents are typically shorter-lived than 4 hours because we scale as needed. As a result, the clean-up does not help unless we set this to a lower value.
Is there another way around this issue, other than adding more disk space to the agent (which I think increases the inodes available) or reducing the agent lifetime and spinning agents down sooner?
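For anyone following along, the inode check and age-based clean-up discussed in this thread can be sketched roughly as below. This is only an illustration, not the stack's actual `bk-check-disk-space.sh`: the function names, the `MAX_INODE_USE_PERCENT` variable, the `/var/lib/docker` path, and the `1h` fallback are all assumptions made up for the example. Only `DOCKER_PRUNE_UNTIL` comes from the discussion above. It also assumes GNU `df` and a filesystem name without spaces.

```shell
# Hypothetical helpers for illustration; not the stack's bk-check-disk-space.sh.

# Read `df -iP <path>` output on stdin and print the IUse% column as a bare number.
parse_inode_use() {
  awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

# Exit 0 when the used percentage ($1) is above the threshold ($2).
inodes_exhausted() {
  case "$1" in
    # Non-numeric input (some filesystems report "-"): treat as not exhausted.
    ''|*[!0-9]*) return 1 ;;
  esac
  [ "$1" -gt "$2" ]
}

# Prune Docker images older than DOCKER_PRUNE_UNTIL when inodes run low.
# The path, the 90% threshold, and the 1h fallback are assumed values.
prune_if_low_on_inodes() {
  used="$(df -iP "${1:-/var/lib/docker}" | parse_inode_use)"
  if inodes_exhausted "$used" "${MAX_INODE_USE_PERCENT:-90}"; then
    docker image prune --all --force --filter "until=${DOCKER_PRUNE_UNTIL:-1h}"
  fi
}
```

With short-lived agents, the point of a lower `until` value is that images become eligible for pruning before the instance is replaced; calling `prune_if_low_on_inodes` from a hook would be one way to try that out.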