-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Autoupdate from 2.163.1 to 2.164.0 stucks #289
Comments
It seems that the only Issue with the update is moving the old /bin and /externals to bin.2.163.1 and externals.2.163.1 and then creating the symlinks in their place. |
@Alex-ala there should be an update log under _daig folder of the runner root, the update log looks like |
The log has neither .suceeded or .failed appended.
No additional lines are written over time (I still got one runner that is currently not working on any job but wrote this log on 20th of December). In a fresh setup this happened:
After 4. the restarted runners logfile states, that its waiting for jobs, but its stuck until I restart it again. It is still running 2.163.1 and the file-system is as described in the issue. The PID given in the SelfUpdate log is not running anymore after step 3. |
@Alex-ala did you configure the runner as service? or running interactively from terminal? if you run the runner interactively, try |
Both, when manually stopped and during selfupdate, the runner has no process left running. There are no processes alive during a short period when the runner updates and restarts. |
Thank you for your hint at running it as a service @TingluoHuang . |
Describe the bug
We have our self-hosted runners deployed and running for some time. Today they started self-updating from version 2.163.1 to 2.164.0 . The runner downloaded the new version and at least partially copied the new data. After the automatic restart of the runner it is not processing any more jobs. Neither cancelling and restarting the job via Web, nor restarting the runner or pushing a new commit does unstuck this runner.
For the runner to work again I had to:
I observed that the folder-structure after the failed update has some new folders with the new version suffixed. After a correct manual update, there are symlinks to these directories that were not created during automatic update. Maybe this is the cause for the error.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
The runner updates correctly
Runner Version and Platform
Runner (before): 2.163.1
Runner (after): 2.164.0
OS: CentOS 7
What's not working?
Jobs are forever queued: "Starting your workflow run..."
Job Log Output
"Starting your workflow run..."
Runner and Worker's Diagnostic Logs
error.log
ls.txt
The text was updated successfully, but these errors were encountered: