Tunneldigger process management broken (?): can end up with multiple tunneldigger running #148
Comments
Can you maybe reproduce something like this if you try to do multiple tunneldigger restarts at the same time? Something like:
I'm wondering if the tunneldigger-watchdog micron job can sometimes result in multiple restarts being run in parallel? Just some wild guesses.
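If parallel restarts do turn out to be the cause, one way to rule them out would be to serialize restarts behind a lock. The following is only a sketch under stated assumptions, not what Gluon or tunneldigger actually do: the lock path and the `with_restart_lock` helper are hypothetical names, and the init script path in the usage comment is assumed.

```shell
#!/bin/sh
# Hypothetical lock location; not taken from Gluon or tunneldigger.
LOCKDIR="${LOCKDIR:-/tmp/tunneldigger-restart.lock}"

# Run the given command only if no other caller currently holds the lock.
# mkdir is atomic: of several concurrent callers, exactly one succeeds.
with_restart_lock() {
    if ! mkdir "$LOCKDIR" 2>/dev/null; then
        echo "restart already in progress, skipping" >&2
        return 1
    fi
    "$@"
    rc=$?
    rmdir "$LOCKDIR"
    return "$rc"
}

# Example use from a watchdog (assumed init script path):
# with_restart_lock /etc/init.d/tunneldigger restart
```

Note that this sketch does not handle a stale lock left behind when the caller is killed mid-restart; since /tmp is usually a tmpfs on these devices, a reboot would clear it.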
Hm, when I tried this even just with a loop count of 20, my device just reboots after a bit... nothing it prints via SSH shows any indication why. It's a pretty weak device with very little RAM, so it's probably not good for such tests. It's the only one I have though...
You should be able to find out if it's an out-of-memory or other crash via /sys/kernel/debug/crashlog after the device rebooted, as long as you don't power cycle it. Or via a serial console, of course. Not sure if that'd help for this issue, but maybe there could be some unexpected hints in there?
We had this issue on a lot of routers, and it seems to occur after a reboot (daily reboots). Patched this function
to
With this change we don't have this issue anymore, but I still can't tell how the routers end up
tunneldigger has been deprecated in Gluon and removed in the main branch: freifunk-gluon/gluon#3109
The latest lead in a long-standing issue seems to indicate that tunneldigger process management sometimes goes wrong, and we can end up with 2 instances of tunneldigger running (`ps` showing 6 tunneldigger processes, rather than the usual 3). This then leads to those 2 instances interrupting each other all the time, which is essentially a DoS attack on the gateway.

I don't know how to reproduce this, and have not actually seen these 6 tunneldigger processes myself (I never managed to get SSH onto an affected device), but this is the best lead so far. So I wonder... how could a Gluon device end up in a situation where tunneldigger runs twice?
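For anyone who does get shell access to an affected device, the symptom described above could be checked with something like the following. This is a rough sketch: the helper names are hypothetical, the threshold of 3 is simply the "usual" count reported in this issue, and the pattern may also match related processes such as a watchdog, so the raw `ps` output should be inspected as well.

```shell
#!/bin/sh
# Count lines of `ps` output (read from stdin) that mention tunneldigger.
# The [t] bracket keeps the pattern from matching the grep process itself.
count_tunneldigger() {
    grep -c '[t]unneldigger'
}

# Classify a count using the numbers reported in this issue:
# 3 processes is the usual case; more suggests a duplicate instance.
check_tunneldigger_count() {
    n="$1"
    if [ "$n" -gt 3 ]; then
        echo "suspicious: $n tunneldigger processes"
        return 1
    fi
    echo "ok: $n tunneldigger processes"
}

# On a device: check_tunneldigger_count "$(ps | count_tunneldigger)"
```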