-
Notifications
You must be signed in to change notification settings - Fork 448
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Fail master not replaced ? #322
Comments
(as i need my cluster, workaround found : restart the non healthy node... and kill it after all other resync ok) |
@LordFPL Can you provide more infos on the keepers states (their logs)? You could also run the sentinel with From the above status looks like |
On keeper i have theses logs on cald00 (synced) :
On cald01 (not synced) :
I can't add --debug as problem is gone with the restart of the failed master, sorry. |
@LordFPL From these logs I can see that cald00 and cald01 weren't able to talk to cald03, probably because it died before them. Looks like your nodes died multiple time and at differen times since As a note: Next time something like this happens you could try saving a I'll open a PR to move the sentinel logs that reports why a standby is skipped from new master decision from Debug to Info level since it makes more sense.
If you want to force a db to become master (be sure that its state is good or you'll end with a master with a lot of lost transactions) you can reinitialize the cluster using an existing keeper: |
@sgotti Many thx for this more than complete answer. If this problem happen again, i will take more logs before any action. |
Hello,
Due to a power outage, all my nodes have been halted... and poweron is giving me a strange situation :
My master is on a non healthy node... and logs on the sentinel are :
Why sentinel is not switching to cald00 which was last master before outage ?
How can i force a new master ?
Thx in advance :)
The text was updated successfully, but these errors were encountered: