kubeadm's etcd client member add / remove can return errors but server side there could be success #3111
Labels
area/etcd
kind/bug
Categorizes issue or PR as related to a bug.
priority/important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
Milestone
had a discussion offline with @ahrtr
basically errors like this cannot be trusted because etcd is a distributed system:
https://github.com/kubernetes/kubernetes/blob/master/cmd/kubeadm/app/util/etcd/etcd.go#L436
what can happen:
the solution is to check the member list for the given peer URL before any add (learner or normal) / remove operation.
here:
https://github.com/kubernetes/kubernetes/blob/release-1.31/cmd/kubeadm/app/util/etcd/etcd.go#L430-L431
https://github.com/kubernetes/kubernetes/blob/release-1.31/cmd/kubeadm/app/util/etcd/etcd.go#L361
The text was updated successfully, but these errors were encountered: