RFC: responsive leadership transfer #144

BusyJay · 2018-11-22T06:46:17Z

Summary

This RFC proposes using an index to trace each leadership transfer. If the follower fails to start an election, it should send a response back to the leader.

Motivation

When a leader transfers leadership to a follower, the follower may or may not start an election. The leader can't know precisely what happened, so it stops serving reads and writes, waits for an election timeout, and then tries to retain leadership if no one campaigns. We have observed unexpectedly high latency when a leadership transfer fails. Note that the failure doesn't have to be caused by a network failure; it can also be caused by slow application of logs. For example, a newly promoted voter may not start a campaign if the conf change has not yet been applied locally.

Detailed design

We can introduce an index to trace every leadership transfer. Every time a leadership transfer happens, the index increases by 1. The index is also sent with the transfer command. If a follower checks its own state and decides not to campaign, it should send back a TransferLeaderResponse to tell the leader its decision. If the leader finds that a rejected response's index matches its own latest transfer index, it aborts the leadership transfer immediately.
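A minimal sketch of the flow described above, to make the index matching concrete. Apart from TransferLeaderResponse, every name here (`TransferLeader`, `Leader`, `start_transfer`, `handle_response`, `follower_handle_transfer`, the `can_campaign` flag) is hypothetical and chosen for illustration; none of it is existing raft-rs API, and the follower's real eligibility check is reduced to a boolean.

```rust
/// Hypothetical transfer command carrying the leader's transfer index.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct TransferLeader {
    pub transfer_index: u64,
}

/// Response a follower sends when it decides not to campaign.
/// The field layout is an assumption; the RFC only names the message.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct TransferLeaderResponse {
    pub transfer_index: u64,
    pub reject: bool,
}

/// Leader-side bookkeeping for an in-flight transfer (illustrative only).
#[derive(Debug, Default)]
pub struct Leader {
    transfer_index: u64,
    transferee: Option<u64>,
}

impl Leader {
    /// Start a transfer to peer `to`: bump the index and attach it to the command.
    pub fn start_transfer(&mut self, to: u64) -> TransferLeader {
        self.transfer_index += 1;
        self.transferee = Some(to);
        TransferLeader { transfer_index: self.transfer_index }
    }

    /// Whether a transfer is still pending (reads/writes would be paused).
    pub fn is_transferring(&self) -> bool {
        self.transferee.is_some()
    }

    /// Abort the pending transfer only if the rejection refers to the latest
    /// transfer index; stale rejections are ignored. Returns true on abort.
    pub fn handle_response(&mut self, resp: TransferLeaderResponse) -> bool {
        let abort = self.transferee.is_some()
            && resp.reject
            && resp.transfer_index == self.transfer_index;
        if abort {
            self.transferee = None;
        }
        abort
    }
}

/// Follower side: `can_campaign` stands in for the follower's local state check.
/// When it campaigns, no response is sent; otherwise it rejects with the same index.
pub fn follower_handle_transfer(
    msg: TransferLeader,
    can_campaign: bool,
) -> Option<TransferLeaderResponse> {
    if can_campaign {
        None
    } else {
        Some(TransferLeaderResponse {
            transfer_index: msg.transfer_index,
            reject: true,
        })
    }
}
```

With this shape, a rejection that echoes an older index leaves the pending transfer untouched, while a rejection that echoes the latest index lets the leader resume serving requests without waiting for an election timeout.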

Unresolved questions

What if the transfer command is dropped due to a network failure? It may be hard to handle every situation, but the design should at least work as expected when the infrastructure works as expected.

BusyJay · 2018-11-22T12:54:10Z

/cc @siddontang @xiang90 Any thoughts?

BusyJay · 2018-11-22T13:00:02Z

The index may not be easy to maintain; using a local timestamp could be a workaround.

BusyJay · 2018-11-23T05:54:34Z

One difference between the current implementation and the thesis is that raft adopts a conf change only after it is committed and applied, whereas the thesis suggests always using the latest available configuration. In that case, the problem described here should not exist, as TimeoutNow is sent only after the follower's log is up to date.

BusyJay · 2019-05-09T07:19:40Z

If #234 is solved, the index is no longer necessary. The only thing left to do is send a response back to the leader.

BusyJay added the Feature and Request for Comment labels Nov 22, 2018

zhangjinpeng87 mentioned this issue Nov 22, 2018: Should not transfer leader to a new added peer tikv/tikv#3819 (Open)

nolouch mentioned this issue Dec 5, 2018: raftstore: reject transfer leader to the recently added peers tikv/tikv#3878 (Merged)

overvenus mentioned this issue Jan 9, 2019: Stash MsgTimeout to fix transfer leader bug tikv/tikv#4045 (Closed)
