ReCraft: Self-Contained Split, Merge, and Membership Change of Raft Protocol

๐Ÿ“… 2025-04-21
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing Raft reconfiguration schemes rely on a centralized coordinator and require full-cluster downtime, introducing single-point-of-failure vulnerabilities and correctness risks. This paper proposes ReCraft: a coordinator-free dynamic reconfiguration mechanism supporting split/merge operations and fine-grained membership changes. Its core is a self-contained, multi-level reconfiguration protocol, realized through extensions to the Raft state machine and a redesigned distributed consensus logicโ€”formally verified for safety and liveness using TLA+. Implemented in etcd, ReCraft demonstrates that reconfiguration blocks only necessary log submissions, incurs <8% throughput degradation, reduces split/merge latency by 57%, and completely eliminates single-point failures inherent in centralized coordination.

Technology Category

Application Category

๐Ÿ“ Abstract
Designing reconfiguration schemes for consensus protocols is challenging because subtle corner cases during reconfiguration could invalidate the correctness of the protocol. Thus, most systems that embed consensus protocols conservatively implement the reconfiguration and refrain from developing an efficient scheme. Existing implementations often stop the entire system during reconfiguration and rely on a centralized coordinator, which can become a single point of failure. We present ReCraft, a novel reconfiguration protocol for Raft, which supports multi- and single-cluster-level reconfigurations. ReCraft does not rely on external coordinators and blocks minimally. ReCraft enables the sharding of Raft clusters with split and merge reconfigurations and adds a membership change scheme that improves Raft. We prove the safety and liveness of ReCraft and demonstrate its efficiency through implementations in etcd.
Problem

Research questions and friction points this paper is trying to address.

Designing efficient reconfiguration schemes for Raft consensus protocol
Eliminating reliance on external coordinators during reconfiguration
Enabling sharding with split, merge, and membership change operations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-contained split and merge for Raft
No external coordinators, minimal blocking
Safe membership change scheme for Raft
๐Ÿ”Ž Similar Papers
No similar papers found.
K
Kezhi Xiong
Northeastern University
S
Soonwon Moon
Seoul National University
J
Joshua Kang
Northeastern University
B
Bryant Curto
Northeastern University
Jieung Kim
Jieung Kim
Assistant Professor, Yonsei University
PL theoryprogram logicsformal verificationsystem software & neural network reliability
Ji-Yong Shin
Ji-Yong Shin
Northeastern University
Cloud StorageDistributed SystemsDatacenter NetworksFormal Verification