Post

RedPlane: Enabling Fault-Tolerant Stateful in-Switch Applications

URL: https://doi.org/10.1145/3452296.3472905

Bibtex Entry:

@inproceedings{2021-Kim-sigcomm, author = “Kim, Daehyeok and Nelson, Jacob and Ports, Dan R. K. and Sekar, Vyas and Seshan, Srinivasan”, title = “RedPlane: Enabling Fault-Tolerant Stateful in-Switch Applications”, year = “2021”, month = “August”, isbn = “9781450383837”, publisher = “Association for Computing Machinery”, address = “New York, NY, USA”, url = “https://doi.org/10.1145/3452296.3472905”, doi = “10.1145/3452296.3472905”, abstract = “Many recent efforts have demonstrated the performance benefits of running datacenter functions (emph{e.g.,} NATs, load balancers, monitoring) on programmable switches. However, a key missing piece remains: fault tolerance. This is especially critical as the network is no longer stateless and pure endpoint recovery does not suffice. In this paper, we design and implement RedPlane, a fault-tolerant state store for stateful in-switch applications. This provides in-switch applications consistent access to their state, even if the switch they run on fails or traffic is rerouted to an alternative switch. We address key challenges in devising a practical, provably correct replication protocol and implementing it in the switch data plane. Our evaluations show that RedPlane incurs negligible overhead and enables end-to-end applications to rapidly recover from switch failures.”, booktitle = “ACM SIGCOMM”, pages = “223–244”, numpages = “22”, keywords = “state replication, programmable switches, programmable networks, fault tolerance”, location = “Virtual Event, USA”, series = “SIGCOMM ‘21” }

This post is licensed under CC BY 4.0 by the author.