CrossTalk: Making Low-Latency Fault Tolerance Cheap by Exploiting Redundant Networks

Loveless, Andrew; Phan, Linh Thi; Erickson, Lisa; Dreslinski, Ronald; Kasikci, Baris

doi:10.1145/3609436

Real-time embedded systems perform many important functions in the modern world. A standard way to tolerate faults in these systems is with Byzantine fault-tolerant (BFT) state machine replication (SMR), in which multiple replicas execute the same software and their outputs are compared by the actuators. Unfortunately, traditional BFT SMR protocols are slow, requiring replicas to exchange sensor data back and forth over multiple rounds in order to reach agreement before each execution. The state of the art in reducing the latency of BFT SMR is eager execution, in which replicas execute on data from different sensors simultaneously on different processor cores. However, this technique results in 3–5× higher computation overheads compared to traditional BFT SMR systems, significantly limiting schedulability. We present CrossTalk, a new BFT SMR protocol that leverages the prevalence of redundant switched networks in embedded systems to reduce latency without added computation. The key idea is to use specific algorithms to move messages between redundant network planes (which many systems already possess) as the messages travel from the sensors to the replicas. As a result, CrossTalk can ensure agreement automatically in the network, avoiding the need for any communication between replicas. Our evaluation shows that CrossTalk improves schedulability by 2.13–4.24× over the state of the art. Moreover, in a NASA simulation of a real spaceflight mission, CrossTalk tolerates more faults than the state of the art while using nearly 3× less processor time.

Award ID(s): 1703936 1955670 1750158

PAR ID: 10497722

Author(s) / Creator(s): Loveless, Andrew; Phan, Linh Thi; Erickson, Lisa; Dreslinski, Ronald; Kasikci, Baris

Publisher / Repository: ACM

Date Published: 2023-10-31

Journal Name: ACM Transactions on Embedded Computing Systems

Volume: 22

Issue: 5s

ISSN: 1539-9087

Page Range / eLocation ID: 1 to 25

Format(s): Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1145/3609436
