Software Fault Tolerance in Real-Time Systems: Identifying the Future Research Questions

Reghenzani, Federico; Guo, Zhishan; Fornaciari, William

doi:10.1145/3589950

Citation Details

Software Fault Tolerance in Real-Time Systems: Identifying the Future Research Questions

Tolerating hardware faults in modern architectures is becoming a prominent problem due to the miniaturization of the hardware components, their increasing complexity, and the necessity to reduce costs. Software-Implemented Hardware Fault Tolerance approaches have been developed to improve system dependability regarding hardware faults without resorting to custom hardware solutions. However, these come at the expense of making the satisfaction of the timing constraints of the applications/activities harder from a scheduling standpoint. This article surveys the current state-of-the-art of fault tolerance approaches when used in the context of real-time systems, identifying the main challenges and the cross-links between these two topics. We propose a joint scheduling-failure analysis model that highlights the formal interactions among software fault tolerance mechanisms and timing properties. This model allows us to present and discuss many open research questions with the final aim to spur future research activities. more »

Award ID(s):: 2246672

PAR ID:: 10519372

Author(s) / Creator(s):: Reghenzani, Federico; Guo, Zhishan; Fornaciari, William

Publisher / Repository:: ACM Computing Surveys

Date Published:: 2023-12-31

Journal Name:: ACM Computing Surveys

Volume:: 55

Issue:: 14s

ISSN:: 0360-0300

Page Range / eLocation ID:: 1 to 30

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3589950

More Like this