Reliability and Fault-tolerance of Computer Systems

Course Description

Origin and characteristics of faults. System reliability. Fault distributions. Safety, testability, and dependability of computer systems. Hardware, software, time, and information redundancy. Techniques for high availability, fault tolerance, monitoring, detection, diagnosis, recovery, and graceful service degradation. Software fault tolerance and reliability. Hardware and software fault-tolerant architectures.

Study Programmes

Postgraduate doctoral study programme

Literature

Daniel Siewiorek, Robert Swarz (2014), Reliable Computer Systems, Digital Press
J.C. Geffroy, G. Motet (2013), Design of Dependable Computing Systems, Springer Science & Business Media
J. Voas, J. Bechta, M. Vouk (2002), Developing Fault Tolerant Software, IEEE Press
Eric Bauer (2011), Design for Reliability, John Wiley & Sons

For students


ID: 154829
Semester: Summer
English level: L0