Reliability and Fault-tolerance of Computer Systems

Course Description

Origin and characteristics of faults. System reliability. Fault distributions. Safety, testability and dependability of computer systems. Hardware, software, time and information redundancy. Techniques for high availability, fault tolerance, monitoring, detection, diagnosis, recovery and graceful service degradation. Software fault tolerance and reliability. Hardware and software fault tolerant architectures.

Study Programmes

Postgraduate doctoral study programme

Literature

(.), D. Siewiorek, R. Swarz, The Theory and Practice of Reliable System Design, Digital Press, Bedford, 1992.,
(.), J. C. Geffroy, G. Motet: Design of Dependable Computing Systems, Kluwer Academic Publishers, Boston, 2002.,
(.), J. Voas, J. Bechta, M. Vouk: Developing Fault Tolerant Software, IEEE Press, 2002.,
(.), E. Bauer: Design for Reliability: Information and Computer-Based Systems, Wiley-IEEE Press, 2010.,

General

ID 154829
  Summer semester
6 ECTS