Course detail

Fault Tolerant Systems

FIT-SODAcad. year: 2022/2023

Principles of fault tolerance, structures and techniques. Codes for control and correction of information. Cyclic codes, Fire codes, Galois fields, BCH and RS codes. Modelling, estimation and control of reliability. Fail-safe systems. Architecture of FT systems. Fault tolerance at VLSI level. Radiation safety and fault tolerance. Fault tolerance in computer units, memories, computer systems and communication networks. Fault tolerant and secure control systems. Distributed tolerant systems, fault tolerant software. Intrduction to quantum computing.

State doctoral exam - Final interview topics:

  1. Principe's, approaches and parameters of safe and fault tolerant systems.
  2. Parity codes, multidimensional parity codes, low-density parity codes, arithmentic codes, Raptor codes.
  3. Hamming codes, byte error correction codes, matrix notation of of coding and decoding.
  4. Cyclic codes, basic and fast CRC calculation.
  5. Galois finite field GF(n) construction, minimum  polynomials.
  6. Construction and applications of BCH and RS codes.
  7. Time redundancy, radiation tolerant circuits and systems.
  8. Fault tolerance in VLSI structures - memories and multiprocessors, reconfiguration, fault and error containment.
  9. Fault tolerance in communication systems.
  10. Software implemented fault tolerance, Byzantine agreement.

Language of instruction

Czech

Mode of study

Not applicable.

Learning outcomes of the course unit

Skills and approaches to building fault tolerance using hardware and codes. To research new techniques and their applications.

To get know a novel approaches to ensure availability and safety of technical means.

Prerequisites

Computer design and software tools.

Co-requisites

Not applicable.

Planned learning activities and teaching methods

Not applicable.

Assesment methods and criteria linked to learning outcomes

Project topic selection and systematic consultatiobs.
Additional sessions after cunsultations wuth the lecturer.

Course curriculum

Not applicable.

Work placements

Not applicable.

Aims

To inform the students about different types of redundancy and its application for the design of computer systems being able to function correctly even under presence of faults and data errors. To give the students literary sources and principles of advanced topics in the area of fault and error tolerance for the choice of up-to-date research topics.

Specification of controlled education, way of implementation and compensation for absences

Final exam, project submission and presentation.

 

Recommended optional programme components

Not applicable.

Prerequisites and corequisites

Not applicable.

Basic literature

Not applicable.

Recommended reading

Nicolaidis M.: Soft Errors in Modern Electronic Systems, Spribger, 2011
Shokrollahi A., Luby M.: Raptor Codes, NOW Publishers, 2011
Szefer J.: Principles of Secure Processor Architecture Design, Morgan & Claypool, 2019

Classification of course in study plans

  • Programme DIT Doctoral 0 year of study, summer semester, compulsory-optional
  • Programme DIT Doctoral 0 year of study, summer semester, compulsory-optional
  • Programme DIT-EN Doctoral 0 year of study, summer semester, compulsory-optional
  • Programme DIT-EN Doctoral 0 year of study, summer semester, compulsory-optional

  • Programme CSE-PHD-4 Doctoral

    branch DVI4 , 0 year of study, summer semester, elective

  • Programme CSE-PHD-4 Doctoral

    branch DVI4 , 0 year of study, summer semester, elective

  • Programme CSE-PHD-4 Doctoral

    branch DVI4 , 0 year of study, summer semester, elective

  • Programme CSE-PHD-4 Doctoral

    branch DVI4 , 0 year of study, summer semester, elective

Type of course unit

 

Lecture

39 hod., optionally

Teacher / Lecturer

Syllabus

  1. FT design methodology, structures and techniques.
  2. Error control codes. Parity codes, multidimensional parity codes, arithmetic codes.
  3. Residue codes, Hamming codes, sparse parity codes. Raptor codes.
  4. Cyclic codes, Fire codes.
  5. Galois fields GF(n) and their construction, BCH and Reed-Solomon codes, byte error detection.
  6. Time redundancy, alternating logic.
  7. Reliability modeling, combinatorial models, MIL-HDBK-217. Markov reliability models.
  8. Safe systems.
  9. FT architectures.
  10. VLSI fault tolerance. Radiation fault tolerance. 
  11. FT in computer units, in memorie, in computer and communication systems.
  12. Fault tolerant and secure control systems.
  13. Distributed FT systems.
  14. Software implemented fault tolerance.

Guided consultation in combined form of studies

26 hod., optionally

Teacher / Lecturer