There are many applications in which the reliability of the overall system must be far higher than the reliability of its individual components. In such cases, designers devise mechanisms and architectures that allow the system to either completely mask the effects of a component failure or recover from it so quickly that the application is not seriously affected. This is the work of fault-tolerant designers and their work is increasingly important and complex not only because of the increasing number of “mission critical” applications, but also because the diminishing reliability of hardware means that even systems for non-critical applications will need to be designed with fault-tolerance in mind.
Reflecting the real-world challenges faced by designers of these systems, this book addresses fault tolerance design with a systems approach to both hardware and software. No other text on the market takes this approach, nor offers the comprehensive and up-to-date treatment Koren and Krishna provide. Students, designers and architects of high performance processors will value this comprehensive overview of the field.
* The first book on fault tolerance design with a systems approach
* Comprehensive coverage of both hardware and software fault tolerance, as well as information and time redundancy
* Incorporated case studies highlight six different computer systems with fault-tolerance techniques implemented in their design
* Available to lecturers is a complete ancillary package including online solutions manual for instructors and PowerPoint slides