Proceedings






Dependable Computing - EDCC-1


Book Description

This book presents the proceedings of the First European Dependable Computing Conference (EDCC-1), held in Berlin, Germany, in October 1994. EDCC is the merger of two former European events on dependable computing. The volume comprises 34 refereed full papers selected from 106 submissions. The contributions address all current aspects of dependable computing and reflect the state of the art in dependable systems research and advanced applications; among the topics covered are hardware and software reliability, safety-critical and secure systems, fault-tolerance and detection, verification and validation, formal methods, hardware and software testing, and parallel and distributed systems.




Reliable Distributed Systems


Book Description

Explains fault tolerance in clear terms, with concrete examples drawn from real-world settings. Highly practical focus aimed at building "mission-critical" networked applications that remain secure.




Guide to Reliable Distributed Systems


Book Description

This book describes the key concepts, principles and implementation options for creating high-assurance cloud computing solutions. The guide starts with a broad technical overview and basic introduction to cloud computing, looking at the overall architecture of the cloud, client systems, the modern Internet and cloud computing data centers. It then delves into the core challenges of showing how reliability and fault-tolerance can be abstracted, how the resulting questions can be solved, and how the solutions can be leveraged to create a wide range of practical cloud applications. The author’s style is practical, and the guide should be readily understandable without any special background. Concrete examples are often drawn from real-world settings to illustrate key insights. Appendices show how the most important reliability models can be formalized, describe the API of the Isis2 platform, and offer more than 80 problems at varying levels of difficulty.




Advances in Multimedia Information Processing — PCM 2001


Book Description

Welcome to the second IEEE Pacific Rim Conference on Multimedia (IEEE PCM 2001), held in Zhongguancun, Beijing, China, October 22-24, 2001. Building upon the success of the inaugural IEEE PCM 2000 in Sydney in December 2000, the second PCM again brought together the researchers, developers, practitioners, and educators of multimedia in the Pacific area. Theoretical breakthroughs and practical systems were presented at this conference, thanks to the sponsorship of the IEEE Circuits and Systems Society, IEEE Signal Processing Society, China Computer Foundation, China Society of Image and Graphics, National Natural Science Foundation of China, Tsinghua University, and Microsoft Research, China. IEEE PCM 2001 featured a comprehensive program including keynote talks, regular paper presentations, posters, demos, and special sessions. We received 244 papers and accepted only 104 of them as regular papers and 53 as poster papers. Our special session chairs, Shin'ichi Satoh and Mohan Kankanhalli, organized 6 special sessions. We acknowledge the great contribution of our program committee members and paper reviewers, who spent many hours reviewing submitted papers and providing valuable comments for the authors. The conference would not have been successful without the help of so many people. We greatly appreciated the support of our honorary chairs: Prof. Sun-Yuan Kung of Princeton University, Dr. Ya-Qin Zhang of Microsoft Research China, and Prof.




Introduction to Reliable and Secure Distributed Programming


Book Description

In modern computing a program is usually distributed among several processes. The fundamental challenge when developing reliable and secure distributed programs is to support the cooperation of processes required to execute a common task, even when some of these processes fail. Failures may range from crashes to adversarial attacks by malicious processes. Cachin, Guerraoui, and Rodrigues present an introductory description of fundamental distributed programming abstractions together with algorithms to implement them in distributed systems, where processes are subject to crashes and malicious attacks. The authors follow an incremental approach by first introducing basic abstractions in simple distributed environments, before moving to more sophisticated abstractions and more challenging environments. Each core chapter is devoted to one topic, covering reliable broadcast, shared memory, consensus, and extensions of consensus. For every topic, many exercises and their solutions enhance the understanding. This book represents the second edition of "Introduction to Reliable Distributed Programming". Its scope has been extended to include security against malicious actions by non-cooperating processes. This important domain has become widely known under the name "Byzantine fault-tolerance".




Fault-Tolerant Real-Time Systems


Book Description

Real-time computer systems are very often subject to dependability requirements because of their application areas. Fly-by-wire airplane control systems, control of power plants, industrial process control systems and others are required to continue their function despite faults. Fault-tolerance and real-time requirements thus constitute a kind of natural combination in process control applications. Systematic fault-tolerance is based on redundancy, which is used to mask failures of individual components. The problem of replica determinism is thereby to ensure that replicated components show consistent behavior in the absence of faults. It might seem trivial that, given an identical sequence of inputs, replicated computer systems will produce consistent outputs. Unfortunately, this is not the case. The problem of replica non-determinism and the presentation of its possible solutions is the subject of Fault-Tolerant Real-Time Systems: The Problem of Replica Determinism. The field of automotive electronics is an important application area of fault-tolerant real-time systems. Systems like anti-lock braking, engine control, active suspension or vehicle dynamics control have demanding real-time and fault-tolerance requirements. These requirements have to be met even in the presence of very limited resources, since cost is extremely important. Because of these interesting properties, Fault-Tolerant Real-Time Systems gives an introduction to the application area of automotive electronics. The requirements of automotive electronics are a topic of discussion in the remainder of this work and are used as a benchmark to evaluate solutions to the problem of replica determinism.




Fault-Tolerant Systems


Book Description

Fault-Tolerant Systems is the first book on fault tolerance design with a systems approach to both hardware and software. No other text on the market takes this approach, nor offers the comprehensive and up-to-date treatment that Koren and Krishna provide. This book incorporates case studies that highlight six different computer systems with fault-tolerance techniques implemented in their design. A complete ancillary package is available to lecturers, including online solutions manual for instructors and PowerPoint slides. Students, designers, and architects of high performance processors will value this comprehensive overview of the field.

- The first book on fault tolerance design with a systems approach
- Comprehensive coverage of both hardware and software fault tolerance, as well as information and time redundancy
- Incorporated case studies highlight six different computer systems with fault-tolerance techniques implemented in their design
- Available to lecturers is a complete ancillary package including online solutions manual for instructors and PowerPoint slides