Application-Layer Fault-Tolerance Protocols


Book Description

"This book increases awareness of the need for application-level fault-tolerance (ALFT) through introduction of problems and qualitative analysis of solutions"--Provided by publisher.




Fault-Tolerance Techniques for Spacecraft Control Computers


Book Description

• Comprehensive coverage of all aspects of space-application-oriented fault tolerance techniques
• Written by an experienced expert who has worked on fault tolerance for the Chinese space program for almost three decades
• Provides a systematic text on cutting-edge fault tolerance techniques in spacecraft control computers, with emphasis on practical engineering knowledge
• Presents fundamental and advanced theories and technologies in a logical and easy-to-understand manner
• Beneficial to readers inside and outside the area of space applications




Fault-Tolerance Techniques for High-Performance Computing


Book Description

This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.




Patterns for Fault Tolerant Software


Book Description

Software patterns have revolutionized the way developers and architects think about how software is designed, built and documented. This new title in Wiley’s prestigious Series in Software Design Patterns presents proven techniques to achieve patterns for fault tolerant software. This is a key reference for experts seeking to select a technique appropriate for a given system. Readers are guided from concepts and terminology, through common principles and methods, to advanced techniques and practices in the development of software systems. References will provide access points to the key literature, including descriptions of exemplar applications of each technique. Organized into a collection of software techniques, specific techniques can be easily found with sufficient detail to allow appropriate choices for the system being designed.




Coordination Models and Languages


Book Description

This volume contains the proceedings of the Fifth International Conference on Coordination Models and Languages (Coordination 2002), held in York, UK, 8–11 April 2002. Coordination models and languages close the conceptual gap between the cooperation model used by the constituent parts of an application and the lower-level communication model used in its implementation. Coordination-based methods provide a clean separation between individual software components and their interactions within their overall software organization. This separation, together with the higher-level abstractions offered by coordination models and languages, improve software productivity, enhance maintainability, advocate modularity, promote reusability, and lead to software organizations and architectures that are more tractable and more amenable to verification and global analysis. Coordination is relevant in design, development, debugging, maintenance, and reuse of all complex concurrent and distributed systems. Specifically, coordination becomes paramount in the context of open systems, systems with mobile entities, and dynamically re-configurable evolving systems. Moreover, coordination models and languages focus on such key issues in Component Based Software Engineering as specification, interaction, and dynamic composition of components.




Parallel and Distributed Processing and Applications


Book Description

Welcome to the proceedings of the 2nd International Symposium on Parallel and Distributed Processing and Applications (ISPA2004) which was held in Hong Kong, China, 13–15 December, 2004. With the advance of computer networks and hardware technology, parallel and distributed processing has become a key technology which plays an important part in determining future research and development activities in many academic and industrial branches. It provides a means to solve computationally intensive problems by improving processing speed. It is also the only viable approach to building highly reliable and inherently distributed applications. ISPA2004 provided a forum for scientists and engineers in academia and industry to exchange and discuss their experiences, new ideas, research results, and applications about all aspects of parallel and distributed computing. There was a very large number of paper submissions (361) from 26 countries and regions, including not only Asia and the Pacific, but also Europe and North America. All submissions were reviewed by at least three program or technical committee members or external reviewers. It was extremely difficult to select the presentations for the conference because there were so many excellent and interesting submissions. In order to allocate as many papers as possible and keep the high quality of the conference, we finally decided to accept 78 regular papers and 38 short papers for oral technical presentations. We believe that all of these papers and topics not only provide novel ideas, new results, work in progress and state-of-the-art techniques in this field, but also stimulate the future research activities in the area of parallel and distributed computing with applications.




Service-Driven Approaches to Architecture and Enterprise Integration


Book Description

While business functions such as manufacturing, operations, and marketing often utilize various software applications, they tend to operate without the ability to interact with each other and exchange data. This makes it challenging to gain an enterprise-wide view of a business and to assist real-time decision making. Service-Driven Approaches to Architecture and Enterprise Integration addresses the issues of integrating assorted software applications and systems by using a service-driven approach. Supporting the dynamics of business needs, this book highlights the tools, techniques, and governance aspects of design, and implements cost-effective enterprise integration solutions. It is a valuable source of information for software architects, SOA practitioners, and software engineers as well as researchers and students in pursuit of extensible and agile software design.




Practical Distributed Processing


Book Description

Distributed processing has a strong theoretical foundation, but many day-to-day practitioners make limited use of the advantages this theory can give them. The result includes unreliable systems with obscure and intermittent failures that can cost time, money and, in extreme cases, lives. Reliable construction of distributed and concurrent systems must incorporate theory in practice. This book provides a concise presentation of the theory closely linked to the practical realization of these concepts. This highly practical presentation contains all the elements needed for a complete development of a distributed system. The book includes examples from C, Java and Eiffel, and sample code is available online.




Rigorous Development of Complex Fault-Tolerant Systems


Book Description

This book brings together 19 papers focusing on the application of rigorous design techniques to the development of fault-tolerant, software-based systems. It is an outcome of the REFT 2005 Workshop on Rigorous Engineering of Fault-Tolerant Systems held in conjunction with the Formal Methods 2005 conference at Newcastle upon Tyne, UK, in July 2005.




Fault-tolerant IP and MPLS Networks


Book Description

Helping readers master important IP and MPLS concepts, this instructive resource is written by a technical leader for the MPLS Group from Cisco Systems Internet Technologies Division. The book guides networking professionals as they design fault tolerant networks.