Stochastic Models for Fault Tolerance


Book Description

As modern society relies on the fault-free operation of complex computing systems, system fault-tolerance has become an indispensable requirement. Therefore, we need mechanisms that guarantee correct service in cases where system components fail, be they software or hardware elements. Redundancy patterns are commonly used, for either redundancy in space or redundancy in time. Wolter’s book details methods of redundancy in time that need to be issued at the right moment. In particular, she addresses the so-called "timeout selection problem", i.e., the question of choosing the right time for different fault-tolerance mechanisms like restart, rejuvenation and checkpointing. Restart indicates the pure system restart, rejuvenation denotes the restart of the operating environment of a task, and checkpointing includes saving the system state periodically and reinitializing the system at the most recent checkpoint upon failure of the system. Her presentation includes a brief introduction to the methods, their detailed stochastic description, and also aspects of their efficient implementation in real-world systems. The book is targeted at researchers and graduate students in system dependability, stochastic modeling and software reliability. Readers will find here an up-to-date overview of the key theoretical results, making this the only comprehensive text on stochastic models for restart-related problems.




Stochastic Models in Reliability Engineering


Book Description

This book is a collective work by many leading scientists, analysts, mathematicians, and engineers who have been working at the front end of reliability science and engineering. The book covers conventional and contemporary topics in reliability science, all of which have seen extended research activities in recent years. The methods presented in this book are real-world examples that demonstrate improvements in essential reliability and availability for industrial equipment such as medical magnetic resonance imaging, power systems, traction drives for a search and rescue helicopter, and air conditioning systems. The book presents real case studies of redundant multi-state air conditioning systems for chemical laboratories and covers assessments of reliability and fault tolerance and availability calculations. Conventional and contemporary topics in reliability engineering are discussed, including degradation, networks, and dynamic reliability, resilience, and multi-state systems, all of which are relatively new topics to the field. The book is aimed at engineers and scientists, as well as postgraduate students involved in reliability design, analysis, and experiments and applied probability and statistics.




Active Fault Tolerant Control Systems


Book Description

Modern technological systems rely on sophisticated control functions to meet increased performance requirements. For such systems, Fault Tolerant Control Systems (FTCS) need to be developed. Active FTCS are dependent on a Fault Detection and Identification (FDI) process to monitor system performance and to detect and isolate faults in the systems. The main objective of this book is to study and to validate some important issues in real-time Active FTCS by means of theoretical analysis and simulation. Several models are presented to achieve this objective, taking into consideration practical aspects of the system to be controlled, performance deterioration in FDI algorithms, and limitations in reconfigurable control laws.




Fault-Diagnosis Systems


Book Description

With increasing demands for efficiency and product quality plus progress in the integration of automatic control systems in high-cost mechatronic and safety-critical processes, the field of supervision (or monitoring), fault detection and fault diagnosis plays an important role. The book gives an introduction into advanced methods of fault detection and diagnosis (FDD). After definitions of important terms, it considers the reliability, availability, safety and systems integrity of technical processes. Then fault-detection methods for single signals without models such as limit and trend checking and with harmonic and stochastic models, such as Fourier analysis, correlation and wavelets are treated. This is followed by fault detection with process models using the relationships between signals such as parameter estimation, parity equations, observers and principal component analysis. The treated fault-diagnosis methods include classification methods from Bayes classification to neural networks with decision trees and inference methods from approximate reasoning with fuzzy logic to hybrid fuzzy-neuro systems. Several practical examples for fault detection and diagnosis of DC motor drives, a centrifugal pump, automotive suspension and tire demonstrate applications.




Structural Failure Models for Fault-Tolerant Distributed Computing


Book Description

Timo Warns has developed tractable fault models that, while being non-probabilistic, are accurate for dependent and propagating faults. Using seminal problems such as consensus and constructing coteries, he demonstrates how the new models can be used to design and evaluate effective and efficient means of fault tolerance.




Encyclopedia of Systems and Control


Book Description

The Encyclopedia of Systems and Control collects a broad range of short expository articles that describe the current state of the art in the central topics of control and systems engineering as well as in many of the related fields in which control is an enabling technology. The editors have assembled the most comprehensive reference possible, and this has been greatly facilitated by the publisher’s commitment continuously to publish updates to the articles as they become available in the future. Although control engineering is now a mature discipline, it remains an area in which there is a great deal of research activity, and as new developments in both theory and applications become available, they will be included in the online version of the encyclopedia. A carefully chosen team of leading authorities in the field has written the well over 250 articles that comprise the work. The topics range from basic principles of feedback in servomechanisms to advanced topics such as the control of Boolean networks and evolutionary game theory. Because the content has been selected to reflect both foundational importance as well as subjects that are of current interest to the research and practitioner communities, a broad readership that includes students, application engineers, and research scientists will find material that is of interest.




Diagnosis and Fault-tolerant Control 1


Book Description

This book presents recent advances in fault diagnosis strategies for complex dynamic systems. Its impetus derives from the need for an overview of the challenges of the fault diagnosis technique, especially for those demanding systems that require reliability, availability, maintainability and safety to ensure efficient operations. Moreover, the need for a high degree of tolerance with respect to possible faults represents a further key point, primarily for complex systems, as modeling and control are inherently challenging, and maintenance is both expensive and safety-critical. Diagnosis and Fault-tolerant Control 1 also presents and compares different diagnosis schemes using established case studies that are widely used in related literature. The main features of this book regard the analysis, design and implementation of proper solutions for the problems of fault diagnosis in safety critical systems. The design of the considered solutions involves robust data-driven, model-based approaches.




Stochastic Models in Reliability and Maintenance


Book Description

Our daily lives can be maintained by the high-technology systems. Computer systems are typical examples of such systems. We can enjoy our modern lives by using many computer systems. Much more importantly, we have to maintain such systems without failure, but cannot predict when such systems will fail and how to fix such systems without delay. A stochastic process is a set of outcomes of a random experiment indexed by time, and is one of the key tools needed to analyze the future behavior quantitatively. Reliability and maintainability technologies are of great interest and importance to the maintenance of such systems. Many mathematical models have been and will be proposed to describe reliability and maintainability systems by using the stochastic processes. The theme of this book is "Stochastic Models in Reliability and Main tainability. " This book consists of 12 chapters on the theme above from the different viewpoints of stochastic modeling. Chapter 1 is devoted to "Renewal Processes," under which classical renewal theory is surveyed and computa tional methods are described. Chapter 2 discusses "Stochastic Orders," and in it some definitions and concepts on stochastic orders are described and ag ing properties can be characterized by stochastic orders. Chapter 3 is devoted to "Classical Maintenance Models," under which the so-called age, block and other replacement models are surveyed. Chapter 4 discusses "Modeling Plant Maintenance," describing how maintenance practice can be carried out for plant maintenance.




Diagnosis and Fault-Tolerant Control


Book Description

This book presents model-based analysis and design methods for fault diagnosis and fault-tolerant control. Architectural and structural models are used to analyse the propagation of the fault through the process, test fault detectability and reveal redundancies that can be used to ensure fault tolerance. Case studies demonstrate the methods presented. The second edition includes new material on reconfigurable control, diagnosis of nonlinear systems, and remote diagnosis, plus new examples and updated bibliography.




Applied Stochastic System Modeling


Book Description

This book was written for an introductory one-semester or two-quarter course in stochastic processes and their applications. The reader is assumed to have a basic knowledge of analysis and linear algebra at an undergraduate level. Stochastic models are applied in many fields such as engineering systems, physics, biology, operations research, business, economics, psychology, and linguistics. Stochastic modeling is one of the promising kinds of modeling in applied probability theory. This book is intended to introduce basic stochastic processes: Poisson pro cesses, renewal processes, discrete-time Markov chains, continuous-time Markov chains, and Markov-renewal processes. These basic processes are introduced from the viewpoint of elementary mathematics without going into rigorous treatments. This book also introduces applied stochastic system modeling such as reliability and queueing modeling. Chapters 1 and 2 deal with probability theory, which is basic and prerequisite to the following chapters. Many important concepts of probabilities, random variables, and probability distributions are introduced. Chapter 3 develops the Poisson process, which is one of the basic and im portant stochastic processes. Chapter 4 presents the renewal process. Renewal theoretic arguments are then used to analyze applied stochastic models. Chapter 5 develops discrete-time Markov chains. Following Chapter 5, Chapter 6 deals with continuous-time Markov chains. Continuous-time Markov chains have im portant applications to queueing models as seen in Chapter 9. A one-semester course or two-quarter course consists of a brief review of Chapters 1 and 2, fol lowed in order by Chapters 3 through 6.