Automated Optimization Methods for Scientific Workflows in e-Science Infrastructures


Book Description

Scientific workflows have emerged as a key technology that assists scientists with the design, management, execution, sharing and reuse of in silico experiments. Workflow management systems simplify the management of scientific workflows by providing graphical interfaces for their development, monitoring and analysis. Nowadays, e-Science combines such workflow management systems with large-scale data and computing resources into complex research infrastructures. For instance, e-Science allows the conveyance of best practice research in collaborations by providing workflow repositories, which facilitate the sharing and reuse of scientific workflows. However, scientists are still faced with different limitations while reusing workflows. One of the most common challenges they meet is the need to select appropriate applications and their individual execution parameters. If scientists do not want to rely on default or experience-based parameters, the best-effort option is to test different workflow set-ups using either trial and error approaches or parameter sweeps. Both methods may be inefficient or time consuming respectively, especially when tuning a large number of parameters. Therefore, scientists require an effective and efficient mechanism that automatically tests different workflow set-ups in an intelligent way and will help them to improve their scientific results. This thesis addresses the limitation described above by defining and implementing an approach for the optimization of scientific workflows. In the course of this work, scientists’ needs are investigated and requirements are formulated resulting in an appropriate optimization concept. In a following step, this concept is prototypically implemented by extending a workflow management system with an optimization framework, including general mechanisms required to conduct workflow optimization. As optimization is an ongoing research topic, different algorithms are provided by pluggable extensions (plugins) that can be loosely coupled with the framework, resulting in a generic and quickly extendable system. In this thesis, an exemplary plugin is introduced which applies a Genetic Algorithm for parameter optimization. In order to accelerate and therefore make workflow optimization feasible at all, e-Science infrastructures are utilized for the parallel execution of scientific workflows. This is empowered by additional extensions enabling the execution of applications and workflows on distributed computing resources. The actual implementation and therewith the general approach of workflow optimization is experimentally verified by four use cases in the life science domain. All workflows were significantly improved, which demonstrates the advantage of the proposed workflow optimization. Finally, a new collaboration-based approach is introduced that harnesses optimization provenance to make optimization faster and more robust in the future.




Automated Workflow Scheduling in Self-Adaptive Clouds


Book Description

This timely text/reference presents a comprehensive review of the workflow scheduling algorithms and approaches that are rapidly becoming essential for a range of software applications, due to their ability to efficiently leverage diverse and distributed cloud resources. Particular emphasis is placed on how workflow-based automation in software-defined cloud centers and hybrid IT systems can significantly enhance resource utilization and optimize energy efficiency. Topics and features: describes dynamic workflow and task scheduling techniques that work across multiple (on-premise and off-premise) clouds; presents simulation-based case studies, and details of real-time test bed-based implementations; offers analyses and comparisons of a broad selection of static and dynamic workflow algorithms; examines the considerations for the main parameters in projects limited by budget and time constraints; covers workflow management systems, workflow modeling and simulation techniques, and machine learning approaches for predictive workflow analytics. This must-read work provides invaluable practical insights from three subject matter experts in the cloud paradigm, which will empower IT practitioners and industry professionals in their daily assignments. Researchers and students interested in next-generation software-defined cloud environments will also greatly benefit from the material in the book.







Web Portal Design, Implementation, Integration, and Optimization


Book Description

Web Portal Design, Implementation, Integration, and Optimization discusses the challenges faced in building web services and integrating applications in order to reach the successful benefits web portals bring to an organization. This collection of research aims to be a resource for researchers, developers, and industry practitioners involved in the technological, business, organizational and social dimensions of web portals.




Workflows for e-Science


Book Description

This is a timely book presenting an overview of the current state-of-the-art within established projects, presenting many different aspects of workflow from users to tool builders. It provides an overview of active research, from a number of different perspectives. It includes theoretical aspects of workflow and deals with workflow for e-Science as opposed to e-Commerce. The topics covered will be of interest to a wide range of practitioners.




Guide to e-Science


Book Description

This guidebook on e-science presents real-world examples of practices and applications, demonstrating how a range of computational technologies and tools can be employed to build essential infrastructures supporting next-generation scientific research. Each chapter provides introductory material on core concepts and principles, as well as descriptions and discussions of relevant e-science methodologies, architectures, tools, systems, services and frameworks. Features: includes contributions from an international selection of preeminent e-science experts and practitioners; discusses use of mainstream grid computing and peer-to-peer grid technology for “open” research and resource sharing in scientific research; presents varied methods for data management in data-intensive research; investigates issues of e-infrastructure interoperability, security, trust and privacy for collaborative research; examines workflow technology for the automation of scientific processes; describes applications of e-science.




Cloud Computing with e-Science Applications


Book Description

The amount of data in everyday life has been exploding. This data increase has been especially significant in scientific fields, where substantial amounts of data must be captured, communicated, aggregated, stored, and analyzed. Cloud Computing with e-Science Applications explains how cloud computing can improve data management in data-heavy fields such as bioinformatics, earth science, and computer science. The book begins with an overview of cloud models supplied by the National Institute of Standards and Technology (NIST), and then: Discusses the challenges imposed by big data on scientific data infrastructures, including security and trust issues Covers vulnerabilities such as data theft or loss, privacy concerns, infected applications, threats in virtualization, and cross-virtual machine attack Describes the implementation of workflows in clouds, proposing an architecture composed of two layers—platform and application Details infrastructure-as-a-service (IaaS), platform-as-a-service (PaaS), and software-as-a-service (SaaS) solutions based on public, private, and hybrid cloud computing models Demonstrates how cloud computing aids in resource control, vertical and horizontal scalability, interoperability, and adaptive scheduling Featuring significant contributions from research centers, universities, and industries worldwide, Cloud Computing with e-Science Applications presents innovative cloud migration methodologies applicable to a variety of fields where large data sets are produced. The book provides the scientific community with an essential reference for moving applications to the cloud.




Global Healthgrid


Book Description

HealthGrid 2008 is the sixth conference in this series of open forums for the integration of grid technologies and its applications in the biomedical, medical and biological domains to pave the path to an international research area in healthgrids. The main objective of the HealthGrid conference and the HealthGrid Association is the exchange and discussion of ideas, technologies, solutions and requirements that interest the grid and the life-sciences communities to foster the integration of grids into health. Subjects in this publication reflect the diversity of mature practice: Advancing Virtual Communities, offering a glimpse of the kind of communities that are brought together by means of collaboration grids; Public Health Informatics, exploring the diffusion of grid concepts and technologies in health informatics; Translational Bioinformatics, the contact point between medicine, healthcare and genomics; and Knowledge Management and Decision Support, one direction that is confidently expected to grow as the synergy of grids and 'evidence-based practice' in healthcare is exploited.




Vaccine Design


Book Description

This volume provides a practical guide providing step-by-step protocol to design and develop vaccines for human diseases. Divided into three volumes, Volume 1: Vaccines for Human Diseases guides readers through an introductory section on future challenges for vaccinologists and the immunological mechanism of vaccines. Chapters focus on design of human vaccines for viral, bacterial, fungal, and parasitic diseases as well as tumor vaccines. Written in the format of the highly successful Methods in Molecular Biology series, each chapter includes an introduction to the topic, lists necessary materials and reagents, includes tips on troubleshooting and known pitfalls, and step-by-step, readily reproducible protocols. Authoritative and practical, Vaccine Design: Methods and Protocols, Second Edition, Volume 1: Vaccines for Human Diseases aims to be a useful practical guide to researchers to help further their study in this field.




Modern Computational Approaches to Traditional Chinese Medicine


Book Description

Recognized as an essential component of Chinese culture, Traditional Chinese Medicine (TCM) is both an ancient medical system and one still used widely in China today. TCM's independently evolved knowledge system is expressed mainly in the Chinese language and the information is frequently only available through ancient classics and confidential family records, making it difficult to utilize. The major concern in TCM is how to consolidate and integrate the data, enabling efficient retrieval and discovery of novel knowledge from the dispersed data. Computational approaches such as data mining, semantic reasoning and computational intelligence have emerged as innovative approaches for the reservation and utilization of this knowledge system. Typically, this requires an inter-disciplinary approach involving Chinese culture, computer science, modern healthcare and life sciences. This book examines the computerization of TCM information and knowledge to provide intelligent resources and supporting evidences for clinical decision-making, drug discovery, and education. Recent research results from the Traditional Chinese Medicine Informatics Group of Zhejiang University are presented, gathering in one resource systematic approaches for massive data processing in TCM. These include the utilization of modern Semantic Web and data mining methods for more advanced data integration, data analysis and integrative knowledge discovery. This book will appeal to medical professionals, life sciences students, computer scientists, and those interested in integrative, complementary, and alternative medicine. - Interdisciplinary book bringing together Traditional Chinese Medicine and computer scientists - Introduces novel network technologies to Traditional Chinese Medicine informatics - Provides theory and practical examples and case studies of new techniques