Data Governance


Book Description




Managing Data Quality


Book Description

This book explains data quality management in practical terms, focusing on three key areas - the nature of data in enterprises, the purpose and scope of data quality management, and implementing a data quality management system, in line with ISO 8000-61. Examples of good practice in data quality management are also included.




Registries for Evaluating Patient Outcomes


Book Description

This User’s Guide is intended to support the design, implementation, analysis, interpretation, and quality evaluation of registries created to increase understanding of patient outcomes. For the purposes of this guide, a patient registry is an organized system that uses observational study methods to collect uniform data (clinical and other) to evaluate specified outcomes for a population defined by a particular disease, condition, or exposure, and that serves one or more predetermined scientific, clinical, or policy purposes. A registry database is a file (or files) derived from the registry. Although registries can serve many purposes, this guide focuses on registries created for one or more of the following purposes: to describe the natural history of disease, to determine clinical effectiveness or cost-effectiveness of health care products and services, to measure or monitor safety and harm, and/or to measure quality of care. Registries are classified according to how their populations are defined. For example, product registries include patients who have been exposed to biopharmaceutical products or medical devices. Health services registries consist of patients who have had a common procedure, clinical encounter, or hospitalization. Disease or condition registries are defined by patients having the same diagnosis, such as cystic fibrosis or heart failure. The User’s Guide was created by researchers affiliated with AHRQ’s Effective Health Care Program, particularly those who participated in AHRQ’s DEcIDE (Developing Evidence to Inform Decisions About Effectiveness) program. Chapters were subject to multiple internal and external independent reviews.




Executing Data Quality Projects


Book Description

Executing Data Quality Projects, Second Edition presents a structured yet flexible approach for creating, improving, sustaining and managing the quality of data and information within any organization. Studies show that data quality problems are costing businesses billions of dollars each year, with poor data linked to waste and inefficiency, damaged credibility among customers and suppliers, and an organizational inability to make sound decisions. Help is here! This book describes a proven Ten Step approach that combines a conceptual framework for understanding information quality with techniques, tools, and instructions for practically putting the approach to work – with the end result of high-quality trusted data and information, so critical to today's data-dependent organizations. The Ten Steps approach applies to all types of data and all types of organizations – for-profit in any industry, non-profit, government, education, healthcare, science, research, and medicine. This book includes numerous templates, detailed examples, and practical advice for executing every step. At the same time, readers are advised on how to select relevant steps and apply them in different ways to best address the many situations they will face. The layout allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, best practices, and warnings. The experience of actual clients and users of the Ten Steps provide real examples of outputs for the steps plus highlighted, sidebar case studies called Ten Steps in Action. This book uses projects as the vehicle for data quality work and the word broadly to include: 1) focused data quality improvement projects, such as improving data used in supply chain management, 2) data quality activities in other projects such as building new applications and migrating data from legacy systems, integrating data because of mergers and acquisitions, or untangling data due to organizational breakups, and 3) ad hoc use of data quality steps, techniques, or activities in the course of daily work. The Ten Steps approach can also be used to enrich an organization's standard SDLC (whether sequential or Agile) and it complements general improvement methodologies such as six sigma or lean. No two data quality projects are the same but the flexible nature of the Ten Steps means the methodology can be applied to all. The new Second Edition highlights topics such as artificial intelligence and machine learning, Internet of Things, security and privacy, analytics, legal and regulatory requirements, data science, big data, data lakes, and cloud computing, among others, to show their dependence on data and information and why data quality is more relevant and critical now than ever before. - Includes concrete instructions, numerous templates, and practical advice for executing every step of The Ten Steps approach - Contains real examples from around the world, gleaned from the author's consulting practice and from those who implemented based on her training courses and the earlier edition of the book - Allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, and best practices - A companion Web site includes links to numerous data quality resources, including many of the templates featured in the text, quick summaries of key ideas from the Ten Steps methodology, and other tools and information that are available online




Data Management in Large-Scale Education Research


Book Description

Research data management is becoming more complicated. Researchers are collecting more data, using more complex technologies, all the while increasing the visibility of our work with the push for data sharing and open science practices. Ad hoc data management practices may have worked for us in the past, but now others need to understand our processes as well, requiring researchers to be more thoughtful in planning their data management routines. This book is for anyone involved in a research study involving original data collection. While the book focuses on quantitative data, typically collected from human participants, many of the practices covered can apply to other types of data as well. The book contains foundational context, instructions, and practical examples to help researchers in the field of education begin to understand how to create data management workflows for large-scale, typically federally funded, research studies. The book starts by describing the research life cycle and how data management fits within this larger picture. The remaining chapters are then organized by each phase of the life cycle, with examples of best practices provided for each phase. Finally, considerations on whether the reader should implement, and how to integrate those practices into a workflow, are discussed. Key Features: Provides a holistic approach to the research life cycle, showing how project management and data management processes work in parallel and collaboratively Can be read in its entirety, or referenced as needed throughout the life cycle Includes relatable examples specific to education research Includes a discussion on how to organize and document data in preparation for data sharing requirements Contains links to example documents as well as templates to help readers implement practices




Future Data and Security Engineering


Book Description

This book constitutes the proceedings of the 7th International Conference on Future Data and Security Engineering, FDSE 2020, which was supposed to be held in Quy Nhon, Vietnam, in November 2020, but the conference was held virtually due to the COVID-19 pandemic. The 24 full papers (of 53 accepted full papers) presented together with 2 invited keynotes were carefully reviewed and selected from 161 submissions. The other 29 accepted full and 8 short papers are included in CCIS 1306. The selected papers are organized into the following topical headings: security issues in big data; big data analytics and distributed systems; advances in big data query processing and optimization; blockchain and applications; industry 4.0 and smart city: data analytics and security; advanced studies in machine learning for security; and emerging data management systems and applications.




Driving Data Quality with Data Contracts


Book Description

Everything you need to know to apply data contracts and build a truly data-driven organization that harnesses quality data to deliver tangible business value Purchase of the print or Kindle book includes a free PDF eBook Key Features Understand data contracts and their power to resolving the problems in contemporary data platforms Learn how to design and implement a cutting-edge data platform powered by data contracts Access practical guidance from the pioneer of data contracts to get expert insights on effective utilization Book DescriptionDespite the passage of time and the evolution of technology and architecture, the challenges we face in building data platforms persist. Our data often remains unreliable, lacks trust, and fails to deliver the promised value. With Driving Data Quality with Data Contracts, you’ll discover the potential of data contracts to transform how you build your data platforms, finally overcoming these enduring problems. You’ll learn how establishing contracts as the interface allows you to explicitly assign responsibility and accountability of the data to those who know it best—the data generators—and give them the autonomy to generate and manage data as required. The book will show you how data contracts ensure that consumers get quality data with clearly defined expectations, enabling them to build on that data with confidence to deliver valuable analytics, performant ML models, and trusted data-driven products. By the end of this book, you’ll have gained a comprehensive understanding of how data contracts can revolutionize your organization’s data culture and provide a competitive advantage by unlocking the real value within your data.What you will learn Gain insights into the intricacies and shortcomings of today's data architectures Understand exactly how data contracts can solve prevalent data challenges Drive a fundamental transformation of your data culture by implementing data contracts Discover what goes into a data contract and why it's important Design a modern data architecture that leverages the power of data contracts Explore sample implementations to get practical knowledge of using data contracts Embrace best practices for the successful deployment of data contracts Who this book is for If you’re a data engineer, data leader, architect, or practitioner thinking about your data architecture and looking to design one that enables your organization to get the most value from your data, this book is for you. Additionally, staff engineers, product managers, and software engineering leaders and executives will also find valuable insights.




SQL Server 2019 Administrator's Guide


Book Description

Use Microsoft SQL Server 2019 to implement, administer, and secure a robust database solution that is disaster-proof and highly available Key FeaturesExplore new features of SQL Server 2019 to set up, administer, and maintain your database solution successfullyDevelop a dynamic SQL Server environment and streamline big data pipelinesDiscover best practices for fixing performance issues, database access management, replication, and securityBook Description SQL Server is one of the most popular relational database management systems developed by Microsoft. This second edition of the SQL Server Administrator's Guide will not only teach you how to administer an enterprise database, but also help you become proficient at managing and keeping the database available, secure, and stable. You’ll start by learning how to set up your SQL Server and configure new and existing environments for optimal use. The book then takes you through designing aspects and delves into performance tuning by showing you how to use indexes effectively. You’ll understand certain choices that need to be made about backups, implement security policy, and discover how to keep your environment healthy. Tools available for monitoring and managing a SQL Server database, including automating health reviews, performance checks, and much more, will also be discussed in detail. As you advance, the book covers essential topics such as migration, upgrading, and consolidation, along with the techniques that will help you when things go wrong. Once you’ve got to grips with integration with Azure and streamlining big data pipelines, you’ll learn best practices from industry experts for maintaining a highly reliable database solution. Whether you are an administrator or are looking to get started with database administration, this SQL Server book will help you develop the skills you need to successfully create, design, and deploy database solutions. What you will learnDiscover SQL Server 2019’s new features and how to implement themFix performance issues by optimizing queries and making use of indexesDesign and use an optimal database management strategyCombine SQL Server 2019 with Azure and manage your solution using various automation techniquesImplement efficient backup and recovery techniques in line with security policiesGet to grips with migrating, upgrading, and consolidating with SQL ServerSet up an AlwaysOn-enabled stable and fast SQL Server 2019 environmentUnderstand how to work with Big Data on SQL Server environmentsWho this book is for This book is for database administrators, database developers, and anyone who wants to administer large and multiple databases single-handedly using Microsoft's SQL Server 2019. Basic awareness of database concepts and experience with previous SQL Server versions is required.




DAMA-DMBOK


Book Description

Defining a set of guiding principles for data management and describing how these principles can be applied within data management functional areas; Providing a functional framework for the implementation of enterprise data management practices; including widely adopted practices, methods and techniques, functions, roles, deliverables and metrics; Establishing a common vocabulary for data management concepts and serving as the basis for best practices for data management professionals. DAMA-DMBOK2 provides data management and IT professionals, executives, knowledge workers, educators, and researchers with a framework to manage their data and mature their information infrastructure, based on these principles: Data is an asset with unique properties; The value of data can be and should be expressed in economic terms; Managing data means managing the quality of data; It takes metadata to manage data; It takes planning to manage data; Data management is cross-functional and requires a range of skills and expertise; Data management requires an enterprise perspective; Data management must account for a range of perspectives; Data management is data lifecycle management; Different types of data have different lifecycle requirements; Managing data includes managing risks associated with data; Data management requirements must drive information technology decisions; Effective data management requires leadership commitment.




Connected, Intelligent, Automated


Book Description

Quality 4.0 is for all industries, and this book is for anyone who wants to learn how Industry 4.0 and Quality 4.0 can help improve quality and performance in their team or company. This comprehensive guide is the culmination of 25 years of research and practice-exploring, implementing, and critically examining the quality and performance improvement aspects of what we now call Industry 4.0 technologies. Navigate the connected, intelligent, and automated ecosystems of infrastructure, people, objects, machines, and data. Sift through the noise around AI, AR, big data, blockchain, cybersecurity, and other rising technologies and emerging issues to find the signals for your organization. Discover the value proposition of Quality 4.0 and the leading role for Quality professionals to drive successful digital transformation initiatives. The changes ahead are powerful, exciting, and overwhelming-and we can draw on the lessons from past work to mitigate the risks we face today. Connected, Intelligent, Automated provides you with the techniques, philosophies, and broad overall knowledge you need to understand Quality 4.0, and helps you leverage those things for the future success of your enterprise. Chapter 1: Quality 4.0 and the Fourth Industrial Revolution Chapter 2: Connected Ecosystems Chapter 3: Intelligent Agents and Machine Learning Chapter 4: Automation: From Manual Labor to Autonomy Chapter 5: Quality 4.0 Use Cases Across Industries Chapter 6: From Algorithms to Advanced Analytics Chapter 7: Delivering Value and Impact Through Data Science Chapter 8: Data Quality and Data Management Chapter 9: Software Applications & Data Platforms Chapter 10: Blockchain Chapter 11: Performance Excellence Chapter 12: Environment, Health, Safety, Quality (EHSQ) and Cybersecurity Chapter 13: Voice of the Customer (VoC) Chapter 14: Elements of a Quality 4.0 Strategy Chapter 15: Playbook for Transformation