From Data to Action


Book Description

This book is a welcome guide for educators, civic leaders, and researchers looking for ways to leverage data to identify the most effective policies, interventions, and use of resources for their communities. In the current era of reform, much has been made of the fact that there are many influences that shape children beyond the walls of the schoolhouse. Powerful data “warehouses” have been built to track children and interventions within school bureaucracies and in other social service sectors. Yet these data systems are rarely linked to provide a holistic view of how individual children are faring both in and out of school and which interventions—or combinations thereof—are most promising. Privacy laws and institutional traditions have made such collaborations difficult, if not impossible. Until now. The Youth Data Archive, based at the John W. Gardner Center for Youth and Their Communities at Stanford University, is an effort to blaze a new path to the productive use of cross-agency data now employed by researchers, school officials, and service providers in San Francisco, San Mateo, Alameda, and Santa Clara counties. Editors Milbrey McLaughlin and Rebecca A. London, leaders of the Youth Data Archive, bring together participants who describe the initiative and its challenges and successes. The participants also give detailed background on how the archive was built and how it has led to improvements in services, particularly for children at risk. This book is a welcome guide for educators, civic leaders, and researchers looking for ways to leverage data to identify the most effective policies, interventions, and use of resources for their communities.




Process Mining


Book Description

This is the second edition of Wil van der Aalst’s seminal book on process mining, which now discusses the field also in the broader context of data science and big data approaches. It includes several additions and updates, e.g. on inductive mining techniques, the notion of alignments, a considerably expanded section on software tools and a completely new chapter of process mining in the large. It is self-contained, while at the same time covering the entire process-mining spectrum from process discovery to predictive analytics. After a general introduction to data science and process mining in Part I, Part II provides the basics of business process modeling and data mining necessary to understand the remainder of the book. Next, Part III focuses on process discovery as the most important process mining task, while Part IV moves beyond discovering the control flow of processes, highlighting conformance checking, and organizational and time perspectives. Part V offers a guide to successfully applying process mining in practice, including an introduction to the widely used open-source tool ProM and several commercial products. Lastly, Part VI takes a step back, reflecting on the material presented and the key open challenges. Overall, this book provides a comprehensive overview of the state of the art in process mining. It is intended for business process analysts, business consultants, process managers, graduate students, and BPM researchers.




How Teachers Can Turn Data into Action


Book Description

From state and Common Core tests to formative and summative assessments in the classroom, teachers are awash in data. Reviewing the data can be time-consuming, and the work of translating data into real change can seem overwhelming. Tapping more than 30 years' experience as an award-winning teacher and a trainer of PLC coaches, Daniel R. Venables, author of The Practice of Authentic PLCs: A Guide to Effective Teacher Teams, soothes the trepidation of even the biggest "dataphobes" in this essential resource. Field-tested and fine-tuned with professional learning communities around the United States, the Data Action Model is a teacher-friendly, systematic process for reviewing and responding to data in cycles of two to nine weeks. This powerful tool enables you and your teacher team to * Identify critical gaps in learning and corresponding instructional gaps; * Collaborate on solutions and develop a goal-driven action plan; and * Evaluate the plan's effectiveness after implementation and determine the next course of action. With easy-to-use templates and protocols to focus and deepen data conversations, this indispensable guide delineates exactly what should be accomplished in each team meeting to translate data into practice. In the modern sea of data, this book is your life preserver!




Queer Data


Book Description

Data has never mattered more. Our lives are increasingly shaped by it and how it is defined, collected and used. But who counts in the collection, analysis and application of data? This important book is the first to look at queer data – defined as data relating to gender, sex, sexual orientation and trans identity/history. The author shows us how current data practices reflect an incomplete account of LGBTQ lives and helps us understand how data biases are used to delegitimise the everyday experiences of queer people. Guyan demonstrates why it is important to understand, collect and analyse queer data, the benefits and challenges involved in doing so, and how we might better use queer data in our work. Arming us with the tools for action, this book shows how greater knowledge about queer identities is instrumental in informing decisions about resource allocation, changes to legislation, access to services, representation and visibility.




R in Action


Book Description

R is a powerful language for statistical computing and graphics that can handle virtually any data-crunching task. It runs on all important platforms and provides thousands of useful specialized modules and utilities. This makes R a great way to get meaningful information from mountains of raw data. R in Action, Second Edition is a language tutorial focused on practical problems. Written by a research methodologist, it takes a direct and modular approach to quickly give readers the information they need to produce useful results. Focusing on realistic data analyses and a comprehensive integration of graphics, it follows the steps that real data analysts use to acquire their data, get it into shape, analyze it, and produce meaningful results that they can provide to clients. Purchase of the print book comes with an offer of a free PDF eBook from Manning. Also available is all code from the book.




Network Security Through Data Analysis


Book Description

Traditional intrusion detection and logfile analysis are no longer enough to protect today’s complex networks. In the updated second edition of this practical guide, security researcher Michael Collins shows InfoSec personnel the latest techniques and tools for collecting and analyzing network traffic datasets. You’ll understand how your network is used, and what actions are necessary to harden and defend the systems within it. In three sections, this book examines the process of collecting and organizing data, various tools for analysis, and several different analytic scenarios and techniques. New chapters focus on active monitoring and traffic manipulation, insider threat detection, data mining, regression and machine learning, and other topics. You’ll learn how to: Use sensors to collect network, service, host, and active domain data Work with the SiLK toolset, Python, and other tools and techniques for manipulating data you collect Detect unusual phenomena through exploratory data analysis (EDA), using visualization and mathematical techniques Analyze text data, traffic behavior, and communications mistakes Identify significant structures in your network with graph analysis Examine insider threat data and acquire threat intelligence Map your network and identify significant hosts within it Work with operations to develop defenses and analysis techniques




Data Warehousing In Action


Book Description

Data Warehousing in Action provides the tools to harness data and transform it into useful customized bytes of information. This book is a fully comprehensive account of how to proceed with the data warehouse project in a clear step-by-step fashion. It reviews the marketplace, the technology, the design issues, and the management issues. It includes real company experiences from research carried out in 30 countries. It focuses on the topics needed to design and implement a meaningful data warehousing architecture including: Identifying industrial applications for data warehouses; Frameworks for building a data warehouse; Choosing the right architecture and technologies; Outline methodologies for managing the project; and future proofing the design.· Planning the Data Warehouse· Integrating the Data--The Data Warehouse Infrastructure· Exploiting the Data--The Data Warehouse Applications· Looking to the Future--Next Generation Data Warehouse




Gnuplot in Action


Book Description

Summary Gnuplot in Action, Second Edition is a major revision of this popular and authoritative guide for developers, engineers, and scientists who want to learn and use gnuplot effectively. Fully updated for gnuplot version 5, the book includes four pages of color illustrations and four bonus appendixes available in the eBook. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Technology Gnuplot is an open-source graphics program that helps you analyze, interpret, and present numerical data. Available for Unix, Mac, and Windows, it is well-maintained, mature, and totally free. About the Book Gnuplot in Action, Second Edition is a major revision of this authoritative guide for developers, engineers, and scientists. The book starts with a tutorial introduction, followed by a systematic overview of gnuplot's core features and full coverage of gnuplot's advanced capabilities. Experienced readers will appreciate the discussion of gnuplot 5's features, including new plot types, improved text and color handling, and support for interactive, web-based display formats. The book concludes with chapters on graphical effects and general techniques for understanding data with graphs. It includes four pages of color illustrations. 3D graphics, false-color plots, heatmaps, and multivariate visualizations are covered in chapter-length appendixes available in the eBook. What's Inside Creating different types of graphs in detail Animations, scripting, batch operations Extensive discussion of terminals Updated to cover gnuplot version 5 About the Reader No prior experience with gnuplot is required. This book concentrates on practical applications of gnuplot relevant to users of all levels. About the Author Philipp K. Janert, PhD, is a programmer and scientist. He is the author of several books on data analysis and applied math and has been a gnuplot power user and developer for over 20 years. Table of Contents PART 1 GETTING STARTED Prelude: understanding data with gnuplot Tutorial: essential gnuplot The heart of the matter: the plot command PART 2 CREATING GRAPHS Managing data sets and files Practical matters: strings, loops, and history A catalog of styles Decorations: labels, arrows, and explanations All about axes PART 3 MASTERING TECHNICALITIES Color, style, and appearance Terminals and output formats Automation, scripting, and animation Beyond the defaults: workflow and styles PART 4 UNDERSTANDING DATA Basic techniques of graphical analysis Topics in graphical analysis Coda: understanding data with graphs




R in Action, Third Edition


Book Description

R is the most powerful tool you can use for statistical analysis. This definitive guide smooths R’s steep learning curve with practical solutions and real-world applications for commercial environments. In R in Action, Third Edition you will learn how to: Set up and install R and RStudio Clean, manage, and analyze data with R Use the ggplot2 package for graphs and visualizations Solve data management problems using R functions Fit and interpret regression models Test hypotheses and estimate confidence Simplify complex multivariate data with principal components and exploratory factor analysis Make predictions using time series forecasting Create dynamic reports and stunning visualizations Techniques for debugging programs and creating packages R in Action, Third Edition makes learning R quick and easy. That’s why thousands of data scientists have chosen this guide to help them master the powerful language. Far from being a dry academic tome, every example you’ll encounter in this book is relevant to scientific and business developers, and helps you solve common data challenges. R expert Rob Kabacoff takes you on a crash course in statistics, from dealing with messy and incomplete data to creating stunning visualizations. This revised and expanded third edition contains fresh coverage of the new tidyverse approach to data analysis and R’s state-of-the-art graphing capabilities with the ggplot2 package. About the technology Used daily by data scientists, researchers, and quants of all types, R is the gold standard for statistical data analysis. This free and open source language includes packages for everything from advanced data visualization to deep learning. Instantly comfortable for mathematically minded users, R easily handles practical problems without forcing you to think like a software engineer. About the book R in Action, Third Edition teaches you how to do statistical analysis and data visualization using R and its popular tidyverse packages. In it, you’ll investigate real-world data challenges, including forecasting, data mining, and dynamic report writing. This revised third edition adds new coverage for graphing with ggplot2, along with examples for machine learning topics like clustering, classification, and time series analysis. What's inside Clean, manage, and analyze data Use the ggplot2 package for graphs and visualizations Techniques for debugging programs and creating packages A complete learning resource for R and tidyverse About the reader Requires basic math and statistics. No prior experience with R needed. About the author Dr. Robert I Kabacoff is a professor of quantitative analytics at Wesleyan University and a seasoned data scientist with more than 20 years of experience. Table of Contents PART 1 GETTING STARTED 1 Introduction to R 2 Creating a dataset 3 Basic data management 4 Getting started with graphs 5 Advanced data management PART 2 BASIC METHODS 6 Basic graphs 7 Basic statistics PART 3 INTERMEDIATE METHODS 8 Regression 9 Analysis of variance 10 Power analysis 11 Intermediate graphs 12 Resampling statistics and bootstrapping PART 4 ADVANCED METHODS 13 Generalized linear models 14 Principal components and factor analysis 15 Time series 16 Cluster analysis 17 Classification 18 Advanced methods for missing data PART 5 EXPANDING YOUR SKILLS 19 Advanced graphs 20 Advanced programming 21 Creating dynamic reports 22 Creating a package




Data Wise in Action


Book Description

What does it look like when a school uses data wisely? Data Wise in Action, a new companion and sequel to our bestselling Data Wise, tells the stories of eight very different schools following the Data Wise process of using assessment results to improve teaching and learning. Data Wise in Action highlights the leadership challenges schools face in each phase of the eight-step Data Wise cycle and illustrates how staff members use creativity and collaboration to overcome those challenges. Data Wise in Action builds on the work of leading faculty and graduate students at the Harvard Graduate School of Education, who joined with exemplary practitioners in 2005 to produce Data Wise: A Step-by-Step Guide to Using Assessment Results to Improve Teaching and Learning. Since its publication, Data Wise has been read by thousands of school leaders, many of whom have shared the book with colleagues and staff. The success of the original book has generated a new demand among school leaders: to hear real stories from schools that are implementing the Data Wise process. Data Wise in Action answers that need. It offers both inspiration and practical guidance for school leaders.