Data Management in Large-Scale Education Research


Book Description

Research data management is becoming more complicated. Researchers are collecting more data, using more complex technologies, all the while increasing the visibility of our work with the push for data sharing and open science practices. Ad hoc data management practices may have worked for us in the past, but now others need to understand our processes as well, requiring researchers to be more thoughtful in planning their data management routines. This book is for anyone involved in a research study involving original data collection. While the book focuses on quantitative data, typically collected from human participants, many of the practices covered can apply to other types of data as well. The book contains foundational context, instructions, and practical examples to help researchers in the field of education begin to understand how to create data management workflows for large-scale, typically federally funded, research studies. The book starts by describing the research life cycle and how data management fits within this larger picture. The remaining chapters are then organized by each phase of the life cycle, with examples of best practices provided for each phase. Finally, considerations on whether the reader should implement, and how to integrate those practices into a workflow, are discussed. Key Features: Provides a holistic approach to the research life cycle, showing how project management and data management processes work in parallel and collaboratively Can be read in its entirety, or referenced as needed throughout the life cycle Includes relatable examples specific to education research Includes a discussion on how to organize and document data in preparation for data sharing requirements Contains links to example documents as well as templates to help readers implement practices




Data Management for Researchers


Book Description

A comprehensive guide to everything scientists need to know about data management, this book is essential for researchers who need to learn how to organize, document and take care of their own data. Researchers in all disciplines are faced with the challenge of managing the growing amounts of digital data that are the foundation of their research. Kristin Briney offers practical advice and clearly explains policies and principles, in an accessible and in-depth text that will allow researchers to understand and achieve the goal of better research data management. Data Management for Researchers includes sections on: * The data problem – an introduction to the growing importance and challenges of using digital data in research. Covers both the inherent problems with managing digital information, as well as how the research landscape is changing to give more value to research datasets and code. * The data lifecycle – a framework for data’s place within the research process and how data’s role is changing. Greater emphasis on data sharing and data reuse will not only change the way we conduct research but also how we manage research data. * Planning for data management – covers the many aspects of data management and how to put them together in a data management plan. This section also includes sample data management plans. * Documenting your data – an often overlooked part of the data management process, but one that is critical to good management; data without documentation are frequently unusable. * Organizing your data – explains how to keep your data in order using organizational systems and file naming conventions. This section also covers using a database to organize and analyze content. * Improving data analysis – covers managing information through the analysis process. This section starts by comparing the management of raw and analyzed data and then describes ways to make analysis easier, such as spreadsheet best practices. It also examines practices for research code, including version control systems. * Managing secure and private data – many researchers are dealing with data that require extra security. This section outlines what data falls into this category and some of the policies that apply, before addressing the best practices for keeping data secure. * Short-term storage – deals with the practical matters of storage and backup and covers the many options available. This section also goes through the best practices to insure that data are not lost. * Preserving and archiving your data – digital data can have a long life if properly cared for. This section covers managing data in the long term including choosing good file formats and media, as well as determining who will manage the data after the end of the project. * Sharing/publishing your data – addresses how to make data sharing across research groups easier, as well as how and why to publicly share data. This section covers intellectual property and licenses for datasets, before ending with the altmetrics that measure the impact of publicly shared data. * Reusing data – as more data are shared, it becomes possible to use outside data in your research. This chapter discusses strategies for finding datasets and lays out how to cite data once you have found it. This book is designed for active scientific researchers but it is useful for anyone who wants to get more from their data: academics, educators, professionals or anyone who teaches data management, sharing and preservation. "An excellent practical treatise on the art and practice of data management, this book is essential to any researcher, regardless of subject or discipline." —Robert Buntrock, Chemical Information Bulletin




Effective Big Data Management and Opportunities for Implementation


Book Description

“Big data” has become a commonly used term to describe large-scale and complex data sets which are difficult to manage and analyze using standard data management methodologies. With applications across sectors and fields of study, the implementation and possible uses of big data are limitless. Effective Big Data Management and Opportunities for Implementation explores emerging research on the ever-growing field of big data and facilitates further knowledge development on methods for handling and interpreting large data sets. Providing multi-disciplinary perspectives fueled by international research, this publication is designed for use by data analysts, IT professionals, researchers, and graduate-level students interested in learning about the latest trends and concepts in big data.




Implementation of Large-Scale Education Assessments


Book Description

Presents a comprehensive treatment of issues related to the inception, design, implementation and reporting of large-scale education assessments. In recent years many countries have decided to become involved in international educational assessments to allow them to ascertain the strengths and weaknesses of their student populations. Assessments such as the OECD's Programme for International Student Assessment (PISA), the IEA's Trends in Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy (PIRLS) have provided opportunities for comparison between students of different countries on a common international scale. This book is designed to give researchers, policy makers and practitioners a well-grounded knowledge in the design, implementation, analysis and reporting of international assessments. Readers will be able to gain a more detailed insight into the scientific principles employed in such studies allowing them to make better use of the results. The book will also give readers an understanding of the resources needed to undertake and improve the design of educational assessments in their own countries and regions. Implementation of Large-Scale Education Assessments: Brings together the editors’ extensive experience in creating, designing, implementing, analysing and reporting results on a wide range of assessments. Emphasizes methods for implementing international studies of student achievement and obtaining highquality data from cognitive tests and contextual questionnaires. Discusses the methods of sampling, weighting, and variance estimation that are commonly encountered in international large-scale assessments. Provides direction and stimulus for improving global educational assessment and student learning Is written by experts in the field, with an international perspective. Survey researchers, market researchers and practitioners engaged in comparative projects will all benefit from the unparalleled breadth of knowledge and experience in large-scale educational assessments gathered in this one volume.




Telling Stories with Data


Book Description

The book equips students with the end-to-end skills needed to do data science. That means gathering, cleaning, preparing, and sharing data, then using statistical models to analyse data, writing about the results of those models, drawing conclusions from them, and finally, using the cloud to put a model into production, all done in a reproducible way. At the moment, there are a lot of books that teach data science, but most of them assume that you already have the data. This book fills that gap by detailing how to go about gathering datasets, cleaning and preparing them, before analysing them. There are also a lot of books that teach statistical modelling, but few of them teach how to communicate the results of the models and how they help us learn about the world. Very few data science textbooks cover ethics, and most of those that do, have a token ethics chapter. Finally, reproducibility is not often emphasised in data science books. This book is based around a straight-forward workflow conducted in an ethical and reproducible way: gather data, prepare data, analyse data, and communicate those findings. This book will achieve the goals by working through extensive case studies in terms of gathering and preparing data, and integrating ethics throughout. It is specifically designed around teaching how to write about the data and models, so aspects such as writing are explicitly covered. And finally, the use of GitHub and the open-source statistical language R are built in throughout the book. Key Features: Extensive code examples. Ethics integrated throughout. Reproducibility integrated throughout. Focus on data gathering, messy data, and cleaning data. Extensive formative assessment throughout.




Perspectives in Contemporary STEM Education Research


Book Description

This book presents an overview of the methodological innovations and developments present in the field of STEM education research as well as providing a practically orientated resource on research method design more broadly. Featuring a range of international contributors in the field, the book provides a compendium of exemplary innovative methodological designs, implementations, and analyses that answer a variety of research questions relating to STEM education disciplines. Charting the thinking behind the design and implementation of successful research investigations, the book’s two parts present an accessible and pragmatically framed set of chapters that cover a range of important methodological areas presented by active researchers in the field. Ultimately, this book presents a comprehensive resource that explores the act of educational research as related to STEM. By showcasing key methodological principles with guidance on practical approaches underpinned by theory, the book offers scholarly research-informed suggestions for practice. It will be of great interest to researchers, academics, and students in the fields of STEM education and education research methods, as well as educational research more broadly.







Model Management and Analytics for Large Scale Systems


Book Description

Model Management and Analytics for Large Scale Systems covers the use of models and related artefacts (such as metamodels and model transformations) as central elements for tackling the complexity of building systems and managing data. With their increased use across diverse settings, the complexity, size, multiplicity and variety of those artefacts has increased. Originally developed for software engineering, these approaches can now be used to simplify the analytics of large-scale models and automate complex data analysis processes. Those in the field of data science will gain novel insights on the topic of model analytics that go beyond both model-based development and data analytics. This book is aimed at both researchers and practitioners who are interested in model-based development and the analytics of large-scale models, ranging from big data management and analytics, to enterprise domains. The book could also be used in graduate courses on model development, data analytics and data management. - Identifies key problems and offers solution approaches and tools that have been developed or are necessary for model management and analytics - Explores basic theory and background, current research topics, related challenges and the research directions for model management and analytics - Provides a complete overview of model management and analytics frameworks, the different types of analytics (descriptive, diagnostics, predictive and prescriptive), the required modelling and method steps, and important future directions




International Large-Scale Assessments in Education


Book Description

This book explores the often controversial international large-scale assessments (ILSAs) in education and offers research-based accounts of international testing as a social practice. Assessment exercises, such as the Organisation for Economic Co-operation and Development's Programme for International Student Assessment (PISA), produce comparable international statistics and rankings on educational performance, and are influential practices that shape educational policy on a global scale. The chapters in this volume, written by expert researchers in the field, take the reader behind the scenes to document a broad range of ILSA practices – from the recruitment of countries into ILSAs, to the production and performance of large-scale testing, and the management, media reception and use of test data. Based on data that is only available to expert researchers with inside access, the international case study material includes examples from Australia, Ecuador, Germany, Japan, Mexico, Norway, Russia, Scotland, Slovenia, Sweden, the UK and the USA. The volume provides important insights for teachers, researchers and policy-makers who use and study assessment data and who wish to evaluate its significance for educational policy and practice.




Large-scale Graph Analysis: System, Algorithm and Optimization


Book Description

This book introduces readers to a workload-aware methodology for large-scale graph algorithm optimization in graph-computing systems, and proposes several optimization techniques that can enable these systems to handle advanced graph algorithms efficiently. More concretely, it proposes a workload-aware cost model to guide the development of high-performance algorithms. On the basis of the cost model, the book subsequently presents a system-level optimization resulting in a partition-aware graph-computing engine, PAGE. In addition, it presents three efficient and scalable advanced graph algorithms – the subgraph enumeration, cohesive subgraph detection, and graph extraction algorithms. This book offers a valuable reference guide for junior researchers, covering the latest advances in large-scale graph analysis; and for senior researchers, sharing state-of-the-art solutions based on advanced graph algorithms. In addition, all readers will find a workload-aware methodology for designing efficient large-scale graph algorithms.