Validity and Validation


Book Description

The Understanding Research series focuses on the process of writing up social research. The series is broken down into three categories: Understanding Statistics, Understanding Measurement, and Understanding Qualitative Research. The books provide researchers with guides to understanding, writing, and evaluating social research. Each volume demonstrates how research should be represented, including how to write up the methodology as well as the research findings. Each volume also reviews how to appropriately evaluate published research. Validity and Validation is an introduction to validity theory and to the methods used to obtain evidence for the validity of research and assessment results. The book pulls together the best thinking from educational and psychological research and assessment over the past 50 years. It briefly describes validity theory's roots in the philosophy of science. It highlights the ways these philosophical perspectives influence concepts of internal and external validity in research methodology, as well as concepts of validity and reliability in educational and psychological tests and measurements. Each chapter provides multiple examples (e.g., research designs and examples of output) to help the readers see how validation work is done in practice, from the ways we design research studies to the ways we interpret research results. Of particular importance is the practical focus on validation of scores from tests and other measures. The book also addresses strategies for investigating the validity of inferences we make about examinees using scores from assessments, as well as how to investigate score uses, the value implications of score interpretations, and the social consequences of score use. With this foundation, the book presents strategies for minimizing threats for validity as well as quantitative and qualitative methods for gathering evidence for the validity of scores.




Validity and Validation in Social, Behavioral, and Health Sciences


Book Description

This book combines an overview of validity theory, trends in validation practices and a review of standards and guidelines in several international jurisdictions with research synthesis of the validity evidence in different research areas. An overview of theory is both useful and timely, in view of the increased use of tests and measures for decision-making, ranking and policy purposes in large-scale testing, assessment and social indicators and quality of life research. Research synthesis is needed to help us assemble, critically appraise and integrate the overwhelming volume of research on validity in different contexts. Rather than examining whether any given measure is “valid”, the focus is on a critical appraisal of the kinds of validity evidence reported in the published research literature. The five sources of validity evidence discussed are: content-related, response processes, internal structure, associations with other variables and consequences. The 15 syntheses included here, represent a broad sampling of psychosocial, health, medical and educational research settings, giving us an extensive evidential basis to build upon earlier studies. The book concludes with a meta-synthesis of the 15 syntheses and a discussion of the current thinking of validation practices by leading experts in the field.




Advancing Human Assessment


Book Description

This book is open access under a CC BY-NC 2.5 license.​​ This book describes the extensive contributions made toward the advancement of human assessment by scientists from one of the world’s leading research institutions, Educational Testing Service. The book’s four major sections detail research and development in measurement and statistics, education policy analysis and evaluation, scientific psychology, and validity. Many of the developments presented have become de-facto standards in educational and psychological measurement, including in item response theory (IRT), linking and equating, differential item functioning (DIF), and educational surveys like the National Assessment of Educational Progress (NAEP), the Programme of international Student Assessment (PISA), the Progress of International Reading Literacy Study (PIRLS) and the Trends in Mathematics and Science Study (TIMSS). In addition to its comprehensive coverage of contributions to the theory and methodology of educational and psychological measurement and statistics, the book gives significant attention to ETS work in cognitive, personality, developmental, and social psychology, and to education policy analysis and program evaluation. The chapter authors are long-standing experts who provide broad coverage and thoughtful insights that build upon decades of experience in research and best practices for measurement, evaluation, scientific psychology, and education policy analysis. Opening with a chapter on the genesis of ETS and closing with a synthesis of the enormously diverse set of contributions made over its 70-year history, the book is a useful resource for all interested in the improvement of human assessment.




The Concept of Validity


Book Description

Validity is widely held to be the most important criterion for an assessment. Nevertheless, assessment professionals have disagreed about the meaning of validity almost from the introduction of the term as applied to testing about 100 years ago. Over the years, the best and brightest people in assessment have contributed their thinking to this problem and the fact that they have not agreed is testimony to the complexity and importance of validity. Even today, ways to define validity are being debated in the published literature in the assessment profession. How can such a fundamental concept be so controversial? This book brings focus to diverse perspectives about validity. Its chapter authors were chosen because of their expertise and because they differ from each other in the ways they think about the validity construct. Its introduction and ten chapters bridge both the theoretical and the practical. Contributors include most prominent names in the field of validity and their perspectives are at once cogent and controversial. From these diverse and well-informed discussions, the reader will gain a deep understanding of the core issues in validity along with directions toward possible resolutions. The debate that exists among these authors is a rich one that will stimulate the reader’s own understanding and opinion. Several chapters are oriented more practically. Ways to study validity are presented by professionals who blend current assessment practice with new suggestions for what sort of evidence to develop and how to generate the needed information. In addition they provide examples of some of the options on how to present the validity argument in the most effective ways. The initial chapter by the Editor is an effort to orient the reader as well as providing an overview of the book. Bob Lissitz has provided a brief perspective on each of the subsequent chapters as well as presenting a series of questions regarding validation that the reader will want to try to answer for themselves, as he or she reads through this book. This book’s topic is fundamental to assessment, its authors are distinguished, and its scope is broad. It deserves to become established as a fundamental reference on validity for years to come.




Scale Development


Book Description

In the Fourth Edition of Scale Development, Robert F. DeVellis demystifies measurement by emphasizing a logical rather than strictly mathematical understanding of concepts. The text supports readers in comprehending newer approaches to measurement, comparing them to classical approaches, and grasping more clearly the relative merits of each. This edition addresses new topics pertinent to modern measurement approaches and includes additional exercises and topics for class discussion. Available with Perusall—an eBook that makes it easier to prepare for class Perusall is an award-winning eBook platform featuring social annotation tools that allow students and instructors to collaboratively mark up and discuss their SAGE textbook. Backed by research and supported by technological innovations developed at Harvard University, this process of learning through collaborative annotation keeps your students engaged and makes teaching easier and more effective. Learn more.




Encyclopedia of Research Design


Book Description

"Comprising more than 500 entries, the Encyclopedia of Research Design explains how to make decisions about research design, undertake research projects in an ethical manner, interpret and draw valid inferences from data, and evaluate experiment design strategies and results. Two additional features carry this encyclopedia far above other works in the field: bibliographic entries devoted to significant articles in the history of research design and reviews of contemporary tools, such as software and statistical procedures, used to analyze results. It covers the spectrum of research design strategies, from material presented in introductory classes to topics necessary in graduate research; it addresses cross- and multidisciplinary research needs, with many examples drawn from the social and behavioral sciences, neurosciences, and biomedical and life sciences; it provides summaries of advantages and disadvantages of often-used strategies; and it uses hundreds of sample tables, figures, and equations based on real-life cases."--Publisher's description.




Validity in Educational and Psychological Assessment


Book Description

Validity is the hallmark of quality for educational and psychological measurement. But what does quality mean in this context? And to what, exactly, does the concept of validity apply? These apparently innocuous questions parachute the unwary inquirer into a minefield of tricky ideas. This book guides you through this minefield, investigating how the concept of validity has evolved from the nineteenth century to the present day. Communicating complicated concepts straightforwardly, the authors answer questions like: What does ′validity′ mean? What does it mean to ′validate′? How many different kinds of validity are there? When does validation begin and end? Is reliability a part of validity, or distinct from it? This book will be of interest to anyone with a professional or academic interest in evaluating the quality of educational or psychological assessments, measurements and diagnoses.




Validity


Book Description

Validity is a clear, substantive introduction to the two most fundamental aspects of defensible testing practice: understanding test score meaning and justifying test score use. Driven by evidence-based and consensus-grounded measurement theory, principles, and terminology, this book addresses the most common questions of applied validation, the quality of test information, and the usefulness of test results. Concise yet comprehensive, this volume’s integrated framework is ideal for graduate courses on assessment, testing, psychometrics, and research methods as well as for credentialing organizations, licensure and certification entities, education agencies, and test publishers.




The SAGE Handbook of Quantitative Methodology for the Social Sciences


Book Description

Click ′Additional Materials′ for downloadable samples "The 24 chapters in this Handbook span a wide range of topics, presenting the latest quantitative developments in scaling theory, measurement, categorical data analysis, multilevel models, latent variable models, and foundational issues. Each chapter reviews the historical context for the topic and then describes current work, including illustrative examples where appropriate. The level of presentation throughout the book is detailed enough to convey genuine understanding without overwhelming the reader with technical material. Ample references are given for readers who wish to pursue topics in more detail. The book will appeal to both researchers who wish to update their knowledge of specific quantitative methods, and students who wish to have an integrated survey of state-of- the-art quantitative methods." —Roger E. Millsap, Arizona State University "This handbook discusses important methodological tools and topics in quantitative methodology in easy to understand language. It is an exhaustive review of past and recent advances in each topic combined with a detailed discussion of examples and graphical illustrations. It will be an essential reference for social science researchers as an introduction to methods and quantitative concepts of great use." —Irini Moustaki, London School of Economics, U.K. "David Kaplan and SAGE Publications are to be congratulated on the development of a new handbook on quantitative methods for the social sciences. The Handbook is more than a set of methodologies, it is a journey. This methodological journey allows the reader to experience scaling, tests and measurement, and statistical methodologies applied to categorical, multilevel, and latent variables. The journey concludes with a number of philosophical issues of interest to researchers in the social sciences. The new Handbook is a must purchase." —Neil H. Timm, University of Pittsburgh The SAGE Handbook of Quantitative Methodology for the Social Sciences is the definitive reference for teachers, students, and researchers of quantitative methods in the social sciences, as it provides a comprehensive overview of the major techniques used in the field. The contributors, top methodologists and researchers, have written about their areas of expertise in ways that convey the utility of their respective techniques, but, where appropriate, they also offer a fair critique of these techniques. Relevance to real-world problems in the social sciences is an essential ingredient of each chapter and makes this an invaluable resource. The handbook is divided into six sections: • Scaling • Testing and Measurement • Models for Categorical Data • Models for Multilevel Data • Models for Latent Variables • Foundational Issues These sections, comprising twenty-four chapters, address topics in scaling and measurement, advances in statistical modeling methodologies, and broad philosophical themes and foundational issues that transcend many of the quantitative methodologies covered in the book. The Handbook is indispensable to the teaching, study, and research of quantitative methods and will enable readers to develop a level of understanding of statistical techniques commensurate with the most recent, state-of-the-art, theoretical developments in the field. It provides the foundations for quantitative research, with cutting-edge insights on the effectiveness of each method, depending on the data and distinct research situation.




Developing and Validating Test Items


Book Description

Since test items are the building blocks of any test, learning how to develop and validate test items has always been critical to the teaching-learning process. As they grow in importance and use, testing programs increasingly supplement the use of selected-response (multiple-choice) items with constructed-response formats. This trend is expected to continue. As a result, a new item writing book is needed, one that provides comprehensive coverage of both types of items and of the validity theory underlying them. This book is an outgrowth of the author’s previous book, Developing and Validating Multiple-Choice Test Items, 3e (Haladyna, 2004). That book achieved distinction as the leading source of guidance on creating and validating selected-response test items. Like its predecessor, the content of this new book is based on both an extensive review of the literature and on its author’s long experience in the testing field. It is very timely in this era of burgeoning testing programs, especially when these items are delivered in a computer-based environment. Key features include ... Comprehensive and Flexible – No other book so thoroughly covers the field of test item development and its various applications. Focus on Validity – Validity, the most important consideration in testing, is stressed throughout and is based on the Standards for Educational and Psychological Testing, currently under revision by AERA, APA, and NCME Illustrative Examples – The book presents various selected and constructed response formats and uses many examples to illustrate correct and incorrect ways of writing items. Strategies for training item writers and developing large numbers of items using algorithms and other item-generating methods are also presented. Based on Theory and Research – A comprehensive review and synthesis of existing research runs throughout the book and complements the expertise of its authors.