Developing and Validating Multiple-choice Test Items


Book Description

The most comprehensive and authoritative book in its field, this edition has been extensively revised and updated. This book is intended for anyone who develops test items for large-scale assessments, as well as teachers and graduate students who de




Developing and Validating Test Items


Book Description

Since test items are the building blocks of any test, learning how to develop and validate test items has always been critical to the teaching-learning process. As they grow in importance and use, testing programs increasingly supplement the use of selected-response (multiple-choice) items with constructed-response formats. This trend is expected to continue. As a result, a new item writing book is needed, one that provides comprehensive coverage of both types of items and of the validity theory underlying them. This book is an outgrowth of the author’s previous book, Developing and Validating Multiple-Choice Test Items, 3e (Haladyna, 2004). That book achieved distinction as the leading source of guidance on creating and validating selected-response test items. Like its predecessor, the content of this new book is based on both an extensive review of the literature and on its author’s long experience in the testing field. It is very timely in this era of burgeoning testing programs, especially when these items are delivered in a computer-based environment. Key features include ... Comprehensive and Flexible – No other book so thoroughly covers the field of test item development and its various applications. Focus on Validity – Validity, the most important consideration in testing, is stressed throughout and is based on the Standards for Educational and Psychological Testing, currently under revision by AERA, APA, and NCME Illustrative Examples – The book presents various selected and constructed response formats and uses many examples to illustrate correct and incorrect ways of writing items. Strategies for training item writers and developing large numbers of items using algorithms and other item-generating methods are also presented. Based on Theory and Research – A comprehensive review and synthesis of existing research runs throughout the book and complements the expertise of its authors.




Developing and Validating Multiple-choice Test Items


Book Description

This is the most current and comprehensive book devoted to item writing. It addresses the related topics of multiple-choice test item development and validation of responses to these test items -- two critical steps in the development of any cognitive test. In so doing, the volume provides a conceptual basis for item writing, reviews the issue of constructed- versus selected-response testing, presents a variety of formats, provides guidance in developing items as well as a basis for reviewing, evaluating, and improving items, and speculates about the future of item development and validation. This book helps readers better understand the concepts, principles, and procedures available to build better test items that will lead to more reliable tests of ability and achievement.




Handbook of Test Development


Book Description

The second edition of the Handbook of Test Development provides graduate students and professionals with an up-to-date, research-oriented guide to the latest developments in the field. Including thirty-two chapters by well-known scholars and practitioners, it is divided into five sections, covering the foundations of test development, content definition, item development, test design and form assembly, and the processes of test administration, documentation, and evaluation. Keenly aware of developments in the field since the publication of the first edition, including changes in technology, the evolution of psychometric theory, and the increased demands for effective tests via educational policy, the editors of this edition include new chapters on assessing noncognitive skills, measuring growth and learning progressions, automated item generation and test assembly, and computerized scoring of constructed responses. The volume also includes expanded coverage of performance testing, validity, fairness, and numerous other topics. Edited by Suzanne Lane, Mark R. Raymond, and Thomas M. Haladyna, The Handbook of Test Development, 2nd edition, is based on the revised Standards for Educational and Psychological Testing, and is appropriate for graduate courses and seminars that deal with test development and usage, professional testing services and credentialing agencies, state and local boards of education, and academic libraries serving these groups.




Constructing Test Items


Book Description

Constructing test items for standardized tests of achievement, ability, and aptitude is a task of enormous importance. The interpretability of a test's scores flows directly from the quality of its items and exercises. Concomitant with score interpretability is the notion that including only carefully crafted items on a test is the primary method by which the skilled test developer reduces unwanted error variance, or errors of measurement, and thereby increases a test score's reliability. The aim of this entire book is to increase the test constructor's awareness of this source of measurement error, and then to describe methods for identifying and minimizing it during item construction and later review. Persons involved in assessment are keenly aware of the increased attention given to alternative formats for test items in recent years. Yet, in many writers' zeal to be `curriculum-relevant' or `authentic' or `realistic', the items are often developed seemingly without conscious thought to the interpretations that may be garnered from them. This book argues that the format for such alternative items and exercises also requires rigor in their construction and even offers some solutions, as one chapter is devoted to these alternative formats. This book addresses major issues in constructing test items by focusing on four ideas. First, it describes the characteristics and functions of test items. A second feature of this book is the presentation of editorial guidelines for writing test items in all of the commonly used item formats, including constructed-response formats and performance tests. A third aspect of this book is the presentation of methods for determining the quality of test items. Finally, this book presents a compendium of important issues about test items, including procedures for ordering items in a test, ethical and legal concerns over using copyrighted test items, item scoring schemes, computer-generated items and more.




Scale Development


Book Description

In the Fourth Edition of Scale Development, Robert F. DeVellis demystifies measurement by emphasizing a logical rather than strictly mathematical understanding of concepts. The text supports readers in comprehending newer approaches to measurement, comparing them to classical approaches, and grasping more clearly the relative merits of each. This edition addresses new topics pertinent to modern measurement approaches and includes additional exercises and topics for class discussion. Available with Perusall—an eBook that makes it easier to prepare for class Perusall is an award-winning eBook platform featuring social annotation tools that allow students and instructors to collaboratively mark up and discuss their SAGE textbook. Backed by research and supported by technological innovations developed at Harvard University, this process of learning through collaborative annotation keeps your students engaged and makes teaching easier and more effective. Learn more.




Building a Validity Argument for a Listening Test of Academic Proficiency


Book Description

Over the years, various approaches to validation have emerged in psychological and educational assessment research, which can be classified into traditional approaches and modern approaches. Traditional approaches view validity as a multicomponential concept including, for example, content, construct, and predictive validity, while modern approaches conceptualize it as a unitary concept evaluated through argumentation. Drawing on the modern approach, this book builds a validity argument for an International English Language Testing System (IELTS) listening test sample. The book provides some insights into the listening sub-skills that the test engages, the psychometric dimensionality of the test, variables that predict item difficulty parameters, bias across age, nationality, test experience, and gender, as well as predictive-referenced evidence of validity. A variety of techniques including the Rasch model and structural equation modelling are used to answer the research questions and to build a validity argument framework; this argument organizes the thematically related findings into a coherent treatment of the validity of the listening test. The book presents the first treatment of validity argument and related analytical tools in one volume and maps the psychometric/statistical analysis tools onto the validity argument framework. It also provides an extensive literature review of listening comprehension, validation, and psychometric modeling and proposes both methods for developing and validating self-assessment instruments and novel approaches to improving the quality of language assessments.




Advancing Human Assessment


Book Description

This book is open access under a CC BY-NC 2.5 license.​​ This book describes the extensive contributions made toward the advancement of human assessment by scientists from one of the world’s leading research institutions, Educational Testing Service. The book’s four major sections detail research and development in measurement and statistics, education policy analysis and evaluation, scientific psychology, and validity. Many of the developments presented have become de-facto standards in educational and psychological measurement, including in item response theory (IRT), linking and equating, differential item functioning (DIF), and educational surveys like the National Assessment of Educational Progress (NAEP), the Programme of international Student Assessment (PISA), the Progress of International Reading Literacy Study (PIRLS) and the Trends in Mathematics and Science Study (TIMSS). In addition to its comprehensive coverage of contributions to the theory and methodology of educational and psychological measurement and statistics, the book gives significant attention to ETS work in cognitive, personality, developmental, and social psychology, and to education policy analysis and program evaluation. The chapter authors are long-standing experts who provide broad coverage and thoughtful insights that build upon decades of experience in research and best practices for measurement, evaluation, scientific psychology, and education policy analysis. Opening with a chapter on the genesis of ETS and closing with a synthesis of the enormously diverse set of contributions made over its 70-year history, the book is a useful resource for all interested in the improvement of human assessment.




The Nurse Educator's Guide to Assessing Learning Outcomes


Book Description

The new edition of this award winning text helps address the increased pressure that the NCLEX and other certification exams are placing on nursing students and faculty. The Nurse Educator’s Guide to Assessing Learning Outcomes, 2nd Edition guides classroom educators through the process of developing effective classroom exams and individual test items.




Methodological Issues of Longitudinal Surveys


Book Description

This book addresses a broad array of pressing challenges of longitudinal surveys and provides innovative solutions to methodological problems based on the example of the NEPS. It covers longitudinal issues such as sampling, weighting, recruiting and fieldwork management, the design of longitudinal surveys and the implementation of constructs, conducting competence tests over the life course, effective methods to improve and to maintain the highest level of data quality, data management tools for large-scale longitudinal surveys, the dissemination of research data to heterogeneous scientific communities, as well as establishing a long-term public relations and communications unit integrating a study’s stakeholder community over time.