Uncommon Measures


Book Description

The issues surrounding the comparability of various tests used to assess performance in schools received broad public attention during congressional debate over the Voluntary National Tests proposed by President Clinton in his 1997 State of the Union Address. Proponents of Voluntary National Tests argue that there is no widely understood, challenging benchmark of individual student performance in 4th-grade reading and 8th-grade mathematics, thus the need for a new test. Opponents argue that a statistical linkage among tests already used by states and districts might provide the sort of comparability called for by the president's proposal. Public Law 105-78 requested that the National Research Council study whether an equivalency scale could be developed that would allow test scores from existing commercial tests and state assessments to be compared with each other and with the National Assessment of Education Progress. In this book, the committee reviewed research literature on the statistical and technical aspects of creating valid links between tests and how the content, use, and purposes of education testing in the United States influences the quality and meaning of those links. The book summarizes relevant prior linkage studies and presents a picture of the diversity of state testing programs. It also looks at the unique characteristics of the National Assessment of Educational Progress. Uncommon Measures provides an answer to the question posed by Congress in Public Law 105-78, suggests criteria for evaluating the quality of linkages, and calls for further research to determine the level of precision needed to make inferences about linked tests. In arriving at its conclusions, the committee acknowledged that ultimately policymakers and educators must take responsibility for determining the degree of imprecision they are willing to tolerate in testing and linking. This book provides science-based information with which to make those decisions.













Embedding Questions


Book Description

Policy makers are caught between two powerful forces in relation to testing in America's schools. One is increased interest on the part of educators, reinforced by federal requirements, in developing tests that accurately reflect local educational standards and goals. The other is a strong push to gather information about the performance of students and schools relative to national and international standards and norms. The difficulty of achieving these two goals simultaneously is exacerbated by both the long-standing American tradition of local control of education and the growing public sentiment that students already take enough tests. Finding a solution to this dilemma has been the focus of numerous debates surrounding the Voluntary National Tests proposed by President Clinton in his 1997 State of the Union address. It was also the topic of a congressionally mandated 1998 National Research Council report (Uncommon Measures: Equivalence and Linkage Among Educational Tests), and was touched upon in a U.S. General Accounting Office report (Student Testing: Issues Related to Voluntary National Mathematics and Reading Tests). More recently, Congress asked the National Research Council to determine the technical feasibility, validity, and reliability of embedding test items from the National Assessment of Educational Progress or other tests in state and district assessments in 4th-grade reading and 8th-grade mathematics for the purpose of developing a valid measure of student achievement within states and districts and in terms of national performance standards or scales. This report is the response to that congressional mandate.




Uncommon Measures


Book Description

The issues surrounding the comparability of various tests used to assess performance in schools received broad public attention during congressional debate over the Voluntary National Tests proposed by President Clinton in his 1997 State of the Union Address. Proponents of Voluntary National Tests argue that there is no widely understood, challenging benchmark of individual student performance in 4th-grade reading and 8th-grade mathematics, thus the need for a new test. Opponents argue that a statistical linkage among tests already used by states and districts might provide the sort of comparability called for by the president's proposal. Public Law 105-78 requested that the National Research Council study whether an equivalency scale could be developed that would allow test scores from existing commercial tests and state assessments to be compared with each other and with the National Assessment of Education Progress. In this book, the committee reviewed research literature on the statistical and technical aspects of creating valid links between tests and how the content, use, and purposes of education testing in the United States influences the quality and meaning of those links. The book summarizes relevant prior linkage studies and presents a picture of the diversity of state testing programs. It also looks at the unique characteristics of the National Assessment of Educational Progress. Uncommon Measures provides an answer to the question posed by Congress in Public Law 105-78, suggests criteria for evaluating the quality of linkages, and calls for further research to determine the level of precision needed to make inferences about linked tests. In arriving at its conclusions, the committee acknowledged that ultimately policymakers and educators must take responsibility for determining the degree of imprecision they are willing to tolerate in testing and linking. This book provides science-based information with which to make those decisions.




Making Sense of Test-Based Accountability in Education


Book Description

Test-based accountability systems that attach high stakes to standardized test results have raised a number of issues on educational assessment and accountability. Do these high-stakes tests measure student achievement accurately? How can policymakers and educators attach the right consequences to the results of these tests? And what kinds of tradeoffs do these testing policies introduce? This book responds to the growing emphasis on high-stakes testing and offers recommendations for more-effective test-based accountability systems.




Keeping Score for All


Book Description

U.S. public schools are responsible for educating large numbers of English language learners and students with disabilities. This book considers policies for including students with disabilities and English language learners in assessment programs. It also examines the research findings on testing accommodations and their effect on test performance. Keeping Score for All discusses the comparability of states' policies with each other and with the National Assessment of Educational Progress (NAEP) policies and explores the impact of these differences on the interpretations of NAEP results. The book presents a critical review of the research literature and makes suggestions for future research to evaluate the validity of test scores obtained under accommodated conditions. The book concludes by proposing a new framework for conceptualizing accommodations. This framework would be useful both for policymakers, test designers, and practitioners in determining appropriate accommodations for specific assessments and for researchers in planning validity studies.




Educational Measurement


Book Description

Educational Measurement has been the bible in its field since the first edition was published by ACE in 1951. The importance of this fourth edition of Educational Measurement is to extensively update and extend the topics treated in the previous three editions. As such, the fourth edition documents progress in the field and provides critical guidance to the efforts of new generations of researchers and practitioners. Edited by Robert Brennan and jointly sponsored by the American Council on Education (ACE) and the National Council on Measurement in Education, the fourth edition provides in-depth treatments of critical measurement topics, and the chapter authors are acknowledged experts in their respective fields. Educational measurement researchers and practitioners will find this text essential, and those interested in statistics, psychology, business, and economics should also find this work to be of very strong interest. Topics covered are divided into three subject areas: theory and general principles; construction, administration, and scoring; and applications. The first part of the book covers the topics of validation, reliability, item response theory, scaling and norming, linking and equating, test fairness, and cognitive psychology. Part two includes chapters on test development, test administration, performance assessment, setting performance standards, and technology in testing. The final section includes chapters on second language testing, testing for accountability in K-12 schools, standardized assessment of individual achievement in K-12 schools, higher education admissions testing, monitoring educational progress, licensure and certification testing, and legal and ethical issues.




The Assessment of Science Meets the Science of Assessment


Book Description

To explore the connections between new approaches to science education and new developments in assessment, the Board on Testing and Assessment (BOTA) of the National Research Council (NRC) sponsored a two-day conference on February 22 and 23, 1997. Participants included BOTA members, other measurement experts, and educators and policymakers concerned with science education reform. The conference encouraged the exchange of ideas between those with measurement expertise and those with creative approaches to instruction and assessment.