Statistical Analyses for Language Assessment Book


Book Description

This book provides language teachers with guidelines to develop suitable listening tests.




Statistical Analyses for Language Testers


Book Description

Provides a step-by-step approach to the most useful statistical analyses for language test developers and researchers using IBM SPSS, Winsteps and Facets. It contains clearly-worked out examples for each analysis with detailed explanations.




Statistical Significance Testing for Natural Language Processing


Book Description

Data-driven experimental analysis has become the main evaluation tool of Natural Language Processing (NLP) algorithms. In fact, in the last decade, it has become rare to see an NLP paper, particularly one that proposes a new algorithm, that does not include extensive experimental analysis, and the number of involved tasks, datasets, domains, and languages is constantly growing. This emphasis on empirical results highlights the role of statistical significance testing in NLP research: If we, as a community, rely on empirical evaluation to validate our hypotheses and reveal the correct language processing mechanisms, we better be sure that our results are not coincidental. The goal of this book is to discuss the main aspects of statistical significance testing in NLP. Our guiding assumption throughout the book is that the basic question NLP researchers and engineers deal with is whether or not one algorithm can be considered better than another one. This question drives the field forward as it allows the constant progress of developing better technology for language processing challenges. In practice, researchers and engineers would like to draw the right conclusion from a limited set of experiments, and this conclusion should hold for other experiments with datasets they do not have at their disposal or that they cannot perform due to limited time and resources. The book hence discusses the opportunities and challenges in using statistical significance testing in NLP, from the point of view of experimental comparison between two algorithms. We cover topics such as choosing an appropriate significance test for the major NLP tasks, dealing with the unique aspects of significance testing for non-convex deep neural networks, accounting for a large number of comparisons between two NLP algorithms in a statistically valid manner (multiple hypothesis testing), and, finally, the unique challenges yielded by the nature of the data and practices of the field.




Statistical Analyses for Language Testers


Book Description

Provides a step-by-step approach to the most useful statistical analyses for language test developers and researchers using IBM SPSS, Winsteps and Facets. It contains clearly-worked out examples for each analysis with detailed explanations.




The Statistical Analysis of Experimental Data


Book Description

First half of book presents fundamental mathematical definitions, concepts, and facts while remaining half deals with statistics primarily as an interpretive tool. Well-written text, numerous worked examples with step-by-step presentation. Includes 116 tables.




Statistical Analysis of Reliability and Life-testing Models


Book Description

Probabilistic models; Basic statistical inference; The exponential distribution; The weibull distribution; The gamma distribution; Extreme-value distribution; The logistic and other distribution; Goodness-of-fit tests.




Statistics Corner


Book Description

James Dean Brown ("JD"), currently Professor of Second Language Studies at the University of Hawaii at Manoa, has lectured and taught around the world and has published numerous articles and books on language testing, curriculum design, research methods, and connected speech. For close to twenty years, Professor Brown has contributed a regular column called Statistics Corner to Shiken, the biannual publication of the Testing and Evaluation Special Interest Group (TEVAL) of the Japan Association for Language Teaching (JALT). In his column, JD answers questions submitted by readers about language testing and statistics in an informal and easy to understand format. This volume brings together in one convenient location, forty-one Statistics Corner columns-updated, arranged thematically, and fully indexed. Presented in a question and answer format, the clear and concise explanations are both accessible to novices and engaging to experts. Topics addressed include: Second language testing strategies Likert items and scales of measurement Validity and reliability of tests and questionnaires Item analysis techniques for norm-referenced and criterion-referenced tests Conducting and interpreting principle component and factor analyses Planning and interpreting qualitative, quantitative, and mixed-methods research Clear explanations of the meaning and interpretation of frequently reported statistics such as Cronbach's alpha, standard error, confidence intervals, eta squared, Cohen's Kappa, skewness and kurtosis, and more."




Statistical Inference as Severe Testing


Book Description

Mounting failures of replication in social and biological sciences give a new urgency to critically appraising proposed reforms. This book pulls back the cover on disagreements between experts charged with restoring integrity to science. It denies two pervasive views of the role of probability in inference: to assign degrees of belief, and to control error rates in a long run. If statistical consumers are unaware of assumptions behind rival evidence reforms, they can't scrutinize the consequences that affect them (in personalized medicine, psychology, etc.). The book sets sail with a simple tool: if little has been done to rule out flaws in inferring a claim, then it has not passed a severe test. Many methods advocated by data experts do not stand up to severe scrutiny and are in tension with successful strategies for blocking or accounting for cherry picking and selective reporting. Through a series of excursions and exhibits, the philosophy and history of inductive inference come alive. Philosophical tools are put to work to solve problems about science and pseudoscience, induction and falsification.




Testing for Language Teachers


Book Description

This second edition remains the most practical guide to testing language. It has a new chapter on testing young learners.




Language Testing and Assessment


Book Description