An Introduction to Categorical Data Analysis


Book Description

A valuable new edition of a standard reference The use of statistical methods for categorical data has increased dramatically, particularly for applications in the biomedical and social sciences. An Introduction to Categorical Data Analysis, Third Edition summarizes these methods and shows readers how to use them using software. Readers will find a unified generalized linear models approach that connects logistic regression and loglinear models for discrete data with normal regression for continuous data. Adding to the value in the new edition is: • Illustrations of the use of R software to perform all the analyses in the book • A new chapter on alternative methods for categorical data, including smoothing and regularization methods (such as the lasso), classification methods such as linear discriminant analysis and classification trees, and cluster analysis • New sections in many chapters introducing the Bayesian approach for the methods of that chapter • More than 70 analyses of data sets to illustrate application of the methods, and about 200 exercises, many containing other data sets • An appendix showing how to use SAS, Stata, and SPSS, and an appendix with short solutions to most odd-numbered exercises Written in an applied, nontechnical style, this book illustrates the methods using a wide variety of real data, including medical clinical trials, environmental questions, drug use by teenagers, horseshoe crab mating, basketball shooting, correlates of happiness, and much more. An Introduction to Categorical Data Analysis, Third Edition is an invaluable tool for statisticians and biostatisticians as well as methodologists in the social and behavioral sciences, medicine and public health, marketing, education, and the biological and agricultural sciences.




A Course in Categorical Data Analysis


Book Description

Categorical data-comprising counts of individuals, objects, or entities in different categories-emerge frequently from many areas of study, including medicine, sociology, geology, and education. They provide important statistical information that can lead to real-life conclusions and the discovery of fresh knowledge. Therefore, the ability to manipulate, understand, and interpret categorical data becomes of interest-if not essential-to professionals and students in a broad range of disciplines. Although t-tests, linear regression, and analysis of variance are useful, valid methods for analysis of measurement data, categorical data requires a different methodology and techniques typically not encountered in introductory statistics courses. Developed from long experience in teaching categorical analysis to a multidisciplinary mix of undergraduate and graduate students, A Course in Categorical Data Analysis presents the easiest, most straightforward ways of extracting real-life conclusions from contingency tables. The author uses a Fisherian approach to categorical data analysis and incorporates numerous examples and real data sets. Although he offers S-PLUS routines through the Internet, readers do not need full knowledge of a statistical software package. In this unique text, the author chooses methods and an approach that nurtures intuitive thinking. He trains his readers to focus not on finding a model that fits the data, but on using different models that may lead to meaningful conclusions. The book offers some simple, innovative techniques not highighted in other texts that help make the book accessible to a broad, interdisciplinary audience. A Course in Categorical Data Analysis enables readers to quickly use its offering of tools for drawing scientific, medical, or real-life conclusions from categorical data sets.




Analysis of Categorical Data with R


Book Description

Analysis of Categorical Data with R, Second Edition presents a modern account of categorical data analysis using the R software environment. It covers recent techniques of model building and assessment for binary, multicategory, and count response variables and discusses fundamentals, such as odds ratio and probability estimation. The authors give detailed advice and guidelines on which procedures to use and why to use them. The second edition is a substantial update of the first based on the authors’ experiences of teaching from the book for nearly a decade. The book is organized as before, but with new content throughout, and there are two new substantive topics in the advanced topics chapter—group testing and splines. The computing has been completely updated, with the "emmeans" package now integrated into the book. The examples have also been updated, notably to include new examples based on COVID-19, and there are more than 90 new exercises in the book. The solutions manual and teaching videos have also been updated. Features: Requires no prior experience with R, and offers an introduction to the essential features and functions of R Includes numerous examples from medicine, psychology, sports, ecology, and many other areas Integrates extensive R code and output Graphically demonstrates many of the features and properties of various analysis methods Offers a substantial number of exercises in all chapters, enabling use as a course text or for self-study Supplemented by a website with data sets, code, and teaching videos Analysis of Categorical Data with R, Second Edition is primarily designed for a course on categorical data analysis taught at the advanced undergraduate or graduate level. Such a course could be taught in a statistics or biostatistics department, or within mathematics, psychology, social science, ecology, or another quantitative discipline. It could also be used by a self-learner and would make an ideal reference for a researcher from any discipline where categorical data arise.




Lectures on Categorical Data Analysis


Book Description

This book offers a relatively self-contained presentation of the fundamental results in categorical data analysis, which plays a central role among the statistical techniques applied in the social, political and behavioral sciences, as well as in marketing and medical and biological research. The methods applied are mainly aimed at understanding the structure of associations among variables and the effects of other variables on these interactions. A great advantage of studying categorical data analysis is that many concepts in statistics become transparent when discussed in a categorical data context, and, in many places, the book takes this opportunity to comment on general principles and methods in statistics, addressing not only the “how” but also the “why.” Assuming minimal background in calculus, linear algebra, probability theory and statistics, the book is designed to be used in upper-undergraduate and graduate-level courses in the field and in more general statistical methodology courses, as well as a self-study resource for researchers and professionals. The book covers such key issues as: higher order interactions among categorical variables; the use of the delta-method to correctly determine asymptotic standard errors for complex quantities reported in surveys; the fundamentals of the main theories of causal analysis based on observational data; the usefulness of the odds ratio as a measure of association; and a detailed discussion of log-linear models, including graphical models. The book contains over 200 problems, many of which may also be used as starting points for undergraduate research projects. The material can be used by students toward a variety of goals, depending on the degree of theory or application desired.




Categorical Data Analysis for the Behavioral and Social Sciences


Book Description

Featuring a practical approach with numerous examples, the second edition of Categorical Data Analysis for the Behavioral and Social Sciences focuses on helping the reader develop a conceptual understanding of categorical methods, making it a much more accessible text than others on the market. The authors cover common categorical analysis methods and emphasize specific research questions that can be addressed by each analytic procedure, including how to obtain results using SPSS, SAS, and R, so that readers are able to address the research questions they wish to answer. Each chapter begins with a "Look Ahead" section to highlight key content. This is followed by an in-depth focus and explanation of the relationship between the initial research question, the use of software to perform the analyses, and how to interpret the output substantively. Included at the end of each chapter are a range of software examples and questions to test knowledge. New to the second edition: The addition of R syntax for all analyses and an update of SPSS and SAS syntax. The addition of a new chapter on GLMMs. Clarification of concepts and ideas that graduate students found confusing, including revised problems at the end of the chapters. Written for those without an extensive mathematical background, this book is ideal for a graduate course in categorical data analysis taught in departments of psychology, educational psychology, human development and family studies, sociology, public health, and business. Researchers in these disciplines interested in applying these procedures will also appreciate this book’s accessible approach.




Discrete Data Analysis with R


Book Description

An Applied Treatment of Modern Graphical Methods for Analyzing Categorical DataDiscrete Data Analysis with R: Visualization and Modeling Techniques for Categorical and Count Data presents an applied treatment of modern methods for the analysis of categorical data, both discrete response data and frequency data. It explains how to use graphical meth




The Statistical Analysis of Categorical Data


Book Description

The aim of this book is to give an up to date account of the most commonly uses statist i cal models for categoriCal data. The emphasis is on the connection between theory and appIications to real data sets. The book only covers models for categorical data. Various n:t0dels for mixed continuous and categorical data are thus excluded. The book is written as a textbook, although many methods and results are quite recent. This should imply, that the book can be used for a graduate course in categorical data analysis. With this aim in mind chapters 3 to 12 are concluded with a set of exer eises. In many cases, the data sets are those data sets, which were not included in the examples of the book, although they at one point in time were regarded as potential can didates for an example. A certain amount of general knowledge of statistical theory is necessary to fully benefit from the book. A summary of the basic statistical concepts deemed necessary pre requisites is given in chapter 2. The mathematical level is only moderately high, but the account in chapter 3 of basic properties of exponential families and the parametric multinomial distribution is made as mathematical preeise as possible without going into mathematical details and leaving out most proofs.




Longitudinal Categorical Data Analysis


Book Description

This is the first book in longitudinal categorical data analysis with parametric correlation models developed based on dynamic relationships among repeated categorical responses. This book is a natural generalization of the longitudinal binary data analysis to the multinomial data setup with more than two categories. Thus, unlike the existing books on cross-sectional categorical data analysis using log linear models, this book uses multinomial probability models both in cross-sectional and longitudinal setups. A theoretical foundation is provided for the analysis of univariate multinomial responses, by developing models systematically for the cases with no covariates as well as categorical covariates, both in cross-sectional and longitudinal setups. In the longitudinal setup, both stationary and non-stationary covariates are considered. These models have also been extended to the bivariate multinomial setup along with suitable covariates. For the inferences, the book uses the generalized quasi-likelihood as well as the exact likelihood approaches. The book is technically rigorous, and, it also presents illustrations of the statistical analysis of various real life data involving univariate multinomial responses both in cross-sectional and longitudinal setups. This book is written mainly for the graduate students and researchers in statistics and social sciences, among other applied statistics research areas. However, the rest of the book, specifically the chapters from 1 to 3, may also be used for a senior undergraduate course in statistics.




Applied Categorical Data Analysis


Book Description

The nonstatistician's quick reference to applied categorical data analysis With a succinct, unified approach to applied categorical data analysis and an emphasis on applications, this book is immensely useful to researchers and students in the biomedical disciplines and to anyone concerned with statistical analysis. This self-contained volume provides up-to-date coverage of all major methodologies in this area of applied statistics and acquaints the reader with statistical thinking as expressed through a variety of modern-day topics and techniques. Applied Categorical Data Analysis introduces a number of new research areas, including the Mantel-Haenszel method, Kappa statistics, ordinal risks, odds ratio estimates, goodness-of-fit, and various regression models for categorical data. Chap T. Le, author of Health and Numbers and Applied Survival Analysis, presents his information in a user-friendly format and an accessible style while purposefully keeping the mathematics to a level appropriate for students in applied fields. Well supplemented with helpful graphs and tables, Applied Categorical Data Analysis: * Covers both basic and advanced topics * Employs many real-life examples from biomedicine, epidemiology, and public health * Presents case studies in meticulous detail * Provides end-of-chapter exercise sets and solutions * Incorporates samples of computer programs (most notably in SAS). Applied Categorical Data Analysis is an important resource for graduate students and professionals who need a compact reference and guide to both the fundamentals and applications of the major methods in the field.




Categorical Statistics for Communication Research


Book Description

Categorical Statistics for CommunicationResearch presents scholars with a discipline-specific guide to categorical data analysis. The text blends necessary background information and formulas for statistical procedures with data analyses illustrating techniques such as log- linear modeling and logistic regression analysis. Provides techniques for analyzing categorical data from a communication studies perspective Provides an accessible presentation of techniques for analyzing categorical data for communication scholars and other social scientists working at the advanced undergraduate and graduate teaching levels Illustrated with examples from different types of communication research such as health, political and sports communication and entertainment Includes exercises at the end of each chapter and a companion website containing exercise answers and chapter-by-chapter PowerPoint slides