Project-Based R Companion to Introductory Statistics


Book Description

Project-Based R Companion to Introductory Statistics is envisioned as a companion to a traditional statistics or biostatistics textbook, with each chapter covering traditional topics such as descriptive statistics, regression, and hypothesis testing. However, unlike a traditional textbook, each chapter will present its material using a complete step-by-step analysis of a real publicly available dataset, with an emphasis on the practical skills of testing assumptions, data exploration, and forming conclusions. The chapters in the main body of the book include a worked example showing the R code used at each step followed by a multi-part project for students to complete. These projects, which could serve as alternatives to traditional discrete homework problems, will illustrate how to "put the pieces together" and conduct a complete start-to-finish data analysis using the R statistical software package. At the end of the book, there are several projects that require the use of multiple statistical techniques that could be used as a take-home final exam or final project for a class. Key features of the text: Organized in chapters focusing on the same topics found in typical introductory statistics textbooks (descriptive statistics, regression, two-way tables, hypothesis testing for means and proportions, etc.) so instructors can easily pair this supplementary material with course plans Includes student projects for each chapter which can be assigned as laboratory exercises or homework assignments to supplement traditional homework Features real-world datasets from scientific publications in the fields of history, pop culture, business, medicine, and forensics for students to analyze Allows students to gain experience working through a variety of statistical analyses from start to finish The book is written at the undergraduate level to be used in an introductory statistical methods course or subject-specific research methods course such as biostatistics or research methods for psychology or business analytics. Author After a 10-year career as a research biostatistician in the Department of Ophthalmology and Visual Sciences at the University of Wisconsin-Madison, Chelsea Myers teaches statistics and biostatistics at Rollins College and Valencia College in Central Florida. She has authored or co-authored more than 30 scientific papers and presentations and is the creator of the MCAT preparation website MCATMath.com.




Project-Based R Companion to Introductory Statistics


Book Description

Project-Based R Companion to Introductory Statistics is envisioned as a companion to a traditional statistics or biostatistics textbook, with each chapter covering traditional topics such as descriptive statistics, regression, and hypothesis testing. However, unlike a traditional textbook, each chapter will present its material using a complete step-by-step analysis of a real publicly available dataset, with an emphasis on the practical skills of testing assumptions, data exploration, and forming conclusions. The chapters in the main body of the book include a worked example showing the R code used at each step followed by a multi-part project for students to complete. These projects, which could serve as alternatives to traditional discrete homework problems, will illustrate how to "put the pieces together" and conduct a complete start-to-finish data analysis using the R statistical software package. At the end of the book, there are several projects that require the use of multiple statistical techniques that could be used as a take-home final exam or final project for a class. Key features of the text: Organized in chapters focusing on the same topics found in typical introductory statistics textbooks (descriptive statistics, regression, two-way tables, hypothesis testing for means and proportions, etc.) so instructors can easily pair this supplementary material with course plans Includes student projects for each chapter which can be assigned as laboratory exercises or homework assignments to supplement traditional homework Features real-world datasets from scientific publications in the fields of history, pop culture, business, medicine, and forensics for students to analyze Allows students to gain experience working through a variety of statistical analyses from start to finish The book is written at the undergraduate level to be used in an introductory statistical methods course or subject-specific research methods course such as biostatistics or research methods for psychology or business analytics. Author After a 10-year career as a research biostatistician in the Department of Ophthalmology and Visual Sciences at the University of Wisconsin-Madison, Chelsea Myers teaches statistics and biostatistics at Rollins College and Valencia College in Central Florida. She has authored or co-authored more than 30 scientific papers and presentations and is the creator of the MCAT preparation website MCATMath.com.




Using R for Introductory Statistics


Book Description

The second edition of a bestselling textbook, Using R for Introductory Statistics guides students through the basics of R, helping them overcome the sometimes steep learning curve. The author does this by breaking the material down into small, task-oriented steps. The second edition maintains the features that made the first edition so popular, while updating data, examples, and changes to R in line with the current version. See What’s New in the Second Edition: Increased emphasis on more idiomatic R provides a grounding in the functionality of base R. Discussions of the use of RStudio helps new R users avoid as many pitfalls as possible. Use of knitr package makes code easier to read and therefore easier to reason about. Additional information on computer-intensive approaches motivates the traditional approach. Updated examples and data make the information current and topical. The book has an accompanying package, UsingR, available from CRAN, R’s repository of user-contributed packages. The package contains the data sets mentioned in the text (data(package="UsingR")), answers to selected problems (answers()), a few demonstrations (demo()), the errata (errata()), and sample code from the text. The topics of this text line up closely with traditional teaching progression; however, the book also highlights computer-intensive approaches to motivate the more traditional approach. The authors emphasize realistic data and examples and rely on visualization techniques to gather insight. They introduce statistics and R seamlessly, giving students the tools they need to use R and the information they need to navigate the sometimes complex world of statistical computing.




R and Statdisk


Book Description

Delve into the fascinating field of Statistics. Explore introductory level Statistics topics through the lens of how those calculations and decisions can be impacted by values, people, beliefs, and information in the world around us. Learn to calculate the mathematically correct solution and then explore whether the "correct" solutions is the "right" decision. Expand your analysis using publisher resources such as Statdisk, or free analysis tools such as programming with R.




R Companion to Elementary Applied Statistics


Book Description

The R Companion to Elementary Applied Statistics includes traditional applications covered in elementary statistics courses as well as some additional methods that address questions that might arise during or after the application of commonly used methods. Beginning with basic tasks and computations with R, readers are then guided through ways to bring data into R, manipulate the data as needed, perform common statistical computations and elementary exploratory data analysis tasks, prepare customized graphics, and take advantage of R for a wide range of methods that find use in many elementary applications of statistics. Features: Requires no familiarity with R or programming to begin using this book. Can be used as a resource for a project-based elementary applied statistics course, or for researchers and professionals who wish to delve more deeply into R. Contains an extensive array of examples that illustrate ideas on various ways to use pre-packaged routines, as well as on developing individualized code. Presents quite a few methods that may be considered non-traditional, or advanced. Includes accompanying carefully documented script files that contain code for all examples presented, and more. R is a powerful and free product that is gaining popularity across the scientific community in both the professional and academic arenas. Statistical methods discussed in this book are used to introduce the fundamentals of using R functions and provide ideas for developing further skills in writing R code. These ideas are illustrated through an extensive collection of examples. About the Author: Christopher Hay-Jahans received his Doctor of Arts in mathematics from Idaho State University in 1999. After spending three years at University of South Dakota, he moved to Juneau, Alaska, in 2002 where he has taught a wide range of undergraduate courses at University of Alaska Southeast.




Introductory Statistics with R


Book Description

This book provides an elementary-level introduction to R, targeting both non-statistician scientists in various fields and students of statistics. The main mode of presentation is via code examples with liberal commenting of the code and the output, from the computational as well as the statistical viewpoint. Brief sections introduce the statistical methods before they are used. A supplementary R package can be downloaded and contains the data sets. All examples are directly runnable and all graphics in the text are generated from the examples. The statistical methodology covered includes statistical standard distributions, one- and two-sample tests with continuous data, regression analysis, one-and two-way analysis of variance, regression analysis, analysis of tabular data, and sample size calculations. In addition, the last four chapters contain introductions to multiple linear regression analysis, linear models in general, logistic regression, and survival analysis.




The R Book


Book Description

The high-level language of R is recognized as one of the mostpowerful and flexible statistical software environments, and israpidly becoming the standard setting for quantitative analysis,statistics and graphics. R provides free access to unrivalledcoverage and cutting-edge applications, enabling the user to applynumerous statistical methods ranging from simple regression to timeseries or multivariate analysis. Building on the success of the author’s bestsellingStatistics: An Introduction using R, The R Book ispacked with worked examples, providing an all inclusive guide to R,ideal for novice and more accomplished users alike. The bookassumes no background in statistics or computing and introduces theadvantages of the R environment, detailing its applications in awide range of disciplines. Provides the first comprehensive reference manual for the Rlanguage, including practical guidance and full coverage of thegraphics facilities. Introduces all the statistical models covered by R, beginningwith simple classical tests such as chi-square and t-test. Proceeds to examine more advance methods, from regression andanalysis of variance, through to generalized linear models,generalized mixed models, time series, spatial statistics,multivariate statistics and much more. The R Book is aimed at undergraduates, postgraduates andprofessionals in science, engineering and medicine. It is alsoideal for students and professionals in statistics, economics,geography and the social sciences.




Introductory Statistics


Book Description

This comprehensive and uniquely organized text is aimed at undergraduate and graduate level statistics courses in education, psychology, and other social sciences. A conceptual approach, built around common issues and problems rather than statistical techniques, allows students to understand the conceptual nature of statistical procedures and to focus more on cases and examples of analysis. Wherever possible, presentations contain explanations of the underlying reasons behind a technique. Importantly, this is one of the first statistics texts in the social sciences using R as the principal statistical package. Key features include the following. Conceptual Focus – The focus throughout is more on conceptual understanding and attainment of statistical literacy and thinking than on learning a set of tools and procedures. Problems and Cases – Chapters and sections open with examples of situations related to the forthcoming issues, and major sections ends with a case study. For example, after the section on describing relationships between variables, there is a worked case that demonstrates the analyses, presents computer output, and leads the student through an interpretation of that output. Continuity of Examples – A master data set containing nearly all of the data used in the book’s examples is introduced at the beginning of the text. This ensures continuity in the examples used across the text. Companion Website – A companion website contains instructions on how to use R, SAS, and SPSS to solve the end-of-chapter exercises and offers additional exercises. Field Tested – The manuscript has been field tested for three years at two leading institutions.




R for Data Science


Book Description

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you've learned along the way. You'll learn how to: Wrangle—transform your datasets into a form convenient for analysis Program—learn powerful R tools for solving data problems with greater clarity and ease Explore—examine your data, generate hypotheses, and quickly test them Model—provide a low-dimensional summary that captures true "signals" in your dataset Communicate—learn R Markdown for integrating prose, code, and results




An Introduction to Statistical Learning


Book Description

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance, marketing, and astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, deep learning, survival analysis, multiple testing, and more. Color graphics and real-world examples are used to illustrate the methods presented. This book is targeted at statisticians and non-statisticians alike, who wish to use cutting-edge statistical learning techniques to analyze their data. Four of the authors co-wrote An Introduction to Statistical Learning, With Applications in R (ISLR), which has become a mainstay of undergraduate and graduate classrooms worldwide, as well as an important reference book for data scientists. One of the keys to its success was that each chapter contains a tutorial on implementing the analyses and methods presented in the R scientific computing environment. However, in recent years Python has become a popular language for data science, and there has been increasing demand for a Python-based alternative to ISLR. Hence, this book (ISLP) covers the same materials as ISLR but with labs implemented in Python. These labs will be useful both for Python novices, as well as experienced users.