Statistical modeling : a fresh approach


Book Description

"Statistical Modeling: A Fresh Approach introduces and illuminates the statistical reasoning used in modern research throughout the natural and social sciences, medicine, government, and commerce. It emphasizes the use of models to untangle and quantify variation in observed data. By a deft and concise use of computing coupled with an innovative geometrical presentation of the relationship among variables. A Fresh Approach reveals the logic of statistical inference and empowers the reader to use and understand techniques such as analysis of covariance that appear widely in published research but are hardly ever found in introductory texts."-- book cover




Statistical Models and Methods for Financial Markets


Book Description

The idea of writing this bookarosein 2000when the ?rst author wasassigned to teach the required course STATS 240 (Statistical Methods in Finance) in the new M. S. program in ?nancial mathematics at Stanford, which is an interdisciplinary program that aims to provide a master’s-level education in applied mathematics, statistics, computing, ?nance, and economics. Students in the programhad di?erent backgroundsin statistics. Some had only taken a basic course in statistical inference, while others had taken a broad spectrum of M. S. - and Ph. D. -level statistics courses. On the other hand, all of them had already taken required core courses in investment theory and derivative pricing, and STATS 240 was supposed to link the theory and pricing formulas to real-world data and pricing or investment strategies. Besides students in theprogram,thecoursealso attractedmanystudentsfromother departments in the university, further increasing the heterogeneity of students, as many of them had a strong background in mathematical and statistical modeling from the mathematical, physical, and engineering sciences but no previous experience in ?nance. To address the diversity in background but common strong interest in the subject and in a potential career as a “quant” in the ?nancialindustry,thecoursematerialwascarefullychosennotonlytopresent basic statistical methods of importance to quantitative ?nance but also to summarize domain knowledge in ?nance and show how it can be combined with statistical modeling in ?nancial analysis and decision making. The course material evolved over the years, especially after the second author helped as the head TA during the years 2004 and 2005.




Statistical Models


Book Description

This lively and engaging book explains the things you have to know in order to read empirical papers in the social and health sciences, as well as the techniques you need to build statistical models of your own. The discussion in the book is organized around published studies, as are many of the exercises. Relevant journal articles are reprinted at the back of the book. Freedman makes a thorough appraisal of the statistical methods in these papers and in a variety of other examples. He illustrates the principles of modelling, and the pitfalls. The discussion shows you how to think about the critical issues - including the connection (or lack of it) between the statistical models and the real phenomena. The book is written for advanced undergraduates and beginning graduate students in statistics, as well as students and professionals in the social and health sciences.




Probability, Statistics, and Data


Book Description

This book is a fresh approach to a calculus based, first course in probability and statistics, using R throughout to give a central role to data and simulation. The book introduces probability with Monte Carlo simulation as an essential tool. Simulation makes challenging probability questions quickly accessible and easily understandable. Mathematical approaches are included, using calculus when appropriate, but are always connected to experimental computations. Using R and simulation gives a nuanced understanding of statistical inference. The impact of departure from assumptions in statistical tests is emphasized, quantified using simulations, and demonstrated with real data. The book compares parametric and non-parametric methods through simulation, allowing for a thorough investigation of testing error and power. The text builds R skills from the outset, allowing modern methods of resampling and cross validation to be introduced along with traditional statistical techniques. Fifty-two data sets are included in the complementary R package fosdata. Most of these data sets are from recently published papers, so that you are working with current, real data, which is often large and messy. Two central chapters use powerful tidyverse tools (dplyr, ggplot2, tidyr, stringr) to wrangle data and produce meaningful visualizations. Preliminary versions of the book have been used for five semesters at Saint Louis University, and the majority of the more than 400 exercises have been classroom tested.




Applied Linear Statistical Models


Book Description

Linear regression with one predictor variable; Inferences in regression and correlation analysis; Diagnosticis and remedial measures; Simultaneous inferences and other topics in regression analysis; Matrix approach to simple linear regression analysis; Multiple linear regression; Nonlinear regression; Design and analysis of single-factor studies; Multi-factor studies; Specialized study designs.




Statistical Models for Causal Analysis


Book Description

Simplifies the treatment of statistical inference focusing on how to specify and interpret models in the context of testing causal theories. Simple bivariate regression, multiple regression, multiple classification analysis, path analysis, logit regression, multinomial logit regression and survival models are among the subjects covered. Features an appendix of computer programs (for major statistical packages) that are used to generate illustrative examples contained in the chapters.




Modern Data Science with R


Book Description

From a review of the first edition: "Modern Data Science with R... is rich with examples and is guided by a strong narrative voice. What’s more, it presents an organizing framework that makes a convincing argument that data science is a course distinct from applied statistics" (The American Statistician). Modern Data Science with R is a comprehensive data science textbook for undergraduates that incorporates statistical and computational thinking to solve real-world data problems. Rather than focus exclusively on case studies or programming syntax, this book illustrates how statistical programming in the state-of-the-art R/RStudio computing environment can be leveraged to extract meaningful information from a variety of data in the service of addressing compelling questions. The second edition is updated to reflect the growing influence of the tidyverse set of packages. All code in the book has been revised and styled to be more readable and easier to understand. New functionality from packages like sf, purrr, tidymodels, and tidytext is now integrated into the text. All chapters have been revised, and several have been split, re-organized, or re-imagined to meet the shifting landscape of best practice.




Statistical Methods for the Social and Behavioural Sciences


Book Description

Statistical methods in modern research increasingly entail developing, estimating and testing models for data. Rather than rigid methods of data analysis, the need today is for more flexible methods for modelling data. In this logical, easy-to-follow and exceptionally clear book, David Flora provides a comprehensive survey of the major statistical procedures currently used. His innovative model-based approach teaches you how to: Understand and choose the right statistical model to fit your data Match substantive theory and statistical models Apply statistical procedures hands-on, with example data analyses Develop and use graphs to understand data and fit models to data Work with statistical modeling principles using any software package Learn by applying, with input and output files for R, SAS, SPSS, and Mplus. Statistical Methods for the Social and Behavioural Sciences: A Model Based Approach is the essential guide for those looking to extend their understanding of the principles of statistics, and begin using the right statistical modeling method for their own data. It is particularly suited to second or advanced courses in statistical methods across the social and behavioural sciences.




Statistical Mechanics


Book Description

Statistical Mechanics: Fundamentals and Model Solutions, Second Edition Fully updated throughout and with new chapters on the Mayer expansion for classical gases and on cluster expansion for lattice models, this new edition of Statistical Mechanics: Fundamentals and Model Solutions provides a comprehensive introduction to equilibrium statistical mechanics for advanced undergraduate and graduate students of mathematics and physics. The author presents a fresh approach to the subject, setting out the basic assumptions clearly and emphasizing the importance of the thermodynamic limit and the role of convexity. With problems and solutions, the book clearly explains the role of models for physical systems, and discusses and solves various models. An understanding of these models is of increasing importance as they have proved to have applications in many areas of mathematics and physics. Features Updated throughout with new content from the field An established and well-loved textbook Contains new problems and solutions for further learning opportunity Author Professor Teunis C. Dorlas is at the Dublin Institute for Advanced Studies, Ireland.




An Introduction to Statistical Learning


Book Description

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance, marketing, and astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, deep learning, survival analysis, multiple testing, and more. Color graphics and real-world examples are used to illustrate the methods presented. This book is targeted at statisticians and non-statisticians alike, who wish to use cutting-edge statistical learning techniques to analyze their data. Four of the authors co-wrote An Introduction to Statistical Learning, With Applications in R (ISLR), which has become a mainstay of undergraduate and graduate classrooms worldwide, as well as an important reference book for data scientists. One of the keys to its success was that each chapter contains a tutorial on implementing the analyses and methods presented in the R scientific computing environment. However, in recent years Python has become a popular language for data science, and there has been increasing demand for a Python-based alternative to ISLR. Hence, this book (ISLP) covers the same materials as ISLR but with labs implemented in Python. These labs will be useful both for Python novices, as well as experienced users.