Applied Smoothing Techniques for Data Analysis


Book Description

The book describes the use of smoothing techniques in statistics, including both density estimation and nonparametric regression. Considerable advances in research in this area have been made in recent years. The aim of this text is to describe a variety of ways in which these methods can be applied to practical problems in statistics. The role of smoothing techniques in exploring data graphically is emphasised, but the use of nonparametric curves in drawing conclusions from data, as an extension of more standard parametric models, is also a major focus of the book. Examples are drawn from a wide range of applications. The book is intended for those who seek an introduction to the area, with an emphasis on applications rather than on detailed theory. It is therefore expected that the book will benefit those attending courses at an advanced undergraduate, or postgraduate, level, as well as researchers, both from statistics and from other disciplines, who wish to learn about and apply these techniques in practical data analysis. The text makes extensive reference to S-Plus, as a computing environment in which examples can be explored. S-Plus functions and example scripts are provided to implement many of the techniques described. These parts are, however, clearly separate from the main body of text, and can therefore easily be skipped by readers not interested in S-Plus.




Introduction to Data Science


Book Description

Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data visualization, predictive algorithm building, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation. This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful. The book is divided into six parts: R, data visualization, statistics with R, data wrangling, machine learning, and productivity tools. Each part has several chapters meant to be presented as one lecture. The author uses motivating case studies that realistically mimic a data scientist’s experience. He starts by asking specific questions and answers these through data analysis so concepts are learned as a means to answering the questions. Examples of the case studies included are: US murder rates by state, self-reported student heights, trends in world health and economics, the impact of vaccines on infectious disease rates, the financial crisis of 2007-2008, election forecasting, building a baseball team, image processing of hand-written digits, and movie recommendation systems. The statistical concepts used to answer the case study questions are only briefly introduced, so complementing with a probability and statistics textbook is highly recommended for in-depth understanding of these concepts. If you read and understand the chapters and complete the exercises, you will be prepared to learn the more advanced concepts and skills needed to become an expert.




Smoothing and Regression


Book Description

A comprehensive introduction to a wide variety of univariate and multivariate smoothing techniques for regression Smoothing and Regression: Approaches, Computation, and Application bridges the many gaps that exist among competing univariate and multivariate smoothing techniques. It introduces, describes, and in some cases compares a large number of the latest and most advanced techniques for regression modeling. Unlike many other volumes on this topic, which are highly technical and specialized, this book discusses all methods in light of both computational efficiency and their applicability for real data analysis. Using examples of applications from the biosciences, environmental sciences, engineering, and economics, as well as medical research and marketing, this volume addresses the theory, computation, and application of each approach. A number of the techniques discussed, such as smoothing under shape restrictions or of dependent data, are presented for the first time in book form. Special features of this book include: * Comprehensive coverage of smoothing and regression with software hints and applications from a wide variety of disciplines * A unified, easy-to-follow format * Contributions from more than 25 leading researchers from around the world * More than 150 illustrations also covering new graphical techniques important for exploratory data analysis and visualization of high-dimensional problems * Extensive end-of-chapter references For professionals and aspiring professionals in statistics, applied mathematics, computer science, and econometrics, as well as for researchers in the applied and social sciences, Smoothing and Regression is a unique and important new resource destined to become one the most frequently consulted references in the field.




Smoothing Techniques


Book Description

The author has attempted to present a book that provides a non-technical introduction into the area of non-parametric density and regression function estimation. The application of these methods is discussed in terms of the S computing environment. Smoothing in high dimensions faces the problem of data sparseness. A principal feature of smoothing, the averaging of data points in a prescribed neighborhood, is not really practicable in dimensions greater than three if we have just one hundred data points. Additive models provide a way out of this dilemma; but, for their interactiveness and recursiveness, they require highly effective algorithms. For this purpose, the method of WARPing (Weighted Averaging using Rounded Points) is described in great detail.




Applied Nonparametric Regression


Book Description

This is the first book to bring together in one place the techniques for regression curve smoothing involving more than one variable.




Smoothing of Multivariate Data


Book Description

An applied treatment of the key methods and state-of-the-art tools for visualizing and understanding statistical data Smoothing of Multivariate Data provides an illustrative and hands-on approach to the multivariate aspects of density estimation, emphasizing the use of visualization tools. Rather than outlining the theoretical concepts of classification and regression, this book focuses on the procedures for estimating a multivariate distribution via smoothing. The author first provides an introduction to various visualization tools that can be used to construct representations of multivariate functions, sets, data, and scales of multivariate density estimates. Next, readers are presented with an extensive review of the basic mathematical tools that are needed to asymptotically analyze the behavior of multivariate density estimators, with coverage of density classes, lower bounds, empirical processes, and manipulation of density estimates. The book concludes with an extensive toolbox of multivariate density estimators, including anisotropic kernel estimators, minimization estimators, multivariate adaptive histograms, and wavelet estimators. A completely interactive experience is encouraged, as all examples and figurescan be easily replicated using the R software package, and every chapter concludes with numerous exercises that allow readers to test their understanding of the presented techniques. The R software is freely available on the book's related Web site along with "Code" sections for each chapter that provide short instructions for working in the R environment. Combining mathematical analysis with practical implementations, Smoothing of Multivariate Data is an excellent book for courses in multivariate analysis, data analysis, and nonparametric statistics at the upper-undergraduate and graduatelevels. It also serves as a valuable reference for practitioners and researchers in the fields of statistics, computer science, economics, and engineering.




Data Analysis Methods in Physical Oceanography


Book Description

Data Analysis Methods in Physical Oceanography is a practical referenceguide to established and modern data analysis techniques in earth and oceansciences. This second and revised edition is even more comprehensive with numerous updates, and an additional appendix on 'Convolution and Fourier transforms'. Intended for both students and established scientists, the fivemajor chapters of the book cover data acquisition and recording, dataprocessing and presentation, statistical methods and error handling,analysis of spatial data fields, and time series analysis methods. Chapter 5on time series analysis is a book in itself, spanning a wide diversity oftopics from stochastic processes and stationarity, coherence functions,Fourier analysis, tidal harmonic analysis, spectral and cross-spectralanalysis, wavelet and other related methods for processing nonstationarydata series, digital filters, and fractals. The seven appendices includeunit conversions, approximation methods and nondimensional numbers used ingeophysical fluid dynamics, presentations on convolution, statisticalterminology, and distribution functions, and a number of importantstatistical tables. Twenty pages are devoted to references. Featuring:• An in-depth presentation of modern techniques for the analysis of temporal and spatial data sets collected in oceanography, geophysics, and other disciplines in earth and ocean sciences.• A detailed overview of oceanographic instrumentation and sensors - old and new - used to collect oceanographic data.• 7 appendices especially applicable to earth and ocean sciences ranging from conversion of units, through statistical tables, to terminology and non-dimensional parameters. In praise of the first edition: "(...)This is a very practical guide to the various statistical analysis methods used for obtaining information from geophysical data, with particular reference to oceanography(...)The book provides both a text for advanced students of the geophysical sciences and a useful reference volume for researchers." Aslib Book Guide Vol 63, No. 9, 1998 "(...)This is an excellent book that I recommend highly and will definitely use for my own research and teaching." EOS Transactions, D.A. Jay, 1999 "(...)In summary, this book is the most comprehensive and practical source of information on data analysis methods available to the physical oceanographer. The reader gets the benefit of extremely broad coverage and an excellent set of examples drawn from geographical observations." Oceanography, Vol. 12, No. 3, A. Plueddemann, 1999 "(...)Data Analysis Methods in Physical Oceanography is highly recommended for a wide range of readers, from the relative novice to the experienced researcher. It would be appropriate for academic and special libraries." E-Streams, Vol. 2, No. 8, P. Mofjelf, August 1999




Nonparametric Regression Methods for Longitudinal Data Analysis


Book Description

Incorporates mixed-effects modeling techniques for more powerful and efficient methods This book presents current and effective nonparametric regression techniques for longitudinal data analysis and systematically investigates the incorporation of mixed-effects modeling techniques into various nonparametric regression models. The authors emphasize modeling ideas and inference methodologies, although some theoretical results for the justification of the proposed methods are presented. With its logical structure and organization, beginning with basic principles, the text develops the foundation needed to master advanced principles and applications. Following a brief overview, data examples from biomedical research studies are presented and point to the need for nonparametric regression analysis approaches. Next, the authors review mixed-effects models and nonparametric regression models, which are the two key building blocks of the proposed modeling techniques. The core section of the book consists of four chapters dedicated to the major nonparametric regression methods: local polynomial, regression spline, smoothing spline, and penalized spline. The next two chapters extend these modeling techniques to semiparametric and time varying coefficient models for longitudinal data analysis. The final chapter examines discrete longitudinal data modeling and analysis. Each chapter concludes with a summary that highlights key points and also provides bibliographic notes that point to additional sources for further study. Examples of data analysis from biomedical research are used to illustrate the methodologies contained throughout the book. Technical proofs are presented in separate appendices. With its focus on solving problems, this is an excellent textbook for upper-level undergraduate and graduate courses in longitudinal data analysis. It is also recommended as a reference for biostatisticians and other theoretical and applied research statisticians with an interest in longitudinal data analysis. Not only do readers gain an understanding of the principles of various nonparametric regression methods, but they also gain a practical understanding of how to use the methods to tackle real-world problems.




Functional Data Analysis


Book Description

Included here are expressions in the functional domain of such classics as linear regression, principal components analysis, linear modelling, and canonical correlation analysis, as well as specifically functional techniques such as curve registration and principal differential analysis. Data arising in real applications are used throughout for both motivation and illustration, showing how functional approaches allow us to see new things, especially by exploiting the smoothness of the processes generating the data. The data sets exemplify the wide scope of functional data analysis; they are drawn from growth analysis, meteorology, biomechanics, equine science, economics, and medicine. The book presents novel statistical technology while keeping the mathematical level widely accessible. It is designed to appeal to students, applied data analysts, and to experienced researchers; and as such is of value both within statistics and across a broad spectrum of other fields. Much of the material appears here for the first time.




Applied Functional Data Analysis


Book Description

This book contains the ideas of functional data analysis by a number of case studies. The case studies are accessible to research workers in a wide range of disciplines. Every reader should gain not only a specific understanding of the methods of functional data analysis, but more importantly a general insight into the underlying patterns of thought. There is an associated web site with MATLABr and S?PLUSr implementations of the methods discussed.