Gaussian Process Regression Analysis for Functional Data


Book Description

Gaussian Process Regression Analysis for Functional Data presents nonparametric statistical methods for functional regression analysis, specifically the methods based on a Gaussian process prior in a functional space. The authors focus on problems involving functional response variables and mixed covariates of functional and scalar variables. Covering the basics of Gaussian process regression, the first several chapters discuss functional data analysis, theoretical aspects based on the asymptotic properties of Gaussian process regression models, and new methodological developments for high dimensional data and variable selection. The remainder of the text explores advanced topics of functional regression analysis, including novel nonparametric statistical methods for curve prediction, curve clustering, functional ANOVA, and functional regression analysis of batch data, repeated curves, and non-Gaussian data. Many flexible models based on Gaussian processes provide efficient ways of model learning, interpreting model structure, and carrying out inference, particularly when dealing with large dimensional functional data. This book shows how to use these Gaussian process regression models in the analysis of functional data. Some MATLAB® and C codes are available on the first author’s website.







Geostatistical Functional Data Analysis


Book Description

Geostatistical Functional Data Analysis Explore the intersection between geostatistics and functional data analysis with this insightful new reference Geostatistical Functional Data Analysis presents a unified approach to modelling functional data when spatial and spatio-temporal correlations are present. The Editors link together the wide research areas of geostatistics and functional data analysis to provide the reader with a new area called geostatistical functional data analysis that will bring new insights and new open questions to researchers coming from both scientific fields. This book provides a complete and up-to-date account to deal with functional data that is spatially correlated, but also includes the most innovative developments in different open avenues in this field. Containing contributions from leading experts in the field, this practical guide provides readers with the necessary tools to employ and adapt classic statistical techniques to handle spatial regression. The book also includes: A thorough introduction to the spatial kriging methodology when working with functions A detailed exposition of more classical statistical techniques adapted to the functional case and extended to handle spatial correlations Practical discussions of ANOVA, regression, and clustering methods to explore spatial correlation in a collection of curves sampled in a region In-depth explorations of the similarities and differences between spatio-temporal data analysis and functional data analysis Aimed at mathematicians, statisticians, postgraduate students, and researchers involved in the analysis of functional and spatial data, Geostatistical Functional Data Analysis will also prove to be a powerful addition to the libraries of geoscientists, environmental scientists, and economists seeking insightful new knowledge and questions at the interface of geostatistics and functional data analysis.




Functional Regression Models


Book Description

The conventional method for functional quantile regression is to fit the regression model for each quantile of interest separately. The slope function of the regression, as a bivariate function indexed by time and quantile, is actually estimated as a univariate function of time only by first fixing the quantile. This estimation strategy has two major limitations. The monotonicity of conditional quantiles can not be guaranteed, and the smoothness of the slope estimator as a bivariate function can not be controlled. We develop a new framework for functional quantile regression to overcome the two limitations. We propose to simultaneously fit the functional quantile regression model for multiple quantiles under some constraints so that the estimated quantiles satisfy the monotonicity conditions. Meanwhile, the smoothness of the slope estimator is controlled. Motivated by an application of modeling the impact of daily temperature, annual precipitation and irrigation system on soybean yield, we propose two locally sparse estimation methods under a semi-parametric functional quantile regression model. In the target application, the daily temperature is a functional predictor, and the influence of daily temperature on soybean yield may not always exist during the whole growing season. We aim to identify the time regions where the influence exists. For this purpose, in two projects, we use two different penalized estimation methods, functional SCAD and modified group lasso, to obtain locally sparse estimations for the bivariate slope function associated with the functional predictor. Focusing on the soybean yield application introduced above, we further propose a novel semi-parametric functional generalized linear model (FGLM) to analyze the relationship between the environmental factors and the soybean yield. In this project, we consider the data from different years as from different populations due to the fact that the climate conditions can be very different year by year. Based on the new assumption, the main challenge is that we only have limited number of observations for each year. To solve this issue, we combine a density ratio model with the proposed semi-parametric FGLM so that the new framework can be fitted using the pool data. We propose to use a combination of penalized B-spline and empirical likelihood method to fit the model. The proposed method is highly flexible and robust to model misspecification.




Predictions in Time Series Using Regression Models


Book Description

Regression methods have been a necessary piece of time arrangement investigation for over a century. As of late, new advancements have made real walks in such territories as non-constant information where a direct model isn't fitting. This book acquaints the peruser with fresher improvements and more assorted regression models and methods for time arrangement examination. Open to any individual who knows about the fundamental present day ideas of factual deduction, Regression Models for Time Series Analysis gives a truly necessary examination of late measurable advancements. Essential among them is the imperative class of models known as summed up straight models (GLM) which gives, under a few conditions, a bound together regression hypothesis reasonable for constant, all out, and check information. The creators stretch out GLM methodology deliberately to time arrangement where the essential and covariate information are both arbitrary and stochastically reliant. They acquaint readers with different regression models created amid the most recent thirty years or somewhere in the vicinity and condense traditional and later outcomes concerning state space models.




Regression Modelling wih Spatial and Spatial-Temporal Data


Book Description

Modelling Spatial and Spatial-Temporal Data: A Bayesian Approach is aimed at statisticians and quantitative social, economic and public health students and researchers who work with spatial and spatial-temporal data. It assumes a grounding in statistical theory up to the standard linear regression model. The book compares both hierarchical and spatial econometric modelling, providing both a reference and a teaching text with exercises in each chapter. The book provides a fully Bayesian, self-contained, treatment of the underlying statistical theory, with chapters dedicated to substantive applications. The book includes WinBUGS code and R code and all datasets are available online. Part I covers fundamental issues arising when modelling spatial and spatial-temporal data. Part II focuses on modelling cross-sectional spatial data and begins by describing exploratory methods that help guide the modelling process. There are then two theoretical chapters on Bayesian models and a chapter of applications. Two chapters follow on spatial econometric modelling, one describing different models, the other substantive applications. Part III discusses modelling spatial-temporal data, first introducing models for time series data. Exploratory methods for detecting different types of space-time interaction are presented followed by two chapters on the theory of space-time separable (without space-time interaction) and inseparable (with space-time interaction) models. An applications chapter includes: the evaluation of a policy intervention; analysing the temporal dynamics of crime hotspots; chronic disease surveillance; and testing for evidence of spatial spillovers in the spread of an infectious disease. A final chapter suggests some future directions and challenges.




Functional Data Analysis with R and MATLAB


Book Description

The book provides an application-oriented overview of functional analysis, with extended and accessible presentations of key concepts such as spline basis functions, data smoothing, curve registration, functional linear models and dynamic systems Functional data analysis is put to work in a wide a range of applications, so that new problems are likely to find close analogues in this book The code in R and Matlab in the book has been designed to permit easy modification to adapt to new data structures and research problems




Analysis of Time Dynamics and Additive Modeling for Functional Data


Book Description

In the first chapter we demonstrate that, while in most current data modeling for time-dynamic systems, one works witha pre-specified differential equation and attempts to fit its parameters, in the case of functional data, the equation itself can be inferred from the data. Only assuming that the dynamics are described by a first order nonlinear stochastic differential equation, we obtain these data-adaptive dynamic equations via a simple smoothing-based procedure from the observed data. They reveal the basic dynamics of the processes that generate the observed functional data. We prove consistency and introduce diagnostics to ascertain the fraction of variance that is explained by the deterministic part of the equation. This approach is shown to yield useful insights into the time-dynamic nature of human growth. In the second chapter, we propose a nonparametric method to explicitly model and represent the derivatives of smooth underlying trajectories for longitudinal data. This representation is based on a direct Karhunen-Loève expansion of the unobserved derivatives and leads to the notion of derivative principal components, which complement functional principal components, one of the most popular tools of functional data analysis. The proposed derivative principal components can be obtained for irregularly spaced and sparsely observed longitudinal data, as typically encountered in biomedical studies, as well as for noisy longitudinal data measured on regular and dense grids. We compare the proposed representations for derivatives with alternative approaches via simulation studies, and show that representations by derivative principal components can recover the underlying derivatives more accurately in various settings. As an example, we demonstrate the utility of derivative principal components for the classification of wheat spectra, where the spectral measurements are densely and regularly spaced. In this application, derivative principal components are found to be more predictive for the moisture content of wheat than the conventional functional principal components, and to have improved classification performance. Consistency results and asymptotic convergence rates for the proposed estimates of derivative principal components and other components of the model are derived under mild conditions, using tools from functional data analysis. In the last chapter, we consider a functional regression setting and introduce novel representations to improve prediction accuracy. In various functional regression settings one observes i.i.d. samplesof paired stochastic processes (X,Y), and is interested in predicting the trajectory of Y, given the trajectory X. For example, one may wish to predict the future of a process fromobserving an initial segment of the trajectory. Commonly used functional regression models are based on representations that are obtained separately for X and Y such as functional principal components. In contrast to these established methods, we base our approach on a singular expansion of the paired processes X, Y with singular functions that are derived from the cross-covariance surface between X and Y. The motivation for this approach is that the resulting singular components are better tuned towards reflecting the association between X and Y. The regression relationship is then based onthe assumption that each singular component of Y, or each scalar Y(t), follows an additive regression model with the singular components of X as predictors. The resulting singular additive models are fitted by smooth backfitting. We study the behavior of an implementation of this approach in simulations and data analysis.




Statistics for Spatio-Temporal Data


Book Description

Winner of the 2013 DeGroot Prize. A state-of-the-art presentation of spatio-temporal processes, bridging classic ideas with modern hierarchical statistical modeling concepts and the latest computational methods Noel Cressie and Christopher K. Wikle, are also winners of the 2011 PROSE Award in the Mathematics category, for the book “Statistics for Spatio-Temporal Data” (2011), published by John Wiley and Sons. (The PROSE awards, for Professional and Scholarly Excellence, are given by the Association of American Publishers, the national trade association of the US book publishing industry.) Statistics for Spatio-Temporal Data has now been reprinted with small corrections to the text and the bibliography. The overall content and pagination of the new printing remains the same; the difference comes in the form of corrections to typographical errors, editing of incomplete and missing references, and some updated spatio-temporal interpretations. From understanding environmental processes and climate trends to developing new technologies for mapping public-health data and the spread of invasive-species, there is a high demand for statistical analyses of data that take spatial, temporal, and spatio-temporal information into account. Statistics for Spatio-Temporal Data presents a systematic approach to key quantitative techniques that incorporate the latest advances in statistical computing as well as hierarchical, particularly Bayesian, statistical modeling, with an emphasis on dynamical spatio-temporal models. Cressie and Wikle supply a unique presentation that incorporates ideas from the areas of time series and spatial statistics as well as stochastic processes. Beginning with separate treatments of temporal data and spatial data, the book combines these concepts to discuss spatio-temporal statistical methods for understanding complex processes. Topics of coverage include: Exploratory methods for spatio-temporal data, including visualization, spectral analysis, empirical orthogonal function analysis, and LISAs Spatio-temporal covariance functions, spatio-temporal kriging, and time series of spatial processes Development of hierarchical dynamical spatio-temporal models (DSTMs), with discussion of linear and nonlinear DSTMs and computational algorithms for their implementation Quantifying and exploring spatio-temporal variability in scientific applications, including case studies based on real-world environmental data Throughout the book, interesting applications demonstrate the relevance of the presented concepts. Vivid, full-color graphics emphasize the visual nature of the topic, and a related FTP site contains supplementary material. Statistics for Spatio-Temporal Data is an excellent book for a graduate-level course on spatio-temporal statistics. It is also a valuable reference for researchers and practitioners in the fields of applied mathematics, engineering, and the environmental and health sciences.