Spatial Predictive Modeling with R


Book Description

Spatial predictive modeling (SPM) is an emerging discipline in applied sciences, playing a key role in the generation of spatial predictions in various disciplines. SPM refers to preparing relevant data, developing optimal predictive models based on point data, and then generating spatial predictions. This book aims to systematically introduce the entire process of SPM as a discipline. The process comprises data acquisition, spatial predictive methods and variable selection, parameter optimization, accuracy assessment, and the generation and visualization of spatial predictions, where the spatial predictive methods are drawn from geostatistics, modern statistics, and machine learning.
The key features of this book are:
• Systematic introduction of the major components of the SPM process.
• Novel hybrid methods (228 hybrids plus numerous variants) of modern statistical methods or machine learning methods with mathematical and/or univariate geostatistical methods.
• Novel predictive accuracy-based variable selection techniques for spatial predictive methods.
• Predictive accuracy-based parameter/model optimization.
• Reproducible examples for SPM of various data types in R.
This book provides guidelines, recommendations, and reproducible examples for developing optimal predictive models by considering various components and associated factors for quality-improved spatial predictions. It provides valuable tools for researchers, modelers, and university students not only in the SPM field but also in other predictive modeling fields.
Dr Li has produced over 100 publications in spatial predictive modelling, statistical computing, ecological and environmental modelling, and ecology, developed a number of hybrid methods for SPM, and published four R packages for variable selection as well as SPM.




Spatial Modeling in GIS and R for Earth and Environmental Sciences


Book Description

Spatial Modeling in GIS and R for Earth and Environmental Sciences offers an integrated approach to spatial modelling using both GIS and R. Given the importance of Geographical Information Systems and geostatistics across a variety of applications in Earth and Environmental Science, a clear link between GIS and open source software is essential for studying spatial objects or phenomena that occur in the real world and for facilitating problem-solving. Organized into clear sections on applications and using case studies, the book helps researchers to understand GIS data more quickly and formulate more complex conclusions. The book is the first reference to provide methods and applications for combining the use of R and GIS in modeling spatial processes. It is an essential tool for students and researchers in earth and environmental science, especially those looking to better utilize GIS and spatial modeling.
- Offers a clear, interdisciplinary guide to serve researchers in a variety of fields, including hazards, land surveying, remote sensing, cartography, geophysics, geology, natural resources, environment and geography
- Provides an overview, methods and case studies for each application
- Expresses concepts and methods at an appropriate level for both students and new users to learn by example




Geocomputation with R


Book Description

Geocomputation with R is for people who want to analyze, visualize and model geographic data with open source software. It is based on R, a statistical programming language that has powerful data processing, visualization, and geospatial capabilities. The book equips you with the knowledge and skills to tackle a wide range of issues manifested in geographic data, including those with scientific, societal, and environmental implications. This book will interest people from many backgrounds, especially Geographic Information Systems (GIS) users interested in applying their domain-specific knowledge in a powerful open source language for data science, and R users interested in extending their skills to handle spatial data. The book is divided into three parts: (I) Foundations, aimed at getting you up-to-speed with geographic data in R, (II) Extensions, which covers advanced techniques, and (III) Applications to real-world problems. The chapters cover progressively more advanced topics, with early chapters providing strong foundations on which the later chapters build. Part I describes the nature of spatial datasets in R and methods for manipulating them. It also covers geographic data import/export and transforming coordinate reference systems. Part II represents methods that build on these foundations. It covers advanced map making (including web mapping), "bridges" to GIS, sharing reproducible code, and how to do cross-validation in the presence of spatial autocorrelation. Part III applies the knowledge gained to tackle real-world problems, including representing and modeling transport systems, finding optimal locations for stores or services, and ecological modeling. Exercises at the end of each chapter give you the skills needed to tackle a range of geospatial problems. Solutions for each chapter and supplementary materials providing extended examples are available at https://geocompr.github.io/geocompkg/articles/.
Dr. Robin Lovelace is a University Academic Fellow at the University of Leeds, where he has taught R for geographic research over many years, with a focus on transport systems. Dr. Jakub Nowosad is an Assistant Professor in the Department of Geoinformation at the Adam Mickiewicz University in Poznan, where his focus is on the analysis of large datasets to understand environmental processes. Dr. Jannes Muenchow is a Postdoctoral Researcher in the GIScience Department at the University of Jena, where he develops and teaches a range of geographic methods, with a focus on ecological modeling, statistical geocomputing, and predictive mapping. All three are active developers and work on a number of R packages, including stplanr, sabre, and RQGIS.




Applied Predictive Modeling


Book Description

Applied Predictive Modeling covers the overall predictive modeling process, beginning with the crucial steps of data preprocessing, data splitting and foundations of model tuning. The text then provides intuitive explanations of numerous common and modern regression and classification techniques, always with an emphasis on illustrating and solving real data problems. The text illustrates all parts of the modeling process through many hands-on, real-life examples, and every chapter contains extensive R code for each step of the process. This multi-purpose text can be used as an introduction to predictive models and the overall modeling process, as a practitioner’s reference handbook, or as a text for advanced undergraduate or graduate level predictive modeling courses. To that end, each chapter contains problem sets to help solidify the covered concepts and uses data available in the book’s R package. This text is intended for a broad audience, both as an introduction to predictive models and as a guide to applying them. Non-mathematical readers will appreciate the intuitive explanations of the techniques, while an emphasis on problem-solving with real data across a wide variety of applications will aid practitioners who wish to extend their expertise. Readers should have knowledge of basic statistical ideas, such as correlation and linear regression analysis. While the text is biased against complex equations, a mathematical background is needed for advanced topics.




Predictive Soil Mapping with R


Book Description

Predictive Soil Mapping (PSM) is based on applying statistical and/or machine learning techniques to fit models for the purpose of producing spatial and/or spatiotemporal predictions of soil variables, i.e. maps of soil properties and classes at different resolutions. It is a multidisciplinary field combining statistics, data science, soil science, physical geography, remote sensing, geoinformation science and a number of other sciences. Predictive Soil Mapping with R is about understanding the main concepts behind soil mapping, mastering R packages that can be used to produce high quality soil maps, and optimizing all processes involved so that production costs can also be reduced. The online version of the book is available at: https://envirometrix.github.io/PredictiveSoilMapping/ Pull requests and general comments are welcome. These materials are based on technical tutorials initially developed by ISRIC's Global Soil Information Facilities (GSIF) development team over the period 2014-2017.




Modern Data Science with R


Book Description

From a review of the first edition: "Modern Data Science with R... is rich with examples and is guided by a strong narrative voice. What’s more, it presents an organizing framework that makes a convincing argument that data science is a course distinct from applied statistics" (The American Statistician). Modern Data Science with R is a comprehensive data science textbook for undergraduates that incorporates statistical and computational thinking to solve real-world data problems. Rather than focus exclusively on case studies or programming syntax, this book illustrates how statistical programming in the state-of-the-art R/RStudio computing environment can be leveraged to extract meaningful information from a variety of data in the service of addressing compelling questions. The second edition is updated to reflect the growing influence of the tidyverse set of packages. All code in the book has been revised and styled to be more readable and easier to understand. New functionality from packages like sf, purrr, tidymodels, and tidytext is now integrated into the text. All chapters have been revised, and several have been split, re-organized, or re-imagined to meet the shifting landscape of best practice.




Modern Statistics with R


Book Description

The past decades have transformed the world of statistical data analysis, with new methods, new types of data, and new computational tools. Modern Statistics with R introduces you to key parts of this modern statistical toolkit. It teaches you:
• Data wrangling - importing, formatting, reshaping, merging, and filtering data in R.
• Exploratory data analysis - using visualisations and multivariate techniques to explore datasets.
• Statistical inference - modern methods for testing hypotheses and computing confidence intervals.
• Predictive modelling - regression models and machine learning methods for prediction, classification, and forecasting.
• Simulation - using simulation techniques for sample size computations and evaluations of statistical methods.
• Ethics in statistics - ethical issues and good statistical practice.
• R programming - writing code that is fast, readable, and (hopefully!) free from bugs.
No prior programming experience is necessary. Clear explanations and examples are provided to accommodate readers at all levels of familiarity with statistical principles and coding practices. A basic understanding of probability theory can enhance comprehension of certain concepts discussed within this book. In addition to plenty of examples, the book includes more than 200 exercises, with fully worked solutions available at: www.modernstatisticswithr.com.




Applied Spatial Data Analysis with R


Book Description

Applied Spatial Data Analysis with R, second edition, is divided into two basic parts, the first presenting R packages, functions, classes and methods for handling spatial data. This part is of interest to users who need to access and visualise spatial data. Data import and export for many file formats for spatial data are covered in detail, as is the interface between R and the open source GRASS GIS and the handling of spatio-temporal data. The second part showcases more specialised kinds of spatial data analysis, including spatial point pattern analysis, interpolation and geostatistics, areal data analysis and disease mapping. The coverage of methods of spatial data analysis ranges from standard techniques to new developments, and the examples used are largely taken from the spatial statistics literature. All the examples can be run using R contributed packages available from the CRAN website, with code and additional data sets from the book's own website. Compared to the first edition, the second edition covers the more systematic approach towards handling spatial data in R, as well as a number of important and widely used CRAN packages that have appeared since the first edition. This book will be of interest to researchers who intend to use R to handle, visualise, and analyse spatial data. It will also be of interest to spatial data analysts who do not use R, but who are interested in practical aspects of implementing software for spatial data analysis. It is a suitable companion book for introductory spatial statistics courses and for applied methods courses in a wide range of subjects using spatial data, including human and physical geography, geographical information science and geoinformatics, the environmental sciences, ecology, public health and disease control, economics, public administration and political science. The book has a website where complete code examples, data sets, and other support material may be found: http://www.asdar-book.org. 
The authors have taken part in writing and maintaining software for spatial data handling and analysis with R in concert since 2003.




Modeling Techniques in Predictive Analytics


Book Description

Now fully updated, this uniquely accessible book will help you use predictive analytics to solve real business problems and drive real competitive advantage. If you're new to the discipline, it will give you the strong foundation you need to get accurate, actionable results. If you're already a modeler, programmer, or manager, it will teach you crucial skills you don't yet have. This guide illuminates the discipline through realistic vignettes and intuitive data visualizations, not complex math. Thomas W. Miller, leader of Northwestern University's pioneering program in predictive analytics, guides you through defining problems, identifying data, crafting and optimizing models, writing effective R code, interpreting results, and more. Every chapter focuses on one of today's key applications for predictive analytics, delivering skills and knowledge to put models to work and maximize their value. Reflecting extensive student and instructor feedback, this edition adds five classroom-tested case studies, updates all code for new versions of R, explains code behavior more clearly and completely, and covers modern data science methods even more effectively.