Data Science


Book Description

The amount of new information is constantly increasing, faster than our ability to fully interpret and utilize it to improve human experiences. Addressing this asymmetry requires novel and revolutionary scientific methods and effective human and artificial intelligence interfaces. By lifting the concept of time from a positive real number to a 2D complex time (kime), this book uncovers a connection between artificial intelligence (AI), data science, and quantum mechanics. It proposes a new mathematical foundation for data science based on raising the 4D spacetime to a higher dimension where longitudinal data (e.g., time-series) are represented as manifolds (e.g., kime-surfaces). This new framework enables the development of innovative data science analytical methods for model-based and model-free scientific inference, derived computed phenotyping, and statistical forecasting. The book provides a transdisciplinary bridge and a pragmatic mechanism to translate quantum mechanical principles, such as particles and wavefunctions, into data science concepts, such as datum and inference-functions. It includes many open mathematical problems that still need to be solved, technological challenges that need to be tackled, and computational statistics algorithms that have to be fully developed and validated. Spacekime analytics provide mechanisms to effectively handle, process, and interpret large, heterogeneous, and continuously-tracked digital information from multiple sources. The authors propose computational methods, probability model-based techniques, and analytical strategies to estimate, approximate, or simulate the complex time phases (kime directions). This allows transforming time-varying data, such as time-series observations, into higher-dimensional manifolds representing complex-valued and kime-indexed surfaces (kime-surfaces). The book includes many illustrations of model-based and model-free spacekime analytic techniques applied to economic forecasting, identification of functional brain activation, and high-dimensional cohort phenotyping. Specific case-study examples include unsupervised clustering using the Michigan Consumer Sentiment Index (MCSI), model-based inference using functional magnetic resonance imaging (fMRI) data, and model-free inference using the UK Biobank data archive. The material includes mathematical, inferential, computational, and philosophical topics such as Heisenberg uncertainty principle and alternative approaches to large sample theory, where a few spacetime observations can be amplified by a series of derived, estimated, or simulated kime-phases. The authors extend Newton-Leibniz calculus of integration and differentiation to the spacekime manifold and discuss possible solutions to some of the "problems of time". The coverage also includes 5D spacekime formulations of classical 4D spacetime mathematical equations describing natural laws of physics, as well as, statistical articulation of spacekime analytics in a Bayesian inference framework. The steady increase of the volume and complexity of observed and recorded digital information drives the urgent need to develop novel data analytical strategies. Spacekime analytics represents one new data-analytic approach, which provides a mechanism to understand compound phenomena that are observed as multiplex longitudinal processes and computationally tracked by proxy measures. This book may be of interest to academic scholars, graduate students, postdoctoral fellows, artificial intelligence and machine learning engineers, biostatisticians, econometricians, and data analysts. Some of the material may also resonate with philosophers, futurists, astrophysicists, space industry technicians, biomedical researchers, health practitioners, and the general public.




Statistics for Analytical Chemistry


Book Description




Statistics and the Evaluation of Evidence for Forensic Scientists


Book Description

The first edition of Statistics and the Evaluation of Evidence for Forensic Scientists established itself as a highly regarded authority on this area. Fully revised and updated, the second edition provides significant new material on areas of current interest including: Glass Interpretation Fibres Interpretation Bayes’ Nets The title presents comprehensive coverage of the statistical evaluation of forensic evidence. It is written with the assumption of a modest mathematical background and is illustrated throughout with up-to-date examples from a forensic science background. The clarity of exposition makes this book ideal for all forensic scientists, lawyers and other professionals in related fields interested in the quantitative assessment and evaluation of evidence. 'There can be no doubt that the appreciation of some evidence in a court of law has been greatly enhanced by the sound use of statistical ideas and one can be confident that the next decade will see further developments, during which time this book will admirably serve those who have cause to use statistics in forensic science.' D.V. Lindley




Artificial Intelligence in Drug Discovery


Book Description

Following significant advances in deep learning and related areas interest in artificial intelligence (AI) has rapidly grown. In particular, the application of AI in drug discovery provides an opportunity to tackle challenges that previously have been difficult to solve, such as predicting properties, designing molecules and optimising synthetic routes. Artificial Intelligence in Drug Discovery aims to introduce the reader to AI and machine learning tools and techniques, and to outline specific challenges including designing new molecular structures, synthesis planning and simulation. Providing a wealth of information from leading experts in the field this book is ideal for students, postgraduates and established researchers in both industry and academia.




Data Analysis


Book Description

One of the strengths of this book is the author's ability to motivate the use of Bayesian methods through simple yet effective examples. - Katie St. Clair MAA Reviews.




Introduction to Bayesian Statistics


Book Description

"...this edition is useful and effective in teaching Bayesian inference at both elementary and intermediate levels. It is a well-written book on elementary Bayesian inference, and the material is easily accessible. It is both concise and timely, and provides a good collection of overviews and reviews of important tools used in Bayesian statistical methods." There is a strong upsurge in the use of Bayesian methods in applied statistical analysis, yet most introductory statistics texts only present frequentist methods. Bayesian statistics has many important advantages that students should learn about if they are going into fields where statistics will be used. In this third Edition, four newly-added chapters address topics that reflect the rapid advances in the field of Bayesian statistics. The authors continue to provide a Bayesian treatment of introductory statistical topics, such as scientific data gathering, discrete random variables, robust Bayesian methods, and Bayesian approaches to inference for discrete random variables, binomial proportions, Poisson, and normal means, and simple linear regression. In addition, more advanced topics in the field are presented in four new chapters: Bayesian inference for a normal with unknown mean and variance; Bayesian inference for a Multivariate Normal mean vector; Bayesian inference for the Multiple Linear Regression Model; and Computational Bayesian Statistics including Markov Chain Monte Carlo. The inclusion of these topics will facilitate readers' ability to advance from a minimal understanding of Statistics to the ability to tackle topics in more applied, advanced level books. Minitab macros and R functions are available on the book's related website to assist with chapter exercises. Introduction to Bayesian Statistics, Third Edition also features: Topics including the Joint Likelihood function and inference using independent Jeffreys priors and join conjugate prior The cutting-edge topic of computational Bayesian Statistics in a new chapter, with a unique focus on Markov Chain Monte Carlo methods Exercises throughout the book that have been updated to reflect new applications and the latest software applications Detailed appendices that guide readers through the use of R and Minitab software for Bayesian analysis and Monte Carlo simulations, with all related macros available on the book's website Introduction to Bayesian Statistics, Third Edition is a textbook for upper-undergraduate or first-year graduate level courses on introductory statistics course with a Bayesian emphasis. It can also be used as a reference work for statisticians who require a working knowledge of Bayesian statistics.




Comprehensive Foodomics


Book Description

Comprehensive Foodomics, Three Volume Set offers a definitive collection of over 150 articles that provide researchers with innovative answers to crucial questions relating to food quality, safety and its vital and complex links to our health. Topics covered include transcriptomics, proteomics, metabolomics, genomics, green foodomics, epigenetics and noncoding RNA, food safety, food bioactivity and health, food quality and traceability, data treatment and systems biology. Logically structured into 10 focused sections, each article is authored by world leading scientists who cover the whole breadth of Omics and related technologies, including the latest advances and applications. By bringing all this information together in an easily navigable reference, food scientists and nutritionists in both academia and industry will find it the perfect, modern day compendium for frequent reference. List of sections and Section Editors: Genomics - Olivia McAuliffe, Dept of Food Biosciences, Moorepark, Fermoy, Co. Cork, Ireland Epigenetics & Noncoding RNA - Juan Cui, Department of Computer Science & Engineering, University of Nebraska-Lincoln, Lincoln, NE Transcriptomics - Robert Henry, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Australia Proteomics - Jens Brockmeyer, Institute of Biochemistry and Technical Biochemistry, University Stuttgart, Germany Metabolomics - Philippe Schmitt-Kopplin, Research Unit Analytical BioGeoChemistry, Neuherberg, Germany Omics data treatment, System Biology and Foodomics - Carlos Leon Canseco, Visiting Professor, Biomedical Engineering, Universidad Carlos III de Madrid Green Foodomics - Elena Ibanez, Foodomics Lab, CIAL, CSIC, Madrid, Spain Food safety and Foodomics - Djuro Josic, Professor Medicine (Research) Warren Alpert Medical School, Brown University, Providence, RI, USA & Sandra Kraljevic Pavelic, University of Rijeka, Department of Biotechnology, Rijeka, Croatia Food Quality, Traceability and Foodomics - Daniel Cozzolino, Centre for Nutrition and Food Sciences, The University of Queensland, Queensland, Australia Food Bioactivity, Health and Foodomics - Miguel Herrero, Department of Bioactivity and Food Analysis, Foodomics Lab, CIAL, CSIC, Madrid, Spain Brings all relevant foodomics information together in one place, offering readers a ‘one-stop,’ comprehensive resource for access to a wealth of information Includes articles written by academics and practitioners from various fields and regions Provides an ideal resource for students, researchers and professionals who need to find relevant information quickly and easily Includes content from high quality authors from across the globe




Horizons in Materials


Book Description

The Frontiers in Materials Editorial Office team are delighted to present the “Horizons in Materials” article collection, showcasing high-impact, authoritative, and accessible Review articles covering important topics at the forefront of the materials science and engineering field. All contributing authors were nominated by the Chief Editors and Editorial Office in recognition of their prominence and influence in their respective fields. The cutting-edge work presented in this article collection highlights the diversity of research performed across the entire breadth of the materials science and engineering field and reflects on the latest advances in theory, experiment, and methodology with applications to compelling problems. This Editorial features the corresponding author(s) of each paper published within this important collection, ordered by section alphabetically, highlighting them as the great researchers of the future. The Frontiers in Materials Chief Editors and Editorial Office team would like to thank each researcher who contributed their work to this collection. We are excited to see each article gain the deserved visibility and traction within the wider community, ensuring the collection’s truly global impact and success. Emily Young Journal Manager




Data Mining and Predictive Analytics


Book Description

Learn methods of data analysis and their application to real-world data sets This updated second edition serves as an introduction to data mining methods and models, including association rules, clustering, neural networks, logistic regression, and multivariate analysis. The authors apply a unified “white box” approach to data mining methods and models. This approach is designed to walk readers through the operations and nuances of the various methods, using small data sets, so readers can gain an insight into the inner workings of the method under review. Chapters provide readers with hands-on analysis problems, representing an opportunity for readers to apply their newly-acquired data mining expertise to solving real problems using large, real-world data sets. Data Mining and Predictive Analytics: Offers comprehensive coverage of association rules, clustering, neural networks, logistic regression, multivariate analysis, and R statistical programming language Features over 750 chapter exercises, allowing readers to assess their understanding of the new material Provides a detailed case study that brings together the lessons learned in the book Includes access to the companion website, www.dataminingconsultant, with exclusive password-protected instructor content Data Mining and Predictive Analytics will appeal to computer science and statistic students, as well as students in MBA programs, and chief executives.




Graph Representation Learning


Book Description

Graph-structured data is ubiquitous throughout the natural and social sciences, from telecommunication networks to quantum chemistry. Building relational inductive biases into deep learning architectures is crucial for creating systems that can learn, reason, and generalize from this kind of data. Recent years have seen a surge in research on graph representation learning, including techniques for deep graph embeddings, generalizations of convolutional neural networks to graph-structured data, and neural message-passing approaches inspired by belief propagation. These advances in graph representation learning have led to new state-of-the-art results in numerous domains, including chemical synthesis, 3D vision, recommender systems, question answering, and social network analysis. This book provides a synthesis and overview of graph representation learning. It begins with a discussion of the goals of graph representation learning as well as key methodological foundations in graph theory and network analysis. Following this, the book introduces and reviews methods for learning node embeddings, including random-walk-based methods and applications to knowledge graphs. It then provides a technical synthesis and introduction to the highly successful graph neural network (GNN) formalism, which has become a dominant and fast-growing paradigm for deep learning with graph data. The book concludes with a synthesis of recent advancements in deep generative models for graphs—a nascent but quickly growing subset of graph representation learning.