Statistics for High-Dimensional Data


Book Description

Modern statistics deals with large and complex data sets, and consequently with models containing a large number of parameters. This book presents a detailed account of recently developed approaches, including the Lasso and versions of it for various models, boosting methods, undirected graphical modeling, and procedures controlling false positive selections. A special characteristic of the book is that it contains comprehensive mathematical theory on high-dimensional statistics combined with methodology, algorithms and illustrations with real data examples. This in-depth approach highlights the methods’ great potential and practical applicability in a variety of settings. As such, it is a valuable resource for researchers, graduate students and experts in statistics, applied mathematics and computer science.




Statistical Analysis for High-Dimensional Data


Book Description

This book features research contributions from The Abel Symposium on Statistical Analysis for High Dimensional Data, held in Nyvågar, Lofoten, Norway, in May 2014. The focus of the symposium was on statistical and machine learning methodologies specifically developed for inference in “big data” situations, with particular reference to genomic applications. The contributors, who are among the most prominent researchers on the theory of statistics for high dimensional inference, present new theories and methods, as well as challenging applications and computational solutions. Specific themes include, among others, variable selection and screening, penalised regression, sparsity, thresholding, low dimensional structures, computational challenges, non-convex situations, learning graphical models, sparse covariance and precision matrices, semi- and non-parametric formulations, multiple testing, classification, factor models, clustering, and preselection. Highlighting cutting-edge research and casting light on future research directions, the contributions will benefit graduate students and researchers in computational biology, statistics and the machine learning community.




Applied Biclustering Methods for Big and High-Dimensional Data Using R


Book Description

Proven Methods for Big Data Analysis

As big data has become standard in many application areas, challenges have arisen related to methodology and software development, including how to discover meaningful patterns in the vast amounts of data. Addressing these problems, Applied Biclustering Methods for Big and High-Dimensional Data Using R shows how to apply biclustering methods to find local patterns in a big data matrix. The book presents an overview of data analysis using biclustering methods from a practical point of view. Real case studies in drug discovery, genetics, marketing research, biology, toxicity, and sports illustrate the use of several biclustering methods. References to technical details of the methods are provided for readers who wish to investigate the full theoretical background. All the methods are accompanied by R examples that show how to conduct the analyses. The examples, software, and other materials are available on a supplementary website.




Uncertainty Quantification Techniques in Statistics


Book Description

Uncertainty quantification (UQ) is a mainstream research topic in applied mathematics and statistics. To identify UQ problems, diverse modern techniques for large and complex data analyses have been developed in applied mathematics, computer science, and statistics. This Special Issue of Mathematics (ISSN 2227-7390) includes diverse modern data analysis methods such as skew-reflected-Gompertz information quantifiers with application to sea surface temperature records, the performance of variable selection and classification via a rank-based classifier, two-stage classification with SIS using a new filter ranking method in high throughput data, an estimation of sensitive attribute applying geometric distribution under probability proportional to size sampling, combination of ensembles of regularized regression models with resampling-based lasso feature selection in high dimensional data, robust linear trend test for low-coverage next-generation sequence data controlling for covariates, and comparing groups of decision-making units in efficiency based on semiparametric regression.




Modern Statistical Methods for Health Research


Book Description

This book brings together the voices of leading experts at the frontiers of biostatistics, biomedicine, and the health sciences to discuss statistical procedures, useful methods, and novel applications in biostatistics research. It also includes discussions of potential future directions of biomedicine and new statistical developments for health research, with the intent of stimulating research and fostering interaction among scholars across health-research-related disciplines. Topics covered include health data analysis and applications to EHR data; clinical trials, FDR, and applications in health science; big network analytics and its applications in GWAS; survival analysis and functional data analysis; and graphical modelling in genomic studies. The book will be valuable to data scientists and statisticians who are working in biomedicine and health, other practitioners in the health sciences, and graduate students and researchers in biostatistics and health.







Multiple Classifier Systems


Book Description

This book constitutes the refereed proceedings of the 12th International Workshop on Multiple Classifier Systems, MCS 2015, held in Günzburg, Germany, in June/July 2015. The 19 revised papers presented were carefully reviewed and selected from 25 submissions. The papers address issues in multiple classifier systems and ensemble methods, including pattern recognition, machine learning, neural networks, data mining and statistics. They are organized in two topical sections: theory and algorithms, and application and evaluation.




Artificial Intelligence on Medical Data


Book Description

This book includes high-quality papers presented at the Second International Symposium on Computer Vision and Machine Intelligence in Medical Image Analysis (ISCMM 2021), held on 11–12 November 2021, organized by the Computer Applications Department, SMIT, in collaboration with the Department of Pathology, SMIMS, Sikkim, India, and funded by the Indian Council of Medical Research. It discusses common research problems and challenges in medical image analysis, including those that arise with deep learning methods. It also discusses how these methods can be applied to a broad range of application areas, including lung and chest x-ray, breast CAD, microscopy and pathology. The studies included focus mainly on the detection of events from biomedical signals.




Monte-Carlo Simulation-Based Statistical Modeling


Book Description

This book brings together expert researchers engaged in Monte-Carlo simulation-based statistical modeling, offering them a forum to present and discuss recent issues in methodological development as well as public health applications. It is divided into three parts, with the first providing an overview of Monte-Carlo techniques, the second focusing on missing data Monte-Carlo methods, and the third addressing Bayesian and general statistical modeling using Monte-Carlo simulations. The data and computer programs used here will also be made publicly available, allowing readers to replicate the model development and data analysis presented in each chapter, and to readily apply them in their own research. Featuring highly topical content, the book has the potential to impact model development and data analyses across a wide spectrum of fields, and to spark further research in this direction.




Machine Learning, Optimization, and Data Science


Book Description

This two-volume set, LNCS 13163-13164, constitutes the refereed proceedings of the 7th International Conference on Machine Learning, Optimization, and Data Science, LOD 2021, together with the first edition of the Symposium on Artificial Intelligence and Neuroscience, ACAIN 2021. The 86 full papers presented in this two-volume post-conference proceedings set were carefully reviewed and selected from 215 submissions. These research articles were written by leading scientists in the fields of machine learning, artificial intelligence, reinforcement learning, computational optimization, neuroscience, and data science, and present a substantial array of ideas, technologies, algorithms, methods, and applications.