The Random Projection Method


Book Description

Random projection is a simple geometric technique for reducing the dimensionality of a set of points in Euclidean space while preserving pairwise distances approximately. The technique plays a key role in several breakthrough developments in the field of algorithms. In other cases, it provides elegant alternative proofs. The book begins with an elementary description of the technique and its basic properties. Then it develops the method in the context of applications, which are divided into three groups. The first group consists of combinatorial optimization problems such as maxcut, graph coloring, minimum multicut, graph bandwidth and VLSI layout. Presented in this context is the theory of Euclidean embeddings of graphs. The next group is machine learning problems, specifically, learning intersections of halfspaces and learning large margin hypotheses. The projection method is further refined for the latter application. The last set consists of problems inspired by information retrieval, namely, nearest neighbor search, geometric clustering and efficient low-rank approximation. Motivated by the first two applications, an extension of random projection to the hypercube is developed here. Throughout the book, random projection is used as a way to understand, simplify and connect progress on these important and seemingly unrelated problems. The book is suitable for graduate students and research mathematicians interested in computational geometry.




The Practice of Entrepreneurship


Book Description

Intended to help individuals in self development for business ownership, this volume presents personal characteristics, planning and control and the variety and use of resources for the entrepreneur. Includes numerous checklists, formula and graphic analytical devices and practical techniques.




The Random Projection Method


Book Description

Random projection is a simple geometric technique for reducing the dimensionality of a set of points in Euclidean space while preserving pairwise distances approximately. The technique plays a key role in several breakthrough developments in the field of algorithms. In other cases, it provides elegant alternative proofs. The book begins with an elementary description of the technique and its basic properties. Then it develops the method in the context of applications, which are divided into three groups. The first group consists of combinatorial optimization problems such as maxcut, graph colo.




Subspace, Latent Structure and Feature Selection


Book Description

Many of the papers in this proceedings volume were presented at the PASCAL Workshop entitled Subspace, Latent Structure and Feature Selection Techniques: Statistical and Optimization Perspectives which took place in Bohinj, Slovenia during February, 23–25 2005.




The Essentials of Machine Learning in Finance and Accounting


Book Description

This book introduces machine learning in finance and illustrates how we can use computational tools in numerical finance in real-world context. These computational techniques are particularly useful in financial risk management, corporate bankruptcy prediction, stock price prediction, and portfolio management. The book also offers practical and managerial implications of financial and managerial decision support systems and how these systems capture vast amount of financial data. Business risk and uncertainty are two of the toughest challenges in the financial industry. This book will be a useful guide to the use of machine learning in forecasting, modeling, trading, risk management, economics, credit risk, and portfolio management.




Trends and Applications in Knowledge Discovery and Data Mining


Book Description

This book constitutes the thoroughly refereed post-workshop proceedings of the workshops that were held in conjunction with the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020, in Singapore, Singapore, in May 2020. The 17 revised full papers presented were carefully reviewed and selected from a total of 50 submissions. The five workshops were as follows: · First International Workshop on Literature-Based Discovery (LBD 2020) · Workshop on Data Science for Fake News (DSFN 2020) · Learning Data Representation for Clustering (LDRC 2020) · Ninth Workshop on Biologically Inspired Techniques for Data Mining (BDM · 2020) · First Pacific Asia Workshop on Game Intelligence & Informatics (GII 2020)




High-Dimensional Probability


Book Description

An integrated package of powerful probabilistic tools and key applications in modern mathematical data science.




Big Data and Security


Book Description

This book constitutes the refereed proceedings of the Second International Conference on Big Data and Security, ICBDS 2020, held in Singapore, Singapore, in December 2020. The 44 revised full papers and 8 short papers were carefully reviewed and selected out of 153 submissions. The papers included in this book are organized according to the topical sections on cybersecurity and privacy, big data, blockchain and internet of things, and artificial intelligence/ machine learning security.




Similarity Search and Applications


Book Description

This book constitutes the refereed proceedings of the 14th International Conference on Similarity Search and Applications, SISAP 2021, held in Dortmund, Germany, in September/October 2021. The conference was held virtually due to the COVID-19 pandemic.The 23 full papers presented together with 5 short and 3 doctoral symposium papers were carefully reviewed and selected from 50 submissions. The papers are organized in the topical sections named: ​Similarity Search and Retrieval; Intrinsic Dimensionality; Clustering and Classification; Applications of Similarity Search; Similarity Search in Graph-Structured Data; Doctoral Symposium.




Foundations of Data Science


Book Description

This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks. Topics include the counterintuitive nature of data in high dimensions, important linear algebraic techniques such as singular value decomposition, the theory of random walks and Markov chains, the fundamentals of and important algorithms for machine learning, algorithms and analysis for clustering, probabilistic models for large networks, representation learning including topic modelling and non-negative matrix factorization, wavelets and compressed sensing. Important probabilistic techniques are developed including the law of large numbers, tail inequalities, analysis of random projections, generalization guarantees in machine learning, and moment methods for analysis of phase transitions in large random graphs. Additionally, important structural and complexity measures are discussed such as matrix norms and VC-dimension. This book is suitable for both undergraduate and graduate courses in the design and analysis of algorithms for data.