The Random Projection Method


Book Description

Random projection is a simple geometric technique for reducing the dimensionality of a set of points in Euclidean space while preserving pairwise distances approximately. The technique plays a key role in several breakthrough developments in the field of algorithms. In other cases, it provides elegant alternative proofs. The book begins with an elementary description of the technique and its basic properties. Then it develops the method in the context of applications, which are divided into three groups. The first group consists of combinatorial optimization problems such as maxcut, graph coloring, minimum multicut, graph bandwidth and VLSI layout. Presented in this context is the theory of Euclidean embeddings of graphs. The next group is machine learning problems, specifically, learning intersections of halfspaces and learning large margin hypotheses. The projection method is further refined for the latter application. The last set consists of problems inspired by information retrieval, namely, nearest neighbor search, geometric clustering and efficient low-rank approximation. Motivated by the first two applications, an extension of random projection to the hypercube is developed here. Throughout the book, random projection is used as a way to understand, simplify and connect progress on these important and seemingly unrelated problems. The book is suitable for graduate students and research mathematicians interested in computational geometry.




The Practice of Entrepreneurship


Book Description

Intended to help individuals in self development for business ownership, this volume presents personal characteristics, planning and control and the variety and use of resources for the entrepreneur. Includes numerous checklists, formula and graphic analytical devices and practical techniques.




The Essentials of Machine Learning in Finance and Accounting


Book Description

This book introduces machine learning in finance and illustrates how we can use computational tools in numerical finance in real-world context. These computational techniques are particularly useful in financial risk management, corporate bankruptcy prediction, stock price prediction, and portfolio management. The book also offers practical and managerial implications of financial and managerial decision support systems and how these systems capture vast amount of financial data. Business risk and uncertainty are two of the toughest challenges in the financial industry. This book will be a useful guide to the use of machine learning in forecasting, modeling, trading, risk management, economics, credit risk, and portfolio management.




Foundations of Data Science


Book Description

This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks. Topics include the counterintuitive nature of data in high dimensions, important linear algebraic techniques such as singular value decomposition, the theory of random walks and Markov chains, the fundamentals of and important algorithms for machine learning, algorithms and analysis for clustering, probabilistic models for large networks, representation learning including topic modelling and non-negative matrix factorization, wavelets and compressed sensing. Important probabilistic techniques are developed including the law of large numbers, tail inequalities, analysis of random projections, generalization guarantees in machine learning, and moment methods for analysis of phase transitions in large random graphs. Additionally, important structural and complexity measures are discussed such as matrix norms and VC-dimension. This book is suitable for both undergraduate and graduate courses in the design and analysis of algorithms for data.




Similarity Search and Applications


Book Description

This book constitutes the refereed proceedings of the 14th International Conference on Similarity Search and Applications, SISAP 2021, held in Dortmund, Germany, in September/October 2021. The conference was held virtually due to the COVID-19 pandemic.The 23 full papers presented together with 5 short and 3 doctoral symposium papers were carefully reviewed and selected from 50 submissions. The papers are organized in the topical sections named: ​Similarity Search and Retrieval; Intrinsic Dimensionality; Clustering and Classification; Applications of Similarity Search; Similarity Search in Graph-Structured Data; Doctoral Symposium.




A Dynamical Approach to Random Matrix Theory


Book Description

A co-publication of the AMS and the Courant Institute of Mathematical Sciences at New York University This book is a concise and self-contained introduction of recent techniques to prove local spectral universality for large random matrices. Random matrix theory is a fast expanding research area, and this book mainly focuses on the methods that the authors participated in developing over the past few years. Many other interesting topics are not included, and neither are several new developments within the framework of these methods. The authors have chosen instead to present key concepts that they believe are the core of these methods and should be relevant for future applications. They keep technicalities to a minimum to make the book accessible to graduate students. With this in mind, they include in this book the basic notions and tools for high-dimensional analysis, such as large deviation, entropy, Dirichlet form, and the logarithmic Sobolev inequality. This manuscript has been developed and continuously improved over the last five years. The authors have taught this material in several regular graduate courses at Harvard, Munich, and Vienna, in addition to various summer schools and short courses. Titles in this series are co-published with the Courant Institute of Mathematical Sciences at New York University.




Trends and Applications in Knowledge Discovery and Data Mining


Book Description

This book constitutes the thoroughly refereed post-workshop proceedings of the workshops that were held in conjunction with the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020, in Singapore, Singapore, in May 2020. The 17 revised full papers presented were carefully reviewed and selected from a total of 50 submissions. The five workshops were as follows: · First International Workshop on Literature-Based Discovery (LBD 2020) · Workshop on Data Science for Fake News (DSFN 2020) · Learning Data Representation for Clustering (LDRC 2020) · Ninth Workshop on Biologically Inspired Techniques for Data Mining (BDM · 2020) · First Pacific Asia Workshop on Game Intelligence & Informatics (GII 2020)




High-Dimensional Probability


Book Description

An integrated package of powerful probabilistic tools and key applications in modern mathematical data science.




Spectral Algorithms


Book Description

Spectral methods refer to the use of eigenvalues, eigenvectors, singular values and singular vectors. They are widely used in Engineering, Applied Mathematics and Statistics. More recently, spectral methods have found numerous applications in Computer Science to "discrete" as well as "continuous" problems. Spectral Algorithms describes modern applications of spectral methods, and novel algorithms for estimating spectral parameters. The first part of the book presents applications of spectral methods to problems from a variety of topics including combinatorial optimization, learning and clustering. The second part of the book is motivated by efficiency considerations. A feature of many modern applications is the massive amount of input data. While sophisticated algorithms for matrix computations have been developed over a century, a more recent development is algorithms based on "sampling on the fly" from massive matrices. Good estimates of singular values and low rank approximations of the whole matrix can be provably derived from a sample. The main emphasis in the second part of the book is to present these sampling methods with rigorous error bounds. It also presents recent extensions of spectral methods from matrices to tensors and their applications to some combinatorial optimization problems.




Proceedings of ELM 2018


Book Description

This book contains some selected papers from the International Conference on Extreme Learning Machine 2018, which was held in Singapore, November 21–23, 2018. This conference provided a forum for academics, researchers and engineers to share and exchange R&D experience on both theoretical studies and practical applications of the ELM technique and brain learning. Extreme Learning Machines (ELM) aims to enable pervasive learning and pervasive intelligence. As advocated by ELM theories, it is exciting to see the convergence of machine learning and biological learning from the long-term point of view. ELM may be one of the fundamental “learning particles” filling the gaps between machine learning and biological learning (of which activation functions are even unknown). ELM represents a suite of (machine and biological) learning techniques in which hidden neurons need not be tuned: inherited from their ancestors or randomly generated. ELM learning theories show that effective learning algorithms can be derived based on randomly generated hidden neurons (biological neurons, artificial neurons, wavelets, Fourier series, etc.) as long as they are nonlinear piecewise continuous, independent of training data and application environments. Increasingly, evidence from neuroscience suggests that similar principles apply in biological learning systems. ELM theories and algorithms argue that “random hidden neurons” capture an essential aspect of biological learning mechanisms as well as the intuitive sense that the efficiency of biological learning need not rely on computing power of neurons. ELM theories thus hint at possible reasons why the brain is more intelligent and effective than current computers. The main theme of ELM2018 is Hierarchical ELM, AI for IoT, Synergy of Machine Learning and Biological Learning. This book covers theories, algorithms and applications of ELM. It gives readers a glance at the most recent advances of ELM.