Data Mining in Time Series Databases


Book Description

Adding the time dimension to real-world databases produces Time Series Databases (TSDB) and introduces new aspects and difficulties to data mining and knowledge discovery. This book covers the state-of-the-art methodology for mining time series databases. The novel data mining methods presented in the book include techniques for efficient segmentation, indexing, and classification of noisy and dynamic time series. A graph-based method for anomaly detection in time series is described and the book also studies the implications of a novel and potentially useful representation of time series as strings. The problem of detecting changes in data mining models that are induced from temporal databases is additionally discussed. Contents: A Survey of Recent Methods for Efficient Retrieval of Similar Time Sequences (H M Lie); Indexing of Compressed Time Series (E Fink & K Pratt); Boosting Interval-Based Literal: Variable Length and Early Classification (J J Rodriguez Diez); Segmenting Time Series: A Survey and Novel Approach (E Keogh et al.); Indexing Similar Time Series under Conditions of Noise (M Vlachos et al.); Classification of Events in Time Series of Graphs (H Bunke & M Kraetzl); Median Strings--A Review (X Jiang et al.); Change Detection in Classfication Models of Data Mining (G Zeira et al.). Readership: Graduate students, reseachers and practitioners in the fields of data mining, machine learning, databases and statistics.




Data Mining In Time Series And Streaming Databases


Book Description

This compendium is a completely revised version of an earlier book, Data Mining in Time Series Databases, by the same editors. It provides a unique collection of new articles written by leading experts that account for the latest developments in the field of time series and data stream mining.The emerging topics covered by the book include weightless neural modeling for mining data streams, using ensemble classifiers for imbalanced and evolving data streams, document stream mining with active learning, and many more. In particular, it addresses the domain of streaming data, which has recently become one of the emerging topics in Data Science, Big Data, and related areas. Existing titles do not provide sufficient information on this topic.







Data Streams


Book Description

This book primarily discusses issues related to the mining aspects of data streams and it is unique in its primary focus on the subject. This volume covers mining aspects of data streams comprehensively: each contributed chapter contains a survey on the topic, the key ideas in the field for that particular topic, and future research directions. The book is intended for a professional audience composed of researchers and practitioners in industry. This book is also appropriate for advanced-level students in computer science.




Knowledge Discovery from Data Streams


Book Description

Since the beginning of the Internet age and the increased use of ubiquitous computing devices, the large volume and continuous flow of distributed data have imposed new constraints on the design of learning algorithms. Exploring how to extract knowledge structures from evolving and time-changing data, Knowledge Discovery from Data Streams presents




Mining of Massive Datasets


Book Description

Now in its second edition, this book focuses on practical algorithms for mining data from even the largest datasets.




Stream Data Mining: Algorithms and Their Probabilistic Properties


Book Description

This book presents a unique approach to stream data mining. Unlike the vast majority of previous approaches, which are largely based on heuristics, it highlights methods and algorithms that are mathematically justified. First, it describes how to adapt static decision trees to accommodate data streams; in this regard, new splitting criteria are developed to guarantee that they are asymptotically equivalent to the classical batch tree. Moreover, new decision trees are designed, leading to the original concept of hybrid trees. In turn, nonparametric techniques based on Parzen kernels and orthogonal series are employed to address concept drift in the problem of non-stationary regressions and classification in a time-varying environment. Lastly, an extremely challenging problem that involves designing ensembles and automatically choosing their sizes is described and solved. Given its scope, the book is intended for a professional audience of researchers and practitioners who deal with stream data, e.g. in telecommunication, banking, and sensor networks.




Advanced Data Mining and Applications


Book Description

This book constitutes the refereed proceedings of the 8th International Conference on Advanced Data Mining and Applications, ADMA 2012, held in Nanjing, China, in December 2012. The 32 regular papers and 32 short papers presented in this volume were carefully reviewed and selected from 168 submissions. They are organized in topical sections named: social media mining; clustering; machine learning: algorithms and applications; classification; prediction, regression and recognition; optimization and approximation; mining time series and streaming data; Web mining and semantic analysis; data mining applications; search and retrieval; information recommendation and hiding; outlier detection; topic modeling; and data cube computing.




Machine Learning for Data Streams


Book Description

A hands-on approach to tasks and techniques in data stream mining and real-time analytics, with examples in MOA, a popular freely available open-source software framework. Today many information sources—including sensor networks, financial markets, social networks, and healthcare monitoring—are so-called data streams, arriving sequentially and at high speed. Analysis must take place in real time, with partial data and without the capacity to store the entire data set. This book presents algorithms and techniques used in data stream mining and real-time analytics. Taking a hands-on approach, the book demonstrates the techniques using MOA (Massive Online Analysis), a popular, freely available open-source software framework, allowing readers to try out the techniques after reading the explanations. The book first offers a brief introduction to the topic, covering big data mining, basic methodologies for mining data streams, and a simple example of MOA. More detailed discussions follow, with chapters on sketching techniques, change, classification, ensemble methods, regression, clustering, and frequent pattern mining. Most of these chapters include exercises, an MOA-based lab session, or both. Finally, the book discusses the MOA software, covering the MOA graphical user interface, the command line, use of its API, and the development of new methods within MOA. The book will be an essential reference for readers who want to use data stream mining as a tool, researchers in innovation or data stream mining, and programmers who want to create new algorithms for MOA.




Advanced Data Mining and Applications


Book Description

This book constitutes the refereed proceedings of the First International Conference on Advanced Data Mining and Applications, ADMA 2005, held in Wuhan, China in July 2005. The conference was focused on sophisticated techniques and tools that can handle new fields of data mining, e.g. spatial data mining, biomedical data mining, and mining on high-speed and time-variant data streams; an expansion of data mining to new applications is also strived for. The 25 revised full papers and 75 revised short papers presented were carefully peer-reviewed and selected from over 600 submissions. The papers are organized in topical sections on association rules, classification, clustering, novel algorithms, text mining, multimedia mining, sequential data mining and time series mining, web mining, biomedical mining, advanced applications, security and privacy issues, spatial data mining, and streaming data mining.