Data Abstraction and Pattern Identification in Time-series Data


Book Description

Data sources such as simulations, sensor networks across many application domains generate large volumes of time-series data which exhibit characteristics that evolve over time. Visual data analysis methods can help us in exploring and understanding the underlying patterns present in time-series data but, due to their ever-increasing size, the visual data analysis process can become complex. Large data sets can be handled using data abstraction techniques by transforming the raw data into a simpler format while, at the same time, preserving significant features that are important for the user. When dealing with time-series data, abstraction techniques should also take into account the underlying temporal characteristics. This thesis focuses on different data abstraction and pattern identification methods particularly in the cases of large 1D time-series and 2D spatio-temporal time-series data which exhibit spatiotemporal discontinuity. Based on the dimensionality and characteristics of the data, this thesis proposes a variety of efficient data-adaptive and user-controlled data abstraction methods that transform the raw data into a symbol sequence. The transformation of raw time-series into a symbol sequence can act as input to different sequence analysis methods from data mining and machine learning communities to identify interesting patterns of user behavior. In the case of very long duration 1D time-series, locally adaptive and user-controlled data approximation methods were presented to simplify the data, while at the same time retaining the perceptually important features. The simplified data were converted into a symbol sequence and a sketch-based pattern identification was then used to identify patterns in the symbolic data using regular expression based pattern matching. The method was applied to financial time-series and patterns such as head-and-shoulders, double and triple-top patterns were identified using hand drawn sketches in an interactive manner. Through data smoothing, the data approximation step also enables visualization of inherent patterns in the time-series representation while at the same time retaining perceptually important points. Very long duration 2D spatio-temporal eye tracking data sets that exhibit spatio-temporal discontinuity was transformed into symbolic data using scalable clustering and hierarchical cluster merging processes, each of which can be parallelized. The raw data is transformed into a symbol sequence with each symbol representing a region of interest in the eye gaze data. The identified regions of interest can also be displayed in a Space-Time Cube (STC) that captures both the temporal and contextual information. Through interactive filtering, zooming and geometric transformation, the STC representation along with linked views enables interactive data exploration. Using different sequence analysis methods, the symbol sequences are analyzed further to identify temporal patterns in the data set. Data collected from air traffic control officers from the domain of Air traffic control were used as application examples to demonstrate the results.




Visualization Analysis and Design


Book Description

Learn How to Design Effective Visualization SystemsVisualization Analysis and Design provides a systematic, comprehensive framework for thinking about visualization in terms of principles and design choices. The book features a unified approach encompassing information visualization techniques for abstract data, scientific visualization techniques




Pattern Classification


Book Description

The first edition, published in 1973, has become a classicreference in the field. Now with the second edition, readers willfind information on key new topics such as neural networks andstatistical pattern recognition, the theory of machine learning,and the theory of invariances. Also included are worked examples,comparisons between different methods, extensive graphics, expandedexercises and computer project topics. An Instructor's Manual presenting detailed solutions to all theproblems in the book is available from the Wiley editorialdepartment.




Intelligent Data Engineering and Automated Learning -- IDEAL 2012


Book Description

This book constitutes the refereed proceedings of the 13th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2012, held in Natal, Brazil, in August 2012. The 100 revised full papers presented were carefully reviewed and selected from more than 200 submissions for inclusion in the book and present the latest theoretical advances and real-world applications in computational intelligence.




Intelligent Computing


Book Description

This book, gathering the Proceedings of the 2018 Computing Conference, offers a remarkable collection of chapters covering a wide range of topics in intelligent systems, computing and their real-world applications. The Conference attracted a total of 568 submissions from pioneering researchers, scientists, industrial engineers, and students from all around the world. These submissions underwent a double-blind peer review process. Of those 568 submissions, 192 submissions (including 14 poster papers) were selected for inclusion in these proceedings. Despite computer science’s comparatively brief history as a formal academic discipline, it has made a number of fundamental contributions to science and society—in fact, along with electronics, it is a founding science of the current epoch of human history (‘the Information Age’) and a main driver of the Information Revolution. The goal of this conference is to provide a platform for researchers to present fundamental contributions, and to be a premier venue for academic and industry practitioners to share new ideas and development experiences. This book collects state of the art chapters on all aspects of Computer Science, from classical to intelligent. It covers both the theory and applications of the latest computer technologies and methodologies. Providing the state of the art in intelligent methods and techniques for solving real-world problems, along with a vision of future research, the book will be interesting and valuable for a broad readership.




Artificial Intelligence, Big Data and Data Science in Statistics


Book Description

This book discusses the interplay between statistics, data science, machine learning and artificial intelligence, with a focus on environmental science, the natural sciences, and technology. It covers the state of the art from both a theoretical and a practical viewpoint and describes how to successfully apply machine learning methods, demonstrating the benefits of statistics for modeling and analyzing high-dimensional and big data. The book’s expert contributions include theoretical studies of machine learning methods, expositions of general methodologies for sound statistical analyses of data as well as novel approaches to modeling and analyzing data for specific problems and areas. In terms of applications, the contributions deal with data as arising in industrial quality control, autonomous driving, transportation and traffic, chip manufacturing, photovoltaics, football, transmission of infectious diseases, Covid-19 and public health. The book will appeal to statisticians and data scientists, as well as engineers and computer scientists working in related fields or applications.




Patterns Identification and Data Mining in Weather and Climate


Book Description

Advances in computer power and observing systems has led to the generation and accumulation of large scale weather & climate data begging for exploration and analysis. Pattern Identification and Data Mining in Weather and Climate presents, from different perspectives, most available, novel and conventional, approaches used to analyze multivariate time series in climate science to identify patterns of variability, teleconnections, and reduce dimensionality. The book discusses different methods to identify patterns of spatiotemporal fields. The book also presents machine learning with a particular focus on the main methods used in climate science. Applications to atmospheric and oceanographic data are also presented and discussed in most chapters. To help guide students and beginners in the field of weather & climate data analysis, basic Matlab skeleton codes are given is some chapters, complemented with a list of software links toward the end of the text. A number of technical appendices are also provided, making the text particularly suitable for didactic purposes. The topic of EOFs and associated pattern identification in space-time data sets has gone through an extraordinary fast development, both in terms of new insights and the breadth of applications. We welcome this text by Abdel Hannachi who not only has a deep insight in the field but has himself made several contributions to new developments in the last 15 years. - Huug van den Dool, Climate Prediction Center, NCEP, College Park, MD, U.S.A. Now that weather and climate science is producing ever larger and richer data sets, the topic of pattern extraction and interpretation has become an essential part. This book provides an up to date overview of the latest techniques and developments in this area. - Maarten Ambaum, Department of Meteorology, University of Reading, U.K. This nicely and expertly written book covers a lot of ground, ranging from classical linear pattern identification techniques to more modern machine learning, illustrated with examples from weather & climate science. It will be very valuable both as a tutorial for graduate and postgraduate students and as a reference text for researchers and practitioners in the field. - Frank Kwasniok, College of Engineering, University of Exeter, U.K.




Advances in Computing and Data Sciences


Book Description

This two-volume set (CCIS 905 and CCIS 906) constitutes the refereed proceedings of the Second International Conference on Advances in Computing and Data Sciences, ICACDS 2018, held in Dehradun, India, in April 2018. The 110 full papers were carefully reviewed and selected from 598 submissions. The papers are centered around topics like advanced computing, data sciences, distributed systems organizing principles, development frameworks and environments, software verification and validation, computational complexity and cryptography, machine learning theory, database theory, probabilistic representations.




Management of Data


Book Description




Artificial Immune Systems


Book Description

This book constitutes the refereed proceedings of the 6th International Conference on Artificial Immune Systems, ICARIS 2007, held in Santos, Brazil, in August 2007. The 36 revised full papers presented were carefully reviewed and selected from 58 submissions. The papers are organized in topical sections on search and optimization, classification and clustering, anomaly detection and negative selection, robotics, control and electronics, modeling papers, conceptual papers, as well as technical papers and general applications.