Advances in Distributed and Parallel Knowledge Discovery


Book Description

This book presents introductions to DKD and PKD, extensive reviews of the field, and state-of-the-art techniques. Foreword by Vipin Kumar Knowledge discovery and data mining (KDD) deals with the problem of extracting interesting associations, classifiers, clusters, and other patterns from data. The emergence of network-based distributed computing environments has introduced an important new dimension to this problem--distributed sources of data. Traditional centralized KDD typically requires central aggregation of distributed data, which may not always be feasible because of limited network bandwidth, security concerns, scalability problems, and other practical issues. Distributed knowledge discovery (DKD) works with the merger of communication and computation by analyzing data in a distributed fashion. This technology is particularly useful for large heterogeneous distributed environments such as the Internet, intranets, mobile computing environments, and sensor-networks.When the data sets are large, scaling up the speed of the KDD process is crucial. Parallel knowledge discovery (PKD) techniques addresses this problem by using high-performance multiprocessor machines. This book presents introductions to DKD and PKD, extensive reviews of the field, and state-of-the-art techniques. Contributors Rakesh Agrawal, Khaled AlSabti, Stuart Bailey, Philip Chan, David Cheung, Vincent Cho, Joydeep Ghosh, Robert Grossman, Yi-ke Guo, John Hale, John Hall, Daryl Hershberger, Ching-Tien Ho, Erik Johnson, Chris Jones, Chandrika Kamath, Hillol Kargupta, Charles Lo, Balinder Malhi, Ron Musick, Vincent Ng, Byung-Hoon Park, Srinivasan Parthasarathy, Andreas Prodromidis, Foster Provost, Jian Pun, Ashok Ramu, Sanjay Ranka, Mahesh Sreenivas, Salvatore Stolfo, Ramesh Subramonian, Janjao Sutiwaraphun, Kagan Tummer, Andrei Turinsky, Beat Wüthrich, Mohammed Zaki, Joshua Zhang




Advanced Methods for Knowledge Discovery from Complex Data


Book Description

The growth in the amount of data collected and generated has exploded in recent times with the widespread automation of various day-to-day activities, advances in high-level scienti?c and engineering research and the development of e?cient data collection tools. This has given rise to the need for automa- callyanalyzingthedatainordertoextractknowledgefromit,therebymaking the data potentially more useful. Knowledge discovery and data mining (KDD) is the process of identifying valid, novel, potentially useful and ultimately understandable patterns from massive data repositories. It is a multi-disciplinary topic, drawing from s- eral ?elds including expert systems, machine learning, intelligent databases, knowledge acquisition, case-based reasoning, pattern recognition and stat- tics. Many data mining systems have typically evolved around well-organized database systems (e.g., relational databases) containing relevant information. But, more and more, one ?nds relevant information hidden in unstructured text and in other complex forms. Mining in the domains of the world-wide web, bioinformatics, geoscienti?c data, and spatial and temporal applications comprise some illustrative examples in this regard. Discovery of knowledge, or potentially useful patterns, from such complex data often requires the - plication of advanced techniques that are better able to exploit the nature and representation of the data. Such advanced methods include, among o- ers, graph-based and tree-based approaches to relational learning, sequence mining, link-based classi?cation, Bayesian networks, hidden Markov models, neural networks, kernel-based methods, evolutionary algorithms, rough sets and fuzzy logic, and hybrid systems. Many of these methods are developed in the following chapters.




Data Mining and Knowledge Discovery Handbook


Book Description

This book organizes key concepts, theories, standards, methodologies, trends, challenges and applications of data mining and knowledge discovery in databases. It first surveys, then provides comprehensive yet concise algorithmic descriptions of methods, including classic methods plus the extensions and novel methods developed recently. It also gives in-depth descriptions of data mining applications in various interdisciplinary industries.




Handbook of Research on Geographic Information Systems Applications and Advancements


Book Description

The proper management of geographic data can provide assistance to a number of different sectors within society. As such, it is imperative to continue advancing research for spatial data analysis. The Handbook of Research on Geographic Information Systems Applications and Advancements presents a thorough overview of the latest developments in effective management techniques for collecting, processing, analyzing, and utilizing geographical data and information. Highlighting theoretical frameworks and relevant applications, this book is an ideal reference source for researchers, academics, professionals, and students actively involved in the field of geographic information systems.




Data Mining for Scientific and Engineering Applications


Book Description

Advances in technology are making massive data sets common in many scientific disciplines, such as astronomy, medical imaging, bio-informatics, combinatorial chemistry, remote sensing, and physics. To find useful information in these data sets, scientists and engineers are turning to data mining techniques. This book is a collection of papers based on the first two in a series of workshops on mining scientific datasets. It illustrates the diversity of problems and application areas that can benefit from data mining, as well as the issues and challenges that differentiate scientific data mining from its commercial counterpart. While the focus of the book is on mining scientific data, the work is of broader interest as many of the techniques can be applied equally well to data arising in business and web applications. Audience: This work would be an excellent text for students and researchers who are familiar with the basic principles of data mining and want to learn more about the application of data mining to their problem in science or engineering.




Encyclopedia of Data Warehousing and Mining


Book Description

Data Warehousing and Mining (DWM) is the science of managing and analyzing large datasets and discovering novel patterns and in recent years has emerged as a particularly exciting and industrially relevant area of research. Prodigious amounts of data are now being generated in domains as diverse as market research, functional genomics and pharmaceuticals; intelligently analyzing these data, with the aim of answering crucial questions and helping make informed decisions, is the challenge that lies ahead. The Encyclopedia of Data Warehousing and Mining provides a comprehensive, critical and descriptive examination of concepts, issues, trends, and challenges in this rapidly expanding field of data warehousing and mining (DWM). This encyclopedia consists of more than 350 contributors from 32 countries, 1,800 terms and definitions, and more than 4,400 references. This authoritative publication offers in-depth coverage of evolutions, theories, methodologies, functionalities, and applications of DWM in such interdisciplinary industries as healthcare informatics, artificial intelligence, financial modeling, and applied statistics, making it a single source of knowledge and latest discoveries in the field of DWM.




Data Mining Techniques in Grid Computing Environments


Book Description

Based around eleven international real life case studies and including contributions from leading experts in the field this groundbreaking book explores the need for the grid-enabling of data mining applications and provides a comprehensive study of the technology, techniques and management skills necessary to create them. This book provides a simultaneous design blueprint, user guide, and research agenda for current and future developments and will appeal to a broad audience; from developers and users of data mining and grid technology, to advanced undergraduate and postgraduate students interested in this field.




Data Mining


Book Description

"An overview of the multidisciplinary field of data mining, this book focuses specifically on new methodologies and case studies. Included are case studies written by 44 leading scientists and talented young scholars from seven different countries. Topics covered include data mining based on rough sets, the impact of missing data, and mining free text for structure. In addition, the four basic mining operations supported by numerous mining techniques are addressed: predictive model creation supported by supervised induction techniques; link analysis supported by association discovery and sequence discovery techniques; DB segmentation supported by clustering techniques; and deviation detection supported by statistical techniques."




Intelligent Information Agents


Book Description

This book presents 10 chapters on various aspects of intelligent information agents contributed by members of the respective AgentLink special interest group. The papers are organized in three parts on agent-based information systems, adaptive information agents, and coordination of information agents. Also included are a comprehensive introduction and surveys for each of the three parts.




Encyclopedia of Data Warehousing and Mining, Second Edition


Book Description

There are more than one billion documents on the Web, with the count continually rising at a pace of over one million new documents per day. As information increases, the motivation and interest in data warehousing and mining research and practice remains high in organizational interest. The Encyclopedia of Data Warehousing and Mining, Second Edition, offers thorough exposure to the issues of importance in the rapidly changing field of data warehousing and mining. This essential reference source informs decision makers, problem solvers, and data mining specialists in business, academia, government, and other settings with over 300 entries on theories, methodologies, functionalities, and applications.