Knowledge Discovery and Measures of Interest


Book Description

Knowledge Discovery and Measures of Interest is a reference book for knowledge discovery researchers, practitioners, and students. The knowledge discovery researcher will find that the material provides a theoretical foundation for measures of interest in data mining applications where diversity measures are used to rank summaries generated from databases. The knowledge discovery practitioner will find solid empirical evidence on which to base decisions regarding the choice of measures in data mining applications. The knowledge discovery student in a senior undergraduate or graduate course in databases and data mining will find the book is a good introduction to the concepts and techniques of measures of interest. In Knowledge Discovery and Measures of Interest, we study two closely related steps in any knowledge discovery system: the generation of discovered knowledge; and the interpretation and evaluation of discovered knowledge. In the generation step, we study data summarization, where a single dataset can be generalized in many different ways and to many different levels of granularity according to domain generalization graphs. In the interpretation and evaluation step, we study diversity measures as heuristics for ranking the interestingness of the summaries generated. The objective of this work is to introduce and evaluate a technique for ranking the interestingness of discovered patterns in data. It consists of four primary goals: To introduce domain generalization graphs for describing and guiding the generation of summaries from databases. To introduce and evaluate serial and parallel algorithms that traverse the domain generalization space described by the domain generalization graphs. To introduce and evaluate diversity measures as heuristic measures of interestingness for ranking summaries generated from databases. To develop the preliminary foundation for a theory of interestingness within the context of ranking summaries generated from databases. Knowledge Discovery and Measures of Interest is suitable as a secondary text in a graduate level course and as a reference for researchers and practitioners in industry.





Book Description




Logic Programming


Book Description

The themes of the 1997 conference are new theoretical and practical accomplishments in logic programming, new research directions where ideas originating from logic programming can play a fundamental role, and relations between logic programming and other fields of computer science. The annual International Logic Programming Symposium, traditionally held in North America, is one of the main international conferences sponsored by the Association of Logic Programming. The themes of the 1997 conference are new theoretical and practical accomplishments in logic programming, new research directions where ideas originating from logic programming can play a fundamental role, and relations between logic programming and other fields of computer science. Topics include theoretical foundations, constraints, concurrency and parallelism, deductive databases, language design and implementation, nonmonotonic reasoning, and logic programming and the Internet.




Intelligent Databases


Book Description

"This book integrates data management in databases with intelligent data processing and analysis in artificial intelligence. It challenges today's database technology and promotes its evolution"--Provided by publisher.




Methodologies for Knowledge Discovery and Data Mining


Book Description

This book constitutes the refereed proceedings of the Third Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD '99, held in Beijing, China, in April 1999. The 29 revised full papers presented together with 37 short papers were carefully selected from a total of 158 submissions. The book is divided into sections on emerging KDD technology; association rules; feature selection and generation; mining in semi-unstructured data; interestingness, surprisingness, and exceptions; rough sets, fuzzy logic, and neural networks; induction, classification, and clustering; visualization; causal models and graph-based methods; agent-based and distributed data mining; and advanced topics and new methodologies.




Advances in Knowledge Discovery and Data Mining


Book Description

Knowledge discovery and data mining have become areas of growing significance because of the recent increasing demand for KDD techniques, including those used in machine learning, databases, statistics, knowledge acquisition, data visualization, and high performance computing. In view of this, and following the success of the five previous PAKDD conferences, the sixth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2002) aimed to provide a forum for the sharing of original research results, innovative ideas, state-of-the-art developments, and implementation experiences in knowledge discovery and data mining among researchers in academic and industrial organizations. Much work went into preparing a program of high quality. We received 128 submissions. Every paper was reviewed by 3 program committee members, and 32 were selected as regular papers and 20 were selected as short papers, representing a 25% acceptance rate for regular papers. The PAKDD 2002 program was further enhanced by two keynote speeches, delivered by Vipin Kumar from the Univ. of Minnesota and Rajeev Rastogi from AT&T. In addition, PAKDD 2002 was complemented by three tutorials, XML and data mining (by Kyuseok Shim and Surajit Chadhuri), mining customer data across various customer touchpoints at- commerce sites (by Jaideep Srivastava), and data clustering analysis, from simple groupings to scalable clustering with constraints (by Osmar Zaiane and Andrew Foss).




Pattern Detection and Discovery


Book Description

The collation of large electronic databases of scienti?c and commercial infor- tion has led to a dramatic growth of interest in methods for discovering struc- res in such databases. These methods often go under the general name of data mining. One important subdiscipline within data mining is concerned with the identi?cation and detection of anomalous, interesting, unusual, or valuable - cords or groups of records, which we call patterns. Familiar examples are the detection of fraud in credit-card transactions, of particular coincident purchases in supermarket transactions, of important nucleotide sequences in gene sequence analysis, and of characteristic traces in EEG records. Tools for the detection of such patterns have been developed within the data mining community, but also within other research communities, typically without an awareness that the - sic problem was common to many disciplines. This is not unreasonable: each of these disciplines has a large literature of its own, and a literature which is growing rapidly. Keeping up with any one of these is di?cult enough, let alone keeping up with others as well, which may in any case be couched in an - familiar technical language. But, of course, this means that opportunities are being lost, discoveries relating to the common problem made in one area are not transferred to the other area, and breakthroughs and problem solutions are being rediscovered, or not discovered for a long time, meaning that e?ort is being wasted and opportunities may be lost.




Database and Expert Systems Applications


Book Description

th DEXA 2001, the 12 International Conference on Database and Expert Systems Applications was held on September 3–5, 2001, at the Technical University of Munich, Germany. The rapidly growing spectrum of database applications has led to the establishment of more specialized discussion platforms (DaWaK conference, EC Web conference, and DEXA workshop), which were all held in parallel with the DEXA conference in Munich. In your hands are the results of much effort, beginning with the preparation of the submitted papers. The papers then passed through the reviewing process, and the accepted papers were revised to final versions by their authors and arranged with the conference program. All this culminated in the conference itself. A total of 175 papers were submitted to this conference, and I would like to thank all the authors. They are the real base of the conference. The program committee and the supporting reviewers produced altogether 497 referee reports, on average of 2.84 reports per paper, and selected 93 papers for presentation. Comparing the weight or more precisely the number of papers devoted to particular topics at several recent DEXA conferences, an increase can be recognized in the areas of XMS databases, active databases, and multi and hypermedia efforts. The space devoted to the more classical topics such as information retrieval, distribution and Web aspects, and transaction, indexing and query aspects has remained more or less unchanged. Some decrease is visible for object orientation.




Advances in Knowledge Discovery and Data Mining


Book Description

The 7th Paci?c Asia Conference on Knowledge Discovery and Data Mining (PAKDD) was held from April 30 to May 2, 2003 in the Convention and Ex- bition Center (COEX), Seoul, Korea. The PAKDD conference is a major forum for academic researchers and industry practitioners in the Paci?c Asia region to share original research results and development experiences from di?erent KDD-related areas such as data mining, data warehousing, machine learning, databases, statistics, knowledge acquisition and discovery, data visualization, and knowledge-based systems. The conference was organized by the Advanced Information Technology Research Center (AITrc) at KAIST and the Statistical Research Center for Complex Systems (SRCCS) at Seoul National University. It was sponsored by the Korean Datamining Society (KDMS), the Korea Inf- mation Science Society (KISS), the United States Air Force O?ce of Scienti?c Research, the Asian O?ce of Aerospace Research & Development, and KAIST. It was held with cooperation from ACM’s Special Group on Knowledge Dis- very and Data Mining (SIGKDD).




Temporal, Spatial, and Spatio-Temporal Data Mining


Book Description

This volume contains updated versions of the ten papers presented at the First International Workshop on Temporal, Spatial and Spatio-Temporal Data Mining (TSDM 2000) held in conjunction with the 4th European Conference on Prin- ples and Practice of Knowledge Discovery in Databases (PKDD 2000) in Lyons, France in September, 2000. The aim of the workshop was to bring together experts in the analysis of temporal and spatial data mining and knowledge discovery in temporal, spatial or spatio-temporal database systems as well as knowledge engineers and domain experts from allied disciplines. The workshop focused on research and practice of knowledge discovery from datasets containing explicit or implicit temporal, spatial or spatio-temporal information. The ten original papers in this volume represent those accepted by peer review following an international call for papers. All papers submitted were refereed by an international team of data mining researchers listed below. We would like to thank the team for their expert and useful help with this process. Following the workshop, authors were invited to amend their papers to enable the feedback received from the conference to be included in the ?nal papers appearing in this volume. A workshop report was compiled by Kathleen Hornsby which also discusses the panel session that was held.