Book Description
The collation of large electronic databases of scienti?c and commercial infor- tion has led to a dramatic growth of interest in methods for discovering struc- res in such databases. These methods often go under the general name of data mining. One important subdiscipline within data mining is concerned with the identi?cation and detection of anomalous, interesting, unusual, or valuable - cords or groups of records, which we call patterns. Familiar examples are the detection of fraud in credit-card transactions, of particular coincident purchases in supermarket transactions, of important nucleotide sequences in gene sequence analysis, and of characteristic traces in EEG records. Tools for the detection of such patterns have been developed within the data mining community, but also within other research communities, typically without an awareness that the - sic problem was common to many disciplines. This is not unreasonable: each of these disciplines has a large literature of its own, and a literature which is growing rapidly. Keeping up with any one of these is di?cult enough, let alone keeping up with others as well, which may in any case be couched in an - familiar technical language. But, of course, this means that opportunities are being lost, discoveries relating to the common problem made in one area are not transferred to the other area, and breakthroughs and problem solutions are being rediscovered, or not discovered for a long time, meaning that e?ort is being wasted and opportunities may be lost.