Rule Extraction from Support Vector Machines


Book Description

Support vector machines (SVMs) are one of the most active research areas in machine learning. SVMs have shown good performance in a number of applications, including text and image classification. However, the learning capability of SVMs comes at a cost – an inherent inability to explain in a comprehensible form, the process by which a learning result was reached. Hence, the situation is similar to neural networks, where the apparent lack of an explanation capability has led to various approaches aiming at extracting symbolic rules from neural networks. For SVMs to gain a wider degree of acceptance in fields such as medical diagnosis and security sensitive areas, it is desirable to offer an explanation capability. User explanation is often a legal requirement, because it is necessary to explain how a decision was reached or why it was made. This book provides an overview of the field and introduces a number of different approaches to extracting rules from support vector machines developed by key researchers. In addition, successful applications are outlined and future research opportunities are discussed. The book is an important reference for researchers and graduate students, and since it provides an introduction to the topic, it will be important in the classroom as well. Because of the significance of both SVMs and user explanation, the book is of relevance to data mining practitioners and data analysts.




Rule Extraction from Support Vector Machines


Book Description

Support vector machines (SVMs) are one of the most active research areas in machine learning. SVMs have shown good performance in a number of applications, including text and image classification. However, the learning capability of SVMs comes at a cost – an inherent inability to explain in a comprehensible form, the process by which a learning result was reached. Hence, the situation is similar to neural networks, where the apparent lack of an explanation capability has led to various approaches aiming at extracting symbolic rules from neural networks. For SVMs to gain a wider degree of acceptance in fields such as medical diagnosis and security sensitive areas, it is desirable to offer an explanation capability. User explanation is often a legal requirement, because it is necessary to explain how a decision was reached or why it was made. This book provides an overview of the field and introduces a number of different approaches to extracting rules from support vector machines developed by key researchers. In addition, successful applications are outlined and future research opportunities are discussed. The book is an important reference for researchers and graduate students, and since it provides an introduction to the topic, it will be important in the classroom as well. Because of the significance of both SVMs and user explanation, the book is of relevance to data mining practitioners and data analysts.




Rule Extraction from Support Vector Machine


Book Description

Doctoral Thesis / Dissertation from the year 2010 in the subject Computer Science - Applied, grade: none, , course: Department of Computers and Information Sciences - Ph.D., language: English, abstract: Although Support Vector Machines have been used to develop highly accurate classification and regression models in various real-world problem domains, the most significant barrier is that SVM generates black box model that is difficult to understand. The procedure to convert these opaque models into transparent models is called rule extraction. This thesis investigates the task of extracting comprehensible models from trained SVMs, thereby alleviating this limitation. The primary contribution of the thesis is the proposal of various algorithms to overcome the significant limitations of SVM by taking a novel approach to the task of extracting comprehensible models. The basic contribution of the thesis are systematic review of literature on rule extraction from SVM, identifying gaps in the literature and proposing novel approaches for addressing the gaps. The contributions are grouped under three classes, decompositional, pedagogical and eclectic/hybrid approaches. Decompositional approach is closely intertwined with the internal workings of the SVM. Pedagogical approach uses SVM as an oracle to re-label training examples as well as artificially generated examples. In the eclectic/hybrid approach, a combination of these two methods is adopted. The thesis addresses various problems from the finance domain such as bankruptcy prediction in banks/firms, churn prediction in analytical CRM and Insurance fraud detection. Apart from this various benchmark datasets such as iris, wine and WBC for classification problems and auto MPG, body fat, Boston housing, forest fires and pollution for regression problems are also tested using the proposed appraoch. In addition, rule extraction from unbalanced datasets as well as from active learning based approaches has been explored. For classification problems, various rule extraction methods such as FRBS, DT, ANFIS, CART and NBTree have been utilized. Additionally for regression problems, rule extraction methods such as ANFIS, DENFIS and CART have also been employed. Results are analyzed using accuracy, sensitivity, specificity, fidelity, AUC and t-test measures. Proposed approaches demonstrate their viability in extracting accurate, effective and comprehensible rule sets in various benchmark and real world problem domains across classification and regression problems. Future directions have been indicated to extend the approaches to newer variations of SVM as well as to other problem domains.




Soft Computing for Knowledge Discovery and Data Mining


Book Description

Data Mining is the science and technology of exploring large and complex bodies of data in order to discover useful patterns. It is extremely important because it enables modeling and knowledge extraction from abundant data availability. This book introduces soft computing methods extending the envelope of problems that data mining can solve efficiently. It presents practical soft-computing approaches in data mining and includes various real-world case studies with detailed results.




Support Vector Machines


Book Description

Support Vector Machines: Optimization Based Theory, Algorithms, and Extensions presents an accessible treatment of the two main components of support vector machines (SVMs)-classification problems and regression problems. The book emphasizes the close connection between optimization theory and SVMs since optimization is one of the pillars on which




Rough Sets, Fuzzy Sets, Data Mining and Granular Computing


Book Description

This book constitutes the refereed proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, RSFDGrC 2009, held in Delhi, India in December 2009 in conjunction with the Third International Conference on Pattern Recognition and Machine Intelligence, PReMI 2009. RSFDGrC 2009 is the core component of a broader Rough Set Year in India initiative, RSIndia09. The 56 revised full papers presented together with 6 invited papers and a report on the Rough Set Year in India 2009 project were carefully reviewed and selected from a total of 130 submissions. The papers are organized in topical sections on foundations of rough sets and beyond; rought set algorithms and applications; fuzzy set foundations and applications; data mining and knowledge discovery; clustering and current trends in computing; and information retrieval and text mining.




Support Vector Machines and Evolutionary Algorithms for Classification


Book Description

When discussing classification, support vector machines are known to be a capable and efficient technique to learn and predict with high accuracy within a quick time frame. Yet, their black box means to do so make the practical users quite circumspect about relying on it, without much understanding of the how and why of its predictions. The question raised in this book is how can this ‘masked hero’ be made more comprehensible and friendly to the public: provide a surrogate model for its hidden optimization engine, replace the method completely or appoint a more friendly approach to tag along and offer the much desired explanations? Evolutionary algorithms can do all these and this book presents such possibilities of achieving high accuracy, comprehensibility, reasonable runtime as well as unconstrained performance.




Knowledge Discovery with Support Vector Machines


Book Description

An easy-to-follow introduction to support vector machines This book provides an in-depth, easy-to-follow introduction to support vector machines drawing only from minimal, carefully motivated technical and mathematical background material. It begins with a cohesive discussion of machine learning and goes on to cover: Knowledge discovery environments Describing data mathematically Linear decision surfaces and functions Perceptron learning Maximum margin classifiers Support vector machines Elements of statistical learning theory Multi-class classification Regression with support vector machines Novelty detection Complemented with hands-on exercises, algorithm descriptions, and data sets, Knowledge Discovery with Support Vector Machines is an invaluable textbook for advanced undergraduate and graduate courses. It is also an excellent tutorial on support vector machines for professionals who are pursuing research in machine learning and related areas.




Support Vector Machines


Book Description

Every mathematical discipline goes through three periods of development: the naive, the formal, and the critical. David Hilbert The goal of this book is to explain the principles that made support vector machines (SVMs) a successful modeling and prediction tool for a variety of applications. We try to achieve this by presenting the basic ideas of SVMs together with the latest developments and current research questions in a uni?ed style. In a nutshell, we identify at least three reasons for the success of SVMs: their ability to learn well with only a very small number of free parameters, their robustness against several types of model violations and outliers, and last but not least their computational e?ciency compared with several other methods. Although there are several roots and precursors of SVMs, these methods gained particular momentum during the last 15 years since Vapnik (1995, 1998) published his well-known textbooks on statistical learning theory with aspecialemphasisonsupportvectormachines. Sincethen,the?eldofmachine learninghaswitnessedintenseactivityinthestudyofSVMs,whichhasspread moreandmoretootherdisciplinessuchasstatisticsandmathematics. Thusit seems fair to say that several communities are currently working on support vector machines and on related kernel-based methods. Although there are many interactions between these communities, we think that there is still roomforadditionalfruitfulinteractionandwouldbegladifthistextbookwere found helpful in stimulating further research. Many of the results presented in this book have previously been scattered in the journal literature or are still under review. As a consequence, these results have been accessible only to a relativelysmallnumberofspecialists,sometimesprobablyonlytopeoplefrom one community but not the others.




Computational Optimization, Methods and Algorithms


Book Description

Computational optimization is an important paradigm with a wide range of applications. In virtually all branches of engineering and industry, we almost always try to optimize something - whether to minimize the cost and energy consumption, or to maximize profits, outputs, performance and efficiency. In many cases, this search for optimality is challenging, either because of the high computational cost of evaluating objectives and constraints, or because of the nonlinearity, multimodality, discontinuity and uncertainty of the problem functions in the real-world systems. Another complication is that most problems are often NP-hard, that is, the solution time for finding the optimum increases exponentially with the problem size. The development of efficient algorithms and specialized techniques that address these difficulties is of primary importance for contemporary engineering, science and industry. This book consists of 12 self-contained chapters, contributed from worldwide experts who are working in these exciting areas. The book strives to review and discuss the latest developments concerning optimization and modelling with a focus on methods and algorithms for computational optimization. It also covers well-chosen, real-world applications in science, engineering and industry. Main topics include derivative-free optimization, multi-objective evolutionary algorithms, surrogate-based methods, maximum simulated likelihood estimation, support vector machines, and metaheuristic algorithms. Application case studies include aerodynamic shape optimization, microwave engineering, black-box optimization, classification, economics, inventory optimization and structural optimization. This graduate level book can serve as an excellent reference for lecturers, researchers and students in computational science, engineering and industry.