Scientific Data Mining


Book Description

Chandrika Kamath describes how techniques from the multi-disciplinary field of data mining can be used to address the modern problem of data overload in science and engineering domains. Starting with a survey of analysis problems in different applications, it identifies the common themes across these domains.




Next Generation of Data-Mining Applications


Book Description

Discover the next generation of data-mining tools and technology This book brings together an international team of eighty experts to present readers with the next generation of data-mining applications. Unlike other publications that take a strictly academic and theoretical approach, this book features authors who have successfully developed data-mining solutions for a variety of customer types. Presenting their state-of-the-art methodologies and techniques, the authors show readers how they can analyze enormous quantities of data and make new discoveries by connecting key pieces of data that may be spread across several different databases and file servers. The latest data-mining techniques that will revolutionize research across a wide variety of fields including business, science, healthcare, and industry are all presented. Organized by application, the twenty-five chapters cover applications in: Industry and business Science and engineering Bioinformatics and biotechnology Medicine and pharmaceuticals Web and text-mining Security New trends in data-mining technology And much more . . . Readers from a variety of disciplines will learn how the next generation of data-mining applications can radically enhance their ability to analyze data and open the doors to new opportunities. Readers will discover: New data-mining tools to automate the evaluation and qualification of sales opportunities The latest tools needed for gene mapping and proteomic data analysis Sophisticated techniques that can be engaged in crime fighting and prevention With its coverage of the most advanced applications, Next Generation of Data-Mining Applications is essential reading for all researchers working in data mining or who are tasked with making sense of an ever-growing quantity of data. The publication also serves as an excellent textbook for upper-level undergraduate and graduate courses in computer science, information management, and statistics.




Machine Learning and Knowledge Discovery in Databases: Applied Data Science Track


Book Description

The 5-volume proceedings, LNAI 12457 until 12461 constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2020, which was held during September 14-18, 2020. The conference was planned to take place in Ghent, Belgium, but had to change to an online format due to the COVID-19 pandemic. The 232 full papers and 10 demo papers presented in this volume were carefully reviewed and selected for inclusion in the proceedings. The volumes are organized in topical sections as follows: Part I: Pattern Mining; clustering; privacy and fairness; (social) network analysis and computational social science; dimensionality reduction and autoencoders; domain adaptation; sketching, sampling, and binary projections; graphical models and causality; (spatio-) temporal data and recurrent neural networks; collaborative filtering and matrix completion. Part II: deep learning optimization and theory; active learning; adversarial learning; federated learning; Kernel methods and online learning; partial label learning; reinforcement learning; transfer and multi-task learning; Bayesian optimization and few-shot learning. Part III: Combinatorial optimization; large-scale optimization and differential privacy; boosting and ensemble methods; Bayesian methods; architecture of neural networks; graph neural networks; Gaussian processes; computer vision and image processing; natural language processing; bioinformatics. Part IV: applied data science: recommendation; applied data science: anomaly detection; applied data science: Web mining; applied data science: transportation; applied data science: activity recognition; applied data science: hardware and manufacturing; applied data science: spatiotemporal data. Part V: applied data science: social good; applied data science: healthcare; applied data science: e-commerce and finance; applied data science: computational social science; applied data science: sports; demo track.




Interpretable Machine Learning


Book Description

This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simple, interpretable models such as decision trees, decision rules and linear regression. Later chapters focus on general model-agnostic methods for interpreting black box models like feature importance and accumulated local effects and explaining individual predictions with Shapley values and LIME. All interpretation methods are explained in depth and discussed critically. How do they work under the hood? What are their strengths and weaknesses? How can their outputs be interpreted? This book will enable you to select and correctly apply the interpretation method that is most suitable for your machine learning project.




Encyclopedia of Computer Science and Technology


Book Description

With breadth and depth of coverage, the Encyclopedia of Computer Science and Technology, Second Edition has a multi-disciplinary scope, drawing together comprehensive coverage of the inter-related aspects of computer science and technology. The topics covered in this encyclopedia include: General and reference Hardware Computer systems organization Networks Software and its engineering Theory of computation Mathematics of computing Information systems Security and privacy Human-centered computing Computing methodologies Applied computing Professional issues Leading figures in the history of computer science The encyclopedia is structured according to the ACM Computing Classification System (CCS), first published in 1988 but subsequently revised in 2012. This classification system is the most comprehensive and is considered the de facto ontological framework for the computing field. The encyclopedia brings together the information and historical context that students, practicing professionals, researchers, and academicians need to have a strong and solid foundation in all aspects of computer science and technology.




3D Surface Reconstruction


Book Description

3D Surface Reconstruction: Multi-Scale Hierarchical Approaches presents methods to model 3D objects in an incremental way so as to capture more finer details at each step. The configuration of the model parameters, the rationale and solutions are described and discussed in detail so the reader has a strong understanding of the methodology. Modeling starts from data captured by 3D digitizers and makes the process even more clear and engaging. Innovative approaches, based on two popular machine learning paradigms, namely Radial Basis Functions and the Support Vector Machines, are also introduced. These paradigms are innovatively extended to a multi-scale incremental structure, based on a hierarchical scheme. The resulting approaches allow readers to achieve high accuracy with limited computational complexity, and makes the approaches appropriate for online, real-time operation. Applications can be found in any domain in which regression is required. 3D Surface Reconstruction: Multi-Scale Hierarchical Approaches is designed as a secondary text book or reference for advanced-level students and researchers in computer science. This book also targets practitioners working in computer vision or machine learning related fields.







Explainable AI: Interpreting, Explaining and Visualizing Deep Learning


Book Description

The development of “intelligent” systems that can take decisions and perform autonomously might lead to faster and more consistent decisions. A limiting factor for a broader adoption of AI technology is the inherent risks that come with giving up human control and oversight to “intelligent” machines. For sensitive tasks involving critical infrastructures and affecting human well-being or health, it is crucial to limit the possibility of improper, non-robust and unsafe decisions and actions. Before deploying an AI system, we see a strong need to validate its behavior, and thus establish guarantees that it will continue to perform as expected when deployed in a real-world environment. In pursuit of that objective, ways for humans to verify the agreement between the AI decision structure and their own ground-truth knowledge have been explored. Explainable AI (XAI) has developed as a subfield of AI, focused on exposing complex AI models to humans in a systematic and interpretable manner. The 22 chapters included in this book provide a timely snapshot of algorithms, theory, and applications of interpretable and explainable AI and AI techniques that have been proposed recently reflecting the current discourse in this field and providing directions of future development. The book is organized in six parts: towards AI transparency; methods for interpreting AI systems; explaining the decisions of AI systems; evaluating interpretability and explanations; applications of explainable AI; and software for explainable AI.




Advanced Research and Trends in New Technologies, Software, Human-Computer Interaction, and Communicability


Book Description

"This book presents scientific, theoretical, and practical insight on the software and technology of social networks and the factors that boost communicability, highlighting different disciplines in the computer and social sciences fields"--Provided by publisher.




Social Science Research


Book Description

This book is designed to introduce doctoral and graduate students to the process of conducting scientific research in the social sciences, business, education, public health, and related disciplines. It is a one-stop, comprehensive, and compact source for foundational concepts in behavioral research, and can serve as a stand-alone text or as a supplement to research readings in any doctoral seminar or research methods class. This book is currently used as a research text at universities on six continents and will shortly be available in nine different languages.