Spectral Feature Selection for Data Mining


Book Description

Spectral Feature Selection for Data Mining introduces a novel feature selection technique that establishes a general platform for studying existing feature selection algorithms and developing new algorithms for emerging problems in real-world applications. This technique represents a unified framework for supervised, unsupervised, and semisupervise




Fundamentals, Sensor Systems, Spectral Libraries, and Data Mining for Vegetation


Book Description

Written by leading global experts, including pioneers in the field, the four-volume set on Hyperspectral Remote Sensing of Vegetation, Second Edition, reviews existing state-of-the-art knowledge, highlights advances made in different areas, and provides guidance for the appropriate use of hyperspectral data in the study and management of agricultural crops and natural vegetation. Volume I, Fundamentals, Sensor Systems, Spectral Libraries, and Data Mining for Vegetation introduces the fundamentals of hyperspectral or imaging spectroscopy data, including hyperspectral data processes, sensor systems, spectral libraries, and data mining and analysis, covering both the strengths and limitations of these topics. This book also presents and discusses hyperspectral narrowband data acquired in numerous unique spectral bands in the entire length of the spectrum from various ground-based, airborne, and spaceborne platforms. The concluding chapter provides readers with useful guidance on the highlights and essence of Volume I through the editors’ perspective. Key Features of Volume I: Provides the fundamentals of hyperspectral remote sensing used in agricultural crops and vegetation studies. Discusses the latest advances in hyperspectral remote sensing of ecosystems and croplands. Develops online hyperspectral libraries, proximal sensing and phenotyping for understanding, modeling, mapping, and monitoring crop and vegetation traits. Implements reflectance spectroscopy of soils and vegetation. Enumerates hyperspectral data mining and data processing methods, approaches, and machine learning algorithms. Explores methods and approaches for data mining and overcoming data redundancy; Highlights the advanced methods for hyperspectral data processing steps by developing or implementing appropriate algorithms and coding the same for processing on a cloud computing platform like the Google Earth Engine. Integrates hyperspectral with other data, such as the LiDAR data, in the study of vegetation. Includes best global expertise on hyperspectral remote sensing of agriculture, crop water use, plant species detection, crop productivity and water productivity mapping, and modeling.




Computational Methods of Feature Selection


Book Description

Due to increasing demands for dimensionality reduction, research on feature selection has deeply and widely expanded into many fields, including computational statistics, pattern recognition, machine learning, data mining, and knowledge discovery. Highlighting current research issues, Computational Methods of Feature Selection introduces the




Feature Engineering for Machine Learning and Data Analytics


Book Description

Feature engineering plays a vital role in big data analytics. Machine learning and data mining algorithms cannot work without data. Little can be achieved if there are few features to represent the underlying data objects, and the quality of results of those algorithms largely depends on the quality of the available features. Feature Engineering for Machine Learning and Data Analytics provides a comprehensive introduction to feature engineering, including feature generation, feature extraction, feature transformation, feature selection, and feature analysis and evaluation. The book presents key concepts, methods, examples, and applications, as well as chapters on feature engineering for major data types such as texts, images, sequences, time series, graphs, streaming data, software engineering data, Twitter data, and social media data. It also contains generic feature generation approaches, as well as methods for generating tried-and-tested, hand-crafted, domain-specific features. The first chapter defines the concepts of features and feature engineering, offers an overview of the book, and provides pointers to topics not covered in this book. The next six chapters are devoted to feature engineering, including feature generation for specific data types. The subsequent four chapters cover generic approaches for feature engineering, namely feature selection, feature transformation based feature engineering, deep learning based feature engineering, and pattern based feature generation and engineering. The last three chapters discuss feature engineering for social bot detection, software management, and Twitter-based applications respectively. This book can be used as a reference for data analysts, big data scientists, data preprocessing workers, project managers, project developers, prediction modelers, professors, researchers, graduate students, and upper level undergraduate students. It can also be used as the primary text for courses on feature engineering, or as a supplement for courses on machine learning, data mining, and big data analytics.




Feature Selection for High-Dimensional Data


Book Description

This book offers a coherent and comprehensive approach to feature subset selection in the scope of classification problems, explaining the foundations, real application problems and the challenges of feature selection for high-dimensional data. The authors first focus on the analysis and synthesis of feature selection algorithms, presenting a comprehensive review of basic concepts and experimental results of the most well-known algorithms. They then address different real scenarios with high-dimensional data, showing the use of feature selection algorithms in different contexts with different requirements and information: microarray data, intrusion detection, tear film lipid layer classification and cost-based features. The book then delves into the scenario of big dimension, paying attention to important problems under high-dimensional spaces, such as scalability, distributed processing and real-time processing, scenarios that open up new and interesting challenges for researchers. The book is useful for practitioners, researchers and graduate students in the areas of machine learning and data mining.




Pattern Recognition


Book Description

This book constitutes the proceedings of the 11th Mexican Conference on Pattern Recognition, MCPR 2019, held in Querétaro, Mexico, in June 2019. The 40 papers presented in this volume were carefully reviewed and selected from 86 submissions. They were organized in topical sections named: artificial intelligence techniques and recognition; computer vision; industrial and medical applications of pattern recognition; image processing and analysis; pattern recognition techniques; signal processing and analysis; natural language, and processing and recognition.




Advances in Knowledge Discovery and Data Mining


Book Description

The two-volume set LNAI 8443 + LNAI 8444 constitutes the refereed proceedings of the 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2014, held in Tainan, Taiwan, in May 2014. The 40 full papers and the 60 short papers presented within these proceedings were carefully reviewed and selected from 371 submissions. They cover the general fields of pattern mining; social network and social media; classification; graph and network mining; applications; privacy preserving; recommendation; feature selection and reduction; machine learning; temporal and spatial data; novel algorithms; clustering; biomedical data mining; stream mining; outlier and anomaly detection; multi-sources mining; and unstructured data and text mining.




Computational Complexity


Book Description

Complex systems are systems that comprise many interacting parts with the ability to generate a new quality of collective behavior through self-organization, e.g. the spontaneous formation of temporal, spatial or functional structures. These systems are often characterized by extreme sensitivity to initial conditions as well as emergent behavior that are not readily predictable or even completely deterministic. The recognition that the collective behavior of the whole system cannot be simply inferred from an understanding of the behavior of the individual components has led to the development of numerous sophisticated new computational and modeling tools with applications to a wide range of scientific, engineering, and societal phenomena. Computational Complexity: Theory, Techniques and Applications presents a detailed and integrated view of the theoretical basis, computational methods, and state-of-the-art approaches to investigating and modeling of inherently difficult problems whose solution requires extensive resources approaching the practical limits of present-day computer systems. This comprehensive and authoritative reference examines key components of computational complexity, including cellular automata, graph theory, data mining, granular computing, soft computing, wavelets, and more.




Data Clustering


Book Description

Research on the problem of clustering tends to be fragmented across the pattern recognition, database, data mining, and machine learning communities. Addressing this problem in a unified way, Data Clustering: Algorithms and Applications provides complete coverage of the entire area of clustering, from basic methods to more refined and complex data clustering approaches. It pays special attention to recent issues in graphs, social networks, and other domains. The book focuses on three primary aspects of data clustering: Methods, describing key techniques commonly used for clustering, such as feature selection, agglomerative clustering, partitional clustering, density-based clustering, probabilistic clustering, grid-based clustering, spectral clustering, and nonnegative matrix factorization Domains, covering methods used for different domains of data, such as categorical data, text data, multimedia data, graph data, biological data, stream data, uncertain data, time series clustering, high-dimensional clustering, and big data Variations and Insights, discussing important variations of the clustering process, such as semisupervised clustering, interactive clustering, multiview clustering, cluster ensembles, and cluster validation In this book, top researchers from around the world explore the characteristics of clustering problems in a variety of application areas. They also explain how to glean detailed insight from the clustering process—including how to verify the quality of the underlying clusters—through supervision, human intervention, or the automated generation of alternative clusters.




Pattern Recognition


Book Description

This book constitutes the proceedings of the 13th Mexican Conference on Pattern Recognition, MCPR 2021, which was planned to be held in Mexico City, Mexico, in June 2021. The conference was instead held virtually. The 35 papers presented in this volume were carefully reviewed and selected from 75 submissions. They are organized in the following topical sections: artificial intelligence techniques and recognition; pattern recognition techniques; neural networks and deep learning; computer vision; image processing and analysis; and medical applications of pattern recognition.