Selected Contributions in Data Analysis and Classification


Book Description

This volume presents recent methodological developments in data analysis and classification. It covers a wide range of topics, including methods for classification and clustering, dissimilarity analysis, consensus methods, conceptual analysis of data, and data mining and knowledge discovery in databases. The book also presents a wide variety of applications, in fields such as biology, micro-array analysis, cyber traffic, and bank fraud detection.




Classification and Data Analysis


Book Description

This volume gathers peer-reviewed contributions on data analysis, classification and related areas presented at the 28th Conference of the Section on Classification and Data Analysis of the Polish Statistical Association, SKAD 2019, held in Szczecin, Poland, on September 18–20, 2019. Providing a balance between theoretical and methodological contributions and empirical papers, it covers a broad variety of topics, ranging from multivariate data analysis, classification and regression, symbolic (and other) data analysis, visualization, data mining, and computer methods to composite measures, and numerous applications of data analysis methods in economics, finance and other social sciences. The book is intended for a wide audience, including researchers at universities and research institutions, graduate and doctoral students, practitioners, data scientists and employees in public statistical institutions.




Data Analysis, Classification, and Related Methods


Book Description

This volume contains a selection of papers presented at the Seven~h Confer ence of the International Federation of Classification Societies (IFCS-2000), which was held in Namur, Belgium, July 11-14,2000. From the originally sub mitted papers, a careful review process involving two reviewers per paper, led to the selection of 65 papers that were considered suitable for publication in this book. The present book contains original research contributions, innovative ap plications and overview papers in various fields within data analysis, classifi cation, and related methods. Given the fast publication process, the research results are still up-to-date and coincide with their actual presentation at the IFCS-2000 conference. The topics captured are: • Cluster analysis • Comparison of clusterings • Fuzzy clustering • Discriminant analysis • Mixture models • Analysis of relationships data • Symbolic data analysis • Regression trees • Data mining and neural networks • Pattern recognition • Multivariate data analysis • Robust data analysis • Data science and sampling The IFCS (International Federation of Classification Societies) The IFCS promotes the dissemination of technical and scientific information data analysis, classification, related methods, and their applica concerning tions.




Data Analysis


Book Description

The first part of this book is devoted to methods seeking relevant dimensions of data. The variables thus obtained provide a synthetic description which often results in a graphical representation of the data. After a general presentation of the discriminating analysis, the second part is devoted to clustering methods which constitute another method, often complementary to the methods described in the first part, to synthesize and to analyze the data. The book concludes by examining the links existing between data mining and data analysis.




Clustering Methodology for Symbolic Data


Book Description

Covers everything readers need to know about clustering methodology for symbolic data—including new methods and headings—while providing a focus on multi-valued list data, interval data and histogram data This book presents all of the latest developments in the field of clustering methodology for symbolic data—paying special attention to the classification methodology for multi-valued list, interval-valued and histogram-valued data methodology, along with numerous worked examples. The book also offers an expansive discussion of data management techniques showing how to manage the large complex dataset into more manageable datasets ready for analyses. Filled with examples, tables, figures, and case studies, Clustering Methodology for Symbolic Data begins by offering chapters on data management, distance measures, general clustering techniques, partitioning, divisive clustering, and agglomerative and pyramid clustering. Provides new classification methodologies for histogram valued data reaching across many fields in data science Demonstrates how to manage a large complex dataset into manageable datasets ready for analysis Features very large contemporary datasets such as multi-valued list data, interval-valued data, and histogram-valued data Considers classification models by dynamical clustering Features a supporting website hosting relevant data sets Clustering Methodology for Symbolic Data will appeal to practitioners of symbolic data analysis, such as statisticians and economists within the public sectors. It will also be of interest to postgraduate students of, and researchers within, web mining, text mining and bioengineering.




Classification, Data Analysis, and Knowledge Organization


Book Description

In science, industry, public administration and documentation centers large amounts of data and information are collected which must be analyzed, ordered, visualized, classified and stored efficiently in order to be useful for practical applications. This volume contains 50 selected theoretical and applied papers presenting a wealth of new and innovative ideas, methods, models and systems which can be used for this purpose. It combines papers and strategies from two main streams of research in an interdisciplinary, dynamic and exciting way: On the one hand, mathematical and statistical methods are described which allow a quantitative analysis of data, provide strategies for classifying objects or making exploratory searches for interesting structures, and give ways to make comprehensive graphical displays of large arrays of data. On the other hand, papers related to information sciences, informatics and data bank systems provide powerful tools for representing, modelling, storing and retrieving facts, data and knowledge characterized by qualitative descriptors, semantic relations, or linguistic concepts. The integration of both fields and a special part on applied problems from biology, medicine, archeology, industry and administration assure that this volume will be informative and useful for theory and practice.




Foundations and Methods in Combinatorial and Statistical Data Analysis and Clustering


Book Description

This book offers an original and broad exploration of the fundamental methods in Clustering and Combinatorial Data Analysis, presenting new formulations and ideas within this very active field. With extensive introductions, formal and mathematical developments and real case studies, this book provides readers with a deeper understanding of the mutual relationships between these methods, which are clearly expressed with respect to three facets: logical, combinatorial and statistical. Using relational mathematical representation, all types of data structures can be handled in precise and unified ways which the author highlights in three stages: Clustering a set of descriptive attributes Clustering a set of objects or a set of object categories Establishing correspondence between these two dual clusterings Tools for interpreting the reasons of a given cluster or clustering are also included. Foundations and Methods in Combinatorial and Statistical Data Analysis and Clustering will be a valuable resource for students and researchers who are interested in the areas of Data Analysis, Clustering, Data Mining and Knowledge Discovery.




Statistical Models for Data Analysis


Book Description

The papers in this book cover issues related to the development of novel statistical models for the analysis of data. They offer solutions for relevant problems in statistical data analysis and contain the explicit derivation of the proposed models as well as their implementation. The book assembles the selected and refereed proceedings of the biannual conference of the Italian Classification and Data Analysis Group (CLADAG), a section of the Italian Statistical Society. ​




Analysis of Distributional Data


Book Description

In a time when increasingly larger and complex data collections are being produced, it is clear that new and adaptive forms of data representation and analysis have to be conceived and implemented. Distributional data, i.e., data where a distribution rather than a single value is recorded for each descriptor, on each unit, come into this framework. Distributional data may result from the aggregation of large amounts of open/collected/generated data, or it may be directly available in a structured or unstructured form, describing the variability of some features. This book provides models and methods for the representation, analysis, interpretation, and organization of distributional data, taking into account its specific nature, and not relying on a reduction to single values, to be conform to classical paradigms. Conceived as an edited book, gathering contributions from multiple authors, the book presents alternative representations and analysis’ methods for distributional data of different types, and in particular, -Uni- and bi-variate descriptive statistics for distributional data -Clustering and classification methodologies -Methods for the representation in low-dimensional spaces -Regression models and forecasting approaches for distribution-valued variables Furthermore, the different chapters -Feature applications to show how the proposed methods work in practice, and how results are to be interpreted, -Often provide information about available software. The methodologies presented in this book constitute cutting-edge developments for stakeholders from all domains who produce and analyse large amounts of complex data, to be analysed in the form of distributions. The book is hence of interest for companies operating not only in the area of data analytics, but also on logistics, energy and finance. It also concerns national statistical institutes and other institutions at European and international level, where microdata is aggregated to preserve confidentiality and allow for analysis at the appropriate regional level. Academics will find in the analysis of distributional data a challenging up-to-date field of research.




Selected Contributions in Data Analysis and Classification


Book Description

This volume presents recent methodological developments in data analysis and classification. It covers a wide range of topics, including methods for classification and clustering, dissimilarity analysis, consensus methods, conceptual analysis of data, and data mining and knowledge discovery in databases. The book also presents a wide variety of applications, in fields such as biology, micro-array analysis, cyber traffic, and bank fraud detection.