Bridging the Semantic Gap in Image and Video Analysis


Book Description

This book presents cutting-edge research on various ways to bridge the semantic gap in image and video analysis. The respective chapters address different stages of image processing, revealing that the first step is a future extraction, the second is a segmentation process, the third is object recognition, and the fourth and last involve the semantic interpretation of the image. The semantic gap is a challenging area of research, and describes the difference between low-level features extracted from the image and the high-level semantic meanings that people can derive from the image. The result greatly depends on lower level vision techniques, such as feature selection, segmentation, object recognition, and so on. The use of deep models has freed humans from manually selecting and extracting the set of features. Deep learning does this automatically, developing more abstract features at the successive levels. The book offers a valuable resource for researchers, practitioners, students and professors in Computer Engineering, Computer Science and related fields whose work involves images, video analysis, image interpretation and so on.




Distributed Multimedia Databases


Book Description

In the last few years we have observed an explosive growth of multimedia computing, communication and applications. This revolution is transforming the way people live, work, and interact with each other, and is impacting the way business, government services, education, entertainment and healthcare are operating. Yet, several issues related to modeling, specification, analysis and design of distributed multimedia database systems and multimedia information retrieval are still challenging to both researchers and praclitioners. Distributed Multimedia Databases: Techniques and Applications points out these challenges and provides valuable suggestions toward the necessary solutions, by focusing on multimedia database techniques.




Concept-Based Video Retrieval


Book Description

In this paper, we review 300 references on video retrieval, indicating when text-only solutions are unsatisfactory and showing the promising alternatives which are in majority concept-based. Therefore, central to our discussion is the notion of a semantic concept: an objective linguistic description of an observable entity. Specifically, we present our view on how its automated detection, selection under uncertainty, and interactive usage might solve the major scientific problem for video retrieval: the semantic gap. To bridge the gap, we lay down the anatomy of a concept-based video search engine. We present a component-wise decomposition of such an interdisciplinary multimedia system, covering influences from information retrieval, computer vision, machine learning, and human-computer interaction. For each of the components we review state-of-the-art solutions in the literature, each having different characteristics and merits. Because of these differences, we cannot understand the progress in video retrieval without serious evaluation efforts such as carried out in the NIST TRECVID benchmark. We discuss its data, tasks, results, and the many derived community initiatives in creating annotations and baselines for repeatable experiments. We conclude with our perspective on future challenges and opportunities.




KI 2008: Advances in Artificial Intelligence


Book Description

KI 2008 was the 31st Annual German Conference on Arti?cial Intelligence held September 23–26 at the University of Kaiserslautern and the German Research Center for Arti?cial Intelligence DFKI GmbH in Kaiserslautern, Germany. The conference series started in 1975 with the German Workshop on AI (GWAI), which took place in Bonn, and represents the ?rst forum of its type for the German AI Community. Over the years AI has become a major ?eld in c- puter scienceinGermanyinvolvinga numberof successfulprojects thatreceived much international attention. Today KI conferences are international forums where participants from academia and industry from all over the world meet to exchange their recent research results and to discuss trends in the ?eld. Since 1993 the meeting has been called the “Annual German Conference on Arti?cial Intelligence,” designated by the German acronym KI. This volume contains the papers selected out of 77 submissions, including a number of submissions from outside German-speaking countries. In total, 15 submissions (19%) were accepted for oral and 30 (39%) for poster presentation. Oralpresentationsattheconferenceweresingletrack. Becauseofthis,thechoice of presentation form (oral, poster) was based on how well reviews indicated that the paper would ?t into one or the other format. The proceedings allocate the same space to both types of papers. In addition, we selected six papers that show high application potential - scribing systems or prototypical implementations of innovative AI technologies. They are also included in this volume as two-page extended abstracts.




Soft Computing Approach to Pattern Recognition and Image Processing


Book Description

This volume provides a collection of sixteen articles containing review and new material. In a unified way, they describe the recent development of theories and methodologies in pattern recognition, image processing and vision using fuzzy logic, artificial neural networks, genetic algorithms, rough sets and wavelets with significant real life applications. The book details the theory of granular computing and the role of a rough-neuro approach as a way of computing with words and designing intelligent recognition systems. It also demonstrates applications of the soft computing paradigm to case based reasoning, data mining and bio-informatics with a scope for future research. The contributors from around the world present a balanced mixture of current theory, algorithms and applications, making the book an extremely useful resource for students and researchers alike. Contents: Pattern Recognition: Multiple Classifier Systems; Building Decision Trees from the Fourier Spectrum of a Tree Ensemble; Clustering Large Data Sets; Multi-objective Variable String Genetic Classifier: Application to Remote Sensing Imagery; Image Processing and Vision: Dissimilarity Measures Between Fuzzy Sets or Fuzzy Structures; Early Vision: Concepts and Algorithms; Self-organizing Neural Network for Multi-level Image Segmentation; Geometric Transformation by Moment Method with Wavelet Matrix; New Computationally Efficient Algorithms for Video Coding; Soft Computing for Computational Media Aesthetics: Analyzing Video Content for Meaning; Granular Computing and Case Based Reasoning: Towards Granular Multi-agent Systems; Granular Computing and Pattern Recognition; Case Base Maintenance: A Soft Computing Perspective; Real Life Applications: Autoassociative Neural Network Models for Pattern Recognition Tasks in Speech and Image; Protein Structure Prediction Using Soft Computing; Pattern Classification for Biological Data Mining. Readership: Upper level undergraduates, graduates, researchers, academics and industrialists.




Semantic Multimedia


Book Description

This book constitutes the refereed proceedings of the Second International Conference on Semantics and Digital Media Technologies, SAMT 2007, held in Genoa, Italy, in December 2007. The conference brings together forums, projects, institutions and individuals investigating the integration of knowledge, semantics and low-level multimedia processing, including new emerging media and application areas. The papers are organized in topical sections.




Multimedia Information Retrieval


Book Description

Novel processing and searching tools for the management of new multimedia documents have developed. Multimedia Information Retrieval (MIR) is an organic system made up of Text Retrieval (TR); Visual Retrieval (VR); Video Retrieval (VDR); and Audio Retrieval (AR) systems. So that each type of digital document may be analysed and searched by the elements of language appropriate to its nature, search criteria must be extended. Such an approach is known as the Content Based Information Retrieval (CBIR), and is the core of MIR. This novel content-based concept of information handling needs to be integrated with more traditional semantics. Multimedia Information Retrieval focuses on the tools of processing and searching applicable to the content-based management of new multimedia documents. Translated from Italian by Giles Smith, the book is divided into two parts. Part one discusses MIR and related theories, and puts forward new methodologies; part two reviews various experimental and operating MIR systems, and presents technical and practical conclusions. - Gives a complete, organic picture of MIR and CBIR - Proposes a novel conceptualisation around the ideas of Information Retrieval (IR) and digital document management in the context of Library and Information Science (LIS) - Relevant for both library and information science and information technology specialists




Video Segmentation and Its Applications


Book Description

Video segmentation has become one of the core areas in visual signal processing research. The objective of Video Segmentation and Its Applications is to present the latest advances in video segmentation and analysis techniques while covering the theoretical approaches, real applications and methods being developed in the computer vision and video analysis community. The book will also provide researchers and practitioners a comprehensive understanding of state-of-the-art of video segmentation techniques and a resource for potential applications and successful practice.




Bridging the Semantic Gap


Book Description




Unlocking Artificial Intelligence


Book Description

This open access book provides a state-of-the-art overview of current machine learning research and its exploitation in various application areas. It has become apparent that the deep integration of artificial intelligence (AI) methods in products and services is essential for companies to stay competitive. The use of AI allows large volumes of data to be analyzed, patterns and trends to be identified, and well-founded decisions to be made on an informative basis. It also enables the optimization of workflows, the automation of processes and the development of new services, thus creating potential for new business models and significant competitive advantages. The book is divided in two main parts: First, in a theoretically oriented part, various AI/ML-related approaches like automated machine learning, sequence-based learning, deep learning, learning from experience and data, and process-aware learning are explained. In a second part, various applications are presented that benefit from the exploitation of recent research results. These include autonomous systems, indoor localization, medical applications, energy supply and networks, logistics networks, traffic control, image processing, and IoT applications. Overall, the book offers professionals and applied researchers an excellent overview of current exploitations, approaches, and challenges of AI/ML-related research.