Novel Motion Anchoring Strategies for Wavelet-based Highly Scalable Video Compression


Book Description

A key element of any modern video codec is the efficient exploitation of temporal redundancy via motion-compensated prediction. In this book, a novel paradigm of representing and employing motion information in a video compression system is described that has several advantages over existing approaches. Traditionally, motion is estimated, modelled, and coded as a vector field at the target frame it predicts. While this “prediction-centric” approach is convenient, the fact that the motion is “attached” to a specific target frame implies that it cannot easily be re-purposed to predict or synthesize other frames, which severely hampers temporal scalability. In light of this, the present book explores the possibility of anchoring motion at reference frames instead. Key to the success of the proposed “reference-based” anchoring schemes is high quality motion inference, which is enabled by the use of a more “physical” motion representation than the traditionally employed “block” motion fields. The resulting compression system can support computationally efficient, high-quality temporal motion inference, which requires half as many coded motion fields as conventional codecs. Furthermore, “features” beyond compressibility — including high scalability, accessibility, and “intrinsic” framerate upsampling — can be seamlessly supported. These features are becoming ever more relevant as the way video is consumed continues shifting from the traditional broadcast scenario to interactive browsing of video content over heterogeneous networks. This book is of interest to researchers and professionals working in multimedia signal processing, in particular those who are interested in next-generation video compression. Two comprehensive background chapters on scalable video compression and temporal frame interpolation make the book accessible for students and newcomers to the field.







A Block-based Scalable Motion Model for Highly Scalable Video Coding


Book Description

Scalable video coding has gained considerable attention during the past decade, due to its attractive features that efficiently support flexible transmission over heterogeneous networks and adaptive display on a wide range of devices. As coding efficiency is predominantly the governing principle of most video coding algorithms, scalable video coding thrives in incessantly improving efficiency through incorporating newly emerged technologies while preserving the scalable features. Motion scalability, being the main topic of this dissertation, is one of these contributive technologies. Motion scalability is based on the simple concept that different decoding scenarios require different motion prediction qualities in the optimized rate distortion sense. For example, lower decoding resolutions or bit rates usually demand lower motion prediction qualities in order to maintain a better balance between motion and texture coding. This concept, although simple, is not easily realizable in a practical scalable video codec. The error drifting effect introduced from quantized motion is the first problem to face, followed by the interactive issue with other scalabilities, the embedded coding of scalable motion, and the rate distortion optimized estimation algorithm for motion parameters. In this dissertation, we deal with these challenges and propose a block-based scalable motion model, which provides both motion structure and accuracy scalabilities in order to adapt to various decoding scenarios. Through the proposed model, rate distortion performance can be improved in the middle to low bit rate range. This accomplishment is jointly achieved by applying the proposed rate distortion optimized motion estimation algorithm at the encoder and the optimal motion quality selection algorithm at the bitstream extractor. Extensive simulations will be demonstrated based on a wavelet-based scalable video codec. These results verify the superiority of the proposed scalable motion solution over non-scalable ones.







Art of Digital Audio


Book Description

Described as "the most comprehensive book on digital audio to date", it is widely acclaimed as an industry "bible". Covering the very latest developments in digital audio technology, it provides an thorough introduction to the theory as well as acting as an authoritative and comprehensive professional reference source. Everything you need is here from the fundamental principles to the latest applications, written in an award-winning style with clear explanations from first principles. New material covered includes internet audio, PC audio technology, DVD, MPEG audio compression, digital audio broadcasting and audio networks. Whether you are in the field of audio engineering, sound recording, music technology, broadcasting and communications media or audio design and installation, this book has it all. Written by a leading international audio specialist, who conducts professional seminars and workshops around the world, the book has been road tested for many years by professional seminar attendees and students to ensure their needs are taken into account, and all the right information is covered. This new edition now includes: Internet audio PC Audio technology DVD MPEG Audio compression Digital Audio Broadcasting Audio networks Digital audio professionals will find everything they need here, from the fundamental principles to the latest applications, written in an award-winning style with clear explanations from first principles. John Watkinson is an international consultant in audio, video and data recording. He is a Fellow of the AES, a member of the British Computer Society and a chartered information systems practitioner. He presents lectures, seminars, conference papers and training courses worldwide. He is the author of many other Focal Press books, including: the Kraszna-Krausz award winning MPEG-2; The Art of Digital Audio; An Introduction to Digital Video; The Art of Sound Reproduction; An Introduction to Digital Audio; TV Fundamentals and Audio for Television. He is also co-author, with Francis Rumsey, of The Digital Interface Handbook, and contributor to the Loudspeaker and Headphone Handbook, 3rd edition.




High Dynamic Range Video


Book Description

At the time of rapid technological progress and uptake of High Dynamic Range (HDR) video content in numerous sectors, this book provides an overview of the key supporting technologies, discusses the effectiveness of various techniques, reviews the initial standardization efforts and explores new research directions in all aspects involved in HDR video systems. Topics addressed include content acquisition and production, tone mapping and inverse tone mapping operators, coding, quality of experience, and display technologies. This book also explores a number of applications using HDR video technologies in the automotive industry, medical imaging, spacecraft imaging, driving simulation and watermarking. By covering general to advanced topics, along with a broad and deep analysis, this book is suitable for both the researcher new or familiar to the area. With this book the reader will: Gain a broad understanding of all the elements in the HDR video processing chain Learn the most recent results of ongoing research Understand the challenges and perspectives for HDR video technologies Covers a broad range of topics encompassing the whole processing chain in HDR video systems, from acquisition to display Provides a comprehensive overview of this fast emerging topic Presents upcoming applications taking advantages of HDR




Fundamentals of Multimedia


Book Description

This textbook introduces the “Fundamentals of Multimedia”, addressing real issues commonly faced in the workplace. The essential concepts are explained in a practical way to enable students to apply their existing skills to address problems in multimedia. Fully revised and updated, this new edition now includes coverage of such topics as 3D TV, social networks, high-efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies. Features: presents an overview of the key concepts in multimedia, including color science; reviews lossless and lossy compression methods for image, video and audio data; examines the demands placed by multimedia communications on wired and wireless networks; discusses the impact of social media and cloud computing on information sharing and on multimedia content search and retrieval; includes study exercises at the end of each chapter; provides supplementary resources for both students and instructors at an associated website.




Computer Vision Metrics


Book Description

Computer Vision Metrics provides an extensive survey and analysis of over 100 current and historical feature description and machine vision methods, with a detailed taxonomy for local, regional and global features. This book provides necessary background to develop intuition about why interest point detectors and feature descriptors actually work, how they are designed, with observations about tuning the methods for achieving robustness and invariance targets for specific applications. The survey is broader than it is deep, with over 540 references provided to dig deeper. The taxonomy includes search methods, spectra components, descriptor representation, shape, distance functions, accuracy, efficiency, robustness and invariance attributes, and more. Rather than providing ‘how-to’ source code examples and shortcuts, this book provides a counterpoint discussion to the many fine opencv community source code resources available for hands-on practitioners.




Real-Time Image and Video Processing


Book Description

This book presents an overview of the guidelines and strategies for transitioning an image or video processing algorithm from a research environment into a real-time constrained environment. Such guidelines and strategies are scattered in the literature of various disciplines including image processing, computer engineering, and software engineering, and thus have not previously appeared in one place. By bringing these strategies into one place, the book is intended to serve the greater community of researchers, practicing engineers, industrial professionals, who are interested in taking an image or video processing algorithm from a research environment to an actual real-time implementation on a resource constrained hardware platform. These strategies consist of algorithm simplifications, hardware architectures, and software methods. Throughout the book, carefully selected representative examples from the literature are presented to illustrate the discussed concepts. After reading the book, the readers are exposed to a wide variety of techniques and tools, which they can then employ to design a real-time image or video processing system.




Biologically Inspired Robotics


Book Description

Robotic engineering inspired by biology—biomimetics—has many potential applications: robot snakes can be used for rescue operations in disasters, snake-like endoscopes can be used in medical diagnosis, and artificial muscles can replace damaged muscles to recover the motor functions of human limbs. Conversely, the application of robotics technology to our understanding of biological systems and behaviors—biorobotic modeling and analysis—provides unique research opportunities: robotic manipulation technology with optical tweezers can be used to study the cell mechanics of human red blood cells, a surface electromyography sensing system can help us identify the relation between muscle forces and hand movements, and mathematical models of brain circuitry may help us understand how the cerebellum achieves movement control. Biologically Inspired Robotics contains cutting-edge material—considerably expanded and with additional analysis—from the 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO). These 16 chapters cover both biomimetics and biorobotic modeling/analysis, taking readers through an exploration of biologically inspired robot design and control, micro/nano bio-robotic systems, biological measurement and actuation, and applications of robotics technology to biological problems. Contributors examine a wide range of topics, including: A method for controlling the motion of a robotic snake The design of a bionic fitness cycle inspired by the jaguar The use of autonomous robotic fish to detect pollution A noninvasive brain-activity scanning method using a hybrid sensor A rehabilitation system for recovering motor function in human hands after injury Human-like robotic eye and head movements in human–machine interactions A state-of-the-art resource for graduate students and researchers in the fields of control engineering, robotics, and biomedical engineering, this text helps readers understand the technology and principles in this emerging field.