Low Complexity Scalable Video Encoding


Book Description

The emerging Scalable Video Coding (SVC) extends the H.264/AVC video coding standard with new tools designed to efficiently support temporal, spatial and SNR scalability. In real-time multimedia systems, the coding performance of video encoders and decoders is limited by computational complexity. This thesis presents techniques to manage computational complexity of H.264/AVC and SVC video encoders. These techniques aim to provide significant complexity saving as well as a framework for efficient use of SVC. This thesis first investigates, experimentally, the computational complexity of MB coding mode decision in H.264/AVC video encoder. Based on machine learning techniques, complexity reduction algorithms are proposed. It is shown that these algorithms can reduce the computational complexity of Intra MB coding with negligible loss of video quality. Complexity reduction algorithms based on statistical classifiers are proposed for SVC encoder. It is shown that these algorithms can flexibly control the computational complexity in enhancement layers of SVC with negligible loss of video quality. The inherent relationship of MB mode decision in base and enhancement layers of SVC is investigated through experimental tests and a rate model function is proposed. An innovative fast mode decision model is developed to reduce the computational complexity by using the layer relationship along with the rate model function. We develop a general framework that applies to SVC and use this framework to adapt SVC bitstream by employing the low-complexity video encoding along with the input of video streaming constraints in order to adapt the bitstream. The proposed SVC based framework uses both objective low-complexity video encoding techniques and subjective saliency based video adaptation resulting in optimal use of network bandwidth. The approaches described in this thesis can not only reduce computational complexity of a video encoder, but also can manage the trade-off between complexity and distortion. These proposed algorithms are evaluated in terms of complexity reduction performance, rate-distortion performance and subjective and objective visual quality by experimental testing. The advantages and disadvantages of each algorithm are discussed.







Low Complexity H.264 Video Encoder Design Using Machine Learning Techniques


Book Description

H.264/AVC encoder complexity is mainly due to variable size in Intra and Inter frames. This makes H.264/AVC very difficult to implement, especially for real time applications and mobile devices. The current technological challenge is to conserve the compression capacity and quality that H.264 offers but reduce the encoding time and, therefore, the processing complexity. This thesis applies machine learning technique for video encoding mode decisions and investigates ways to improve the process of generating more general low complexity H.264/AVC video encoders. The proposed H.264 encoding method decreases the complexity in the mode decision inside the Inter frames. Results show, at least, a 150% average reduction of complexity and, at most, 0.6 average increases in PSNR for different kinds of videos and formats.




The H.264 Advanced Video Compression Standard


Book Description

H.264 Advanced Video Coding or MPEG-4 Part 10 is fundamental to a growing range of markets such as high definition broadcasting, internet video sharing, mobile video and digital surveillance. This book reflects the growing importance and implementation of H.264 video technology. Offering a detailed overview of the system, it explains the syntax, tools and features of H.264 and equips readers with practical advice on how to get the most out of the standard. Packed with clear examples and illustrations to explain H.264 technology in an accessible and practical way. Covers basic video coding concepts, video formats and visual quality. Explains how to measure and optimise the performance of H.264 and how to balance bitrate, computation and video quality. Analyses recent work on scalable and multi-view versions of H.264, case studies of H.264 codecs and new technological developments such as the popular High Profile extensions. An invaluable companion for developers, broadcasters, system integrators, academics and students who want to master this burgeoning state-of-the-art technology. "[This book] unravels the mysteries behind the latest H.264 standard and delves deeper into each of the operations in the codec. The reader can implement (simulate, design, evaluate, optimize) the codec with all profiles and levels. The book ends with extensions and directions (such as SVC and MVC) for further research." Professor K. R. Rao, The University of Texas at Arlington, co-inventor of the Discrete Cosine Transform




Protocols and Systems for Interactive Distributed Multimedia


Book Description

This book constitutes the refereed proceedings of the Joint International Workshops on Interactive Distributed Multimedia Systems and Protocols for Multimedia Systems, IDMS/PROMS 2002, held in Coimbra, Portugal in November 2002.The 30 revised full papers presented were carefully reviewed and selected from 112 submissions. The papers are organized in topical sections on performance of protocols and applications, mobile multimedia systems, standards and related issues, quality of service, video systems and applications, resource management, and multimedia support.




Multiple Access Communications


Book Description

This book constitutes the refereed proceedings of the 7th International Workshop on Multiple Access Communications, MACOM 2014, held in Halmstad, Sweden, in August 2014. The 12 full papers presented were carefully reviewed and selected from 22 submissions. They describe the latest advancements in the field of multiple access communications with an emphasis on reliability issues, physical layer techniques, cognitive radio, medium access control protocols, and video coding.




Complexity-Aware High Efficiency Video Coding


Book Description

This book discusses computational complexity of High Efficiency Video Coding (HEVC) encoders with coverage extending from the analysis of HEVC compression efficiency and computational complexity to the reduction and scaling of its encoding complexity. After an introduction to the topic and a review of the state-of-the-art research in the field, the authors provide a detailed analysis of the HEVC encoding tools compression efficiency and computational complexity. Readers will benefit from a set of algorithms for scaling the computational complexity of HEVC encoders, all of which take advantage from the flexibility of the frame partitioning structures allowed by the standard. The authors also provide a set of early termination methods based on data mining and machine learning techniques, which are able to reduce the computational complexity required to find the best frame partitioning structures. The applicability of the proposed methods is finally exemplified with an encoding time control system that employs the best complexity reduction and scaling methods presented throughout the book. The methods presented in this book are especially useful in power-constrained, portable multimedia devices to reduce energy consumption and to extend battery life. They can also be applied to portable and non-portable multimedia devices operating in real time with limited computational resources.




Complexity Optimized Video Codecs


Book Description

Inhaltsangabe:Abstract: We are facing an increasing bandwidth in the mobile systems and this opens up for new applications in a mobile terminal. It will be possible to download, record, send and receive images and videosequences. Even if we have more bandwidth, images and video data must be compressed before it can be sent, because of the amount of information it contains. MPEG-4 and H.263 are standards for compression of video data. The problem is that encoding and decoding algorithms are computationally intensive and complexity increases with the size of the video. In mobile applications, processing capabilities such as memory space and calculation time are limited and optimized algorithms for decoding and encoding are necessary. The question is if it is possible to encode raw video data with low complexity. Single frames e.g. from a digital camera, can then be coded and transmitted as a video sequence. On the other hand, the decoder needs to be able to handle sequences with different resolution. Thus, decoder in new mobile terminals must decode higher resolution sequences with the same complexity as low resolution video requires. The work will involve literature studies of MPEG-4 and H.263. The goal is to investigate the possibility to encode video data with low complexity and to find a way for optimized downscaling of larger sequences in a decoder. The work should include - Literature studies of MPEG-4 and H.263. - Theoretical study how CIF sequences (352x288-pixel) can be downscaled to QCIF (176x144-pixel) size. - Finding of optimized algorithms for a low complexity encoder. - Implementation of such an encoder in a microprocessor, e.g. a DSP. - Complexity analysis of processing consumption. Prerequisite experience is fair C-programming, signalprocessing skills and basic knowledge in H.263 and MPEG-4 is useful. New mobile communication standards provide an increased bandwidth, which opens up for many new media applications and services in future mobile phones. Video recording using the MMS standard, video conferencing and downloading of movies from the Internet are some of those applications. Even if the data rate is high, video data needs to be compressed using international video compression standards such as MPEG-4 or H.263. Efficient video compression algorithms are the focus of this thesis. Very limited computational capabilities of the terminals require low complexity encoder and decoder. A low complexity encoder for usage with [...]




Advanced Concepts for Intelligent Vision Systems


Book Description

This book constitutes the thoroughly refereed proceedings of the 15th International Conference on Advanced Concepts for Intelligent Vision Systems, ACIVS 2013, held in Poznań, Poland, in October 2013. The 63 revised full papers were carefully selected from 111 submissions. The topics covered are aquisition, pre-processing and coding, biometry, classification and recognition, depth, 3D and tracking, efficient implementation and frameworks, low level image analysis, segmentation and video analysis.




Encyclopedia of Multimedia


Book Description

This second edition provides easy access to important concepts, issues and technology trends in the field of multimedia technologies, systems, techniques, and applications. Over 1,100 heavily-illustrated pages — including 80 new entries — present concise overviews of all aspects of software, systems, web tools and hardware that enable video, audio and developing media to be shared and delivered electronically.