The H.264 Advanced Video Compression Standard


Book Description

H.264 Advanced Video Coding or MPEG-4 Part 10 is fundamental to a growing range of markets such as high definition broadcasting, internet video sharing, mobile video and digital surveillance. This book reflects the growing importance and implementation of H.264 video technology. Offering a detailed overview of the system, it explains the syntax, tools and features of H.264 and equips readers with practical advice on how to get the most out of the standard. Packed with clear examples and illustrations to explain H.264 technology in an accessible and practical way. Covers basic video coding concepts, video formats and visual quality. Explains how to measure and optimise the performance of H.264 and how to balance bitrate, computation and video quality. Analyses recent work on scalable and multi-view versions of H.264, case studies of H.264 codecs and new technological developments such as the popular High Profile extensions. An invaluable companion for developers, broadcasters, system integrators, academics and students who want to master this burgeoning state-of-the-art technology. "[This book] unravels the mysteries behind the latest H.264 standard and delves deeper into each of the operations in the codec. The reader can implement (simulate, design, evaluate, optimize) the codec with all profiles and levels. The book ends with extensions and directions (such as SVC and MVC) for further research." Professor K. R. Rao, The University of Texas at Arlington, co-inventor of the Discrete Cosine Transform




Complexity-Aware High Efficiency Video Coding


Book Description

This book discusses computational complexity of High Efficiency Video Coding (HEVC) encoders with coverage extending from the analysis of HEVC compression efficiency and computational complexity to the reduction and scaling of its encoding complexity. After an introduction to the topic and a review of the state-of-the-art research in the field, the authors provide a detailed analysis of the HEVC encoding tools compression efficiency and computational complexity. Readers will benefit from a set of algorithms for scaling the computational complexity of HEVC encoders, all of which take advantage from the flexibility of the frame partitioning structures allowed by the standard. The authors also provide a set of early termination methods based on data mining and machine learning techniques, which are able to reduce the computational complexity required to find the best frame partitioning structures. The applicability of the proposed methods is finally exemplified with an encoding time control system that employs the best complexity reduction and scaling methods presented throughout the book. The methods presented in this book are especially useful in power-constrained, portable multimedia devices to reduce energy consumption and to extend battery life. They can also be applied to portable and non-portable multimedia devices operating in real time with limited computational resources.




International Proceedings on Advances in Soft Computing, Intelligent Systems and Applications


Book Description

The book focuses on the state-of-the-art technologies pertaining to advances in soft computing, intelligent system and applications. The Proceedings of ASISA 2016 presents novel and original work in soft computing, intelligent system and applications by the experts and budding researchers. These are the cutting edge technologies that have immense application in various fields. The papers discuss many real world complex problems that cannot be easily handled with traditional mathematical methods. The exact solution of the problems at hand can be achieved with soft computing techniques. Soft computing represents a collection of computational techniques inheriting inspiration from evolutionary algorithms, nature inspired algorithms, bio-inspired algorithms, neural networks and fuzzy logic.




Multimedia Networking and Coding


Book Description

Advances in multimedia communication systems have enhanced the need for improved video coding standards. Due to the inherent nature of video content, large bandwidths and reliable communication links are required to ensure a satisfactory level of quality experience; inspiring industry and research communities to concentrate their efforts in this emerging research area. Multimedia Networking and Coding covers widespread knowledge and research as well as innovative applications in multimedia communication systems. This book highlights recent techniques that can evolve into future multimedia communication systems, also showing experimental results from systems and applications.




Recent Advances on Video Coding


Book Description

This book is intended to attract the attention of practitioners and researchers from industry and academia interested in challenging paradigms of multimedia video coding, with an emphasis on recent technical developments, cross-disciplinary tools and implementations. Given its instructional purpose, the book also overviews recently published video coding standards such as H.264/AVC and SVC from a simulational standpoint. Novel rate control schemes and cross-disciplinary tools for the optimization of diverse aspects related to video coding are also addressed in detail, along with implementation architectures specially tailored for video processing and encoding. The book concludes by exposing new advances in semantic video coding. In summary: this book serves as a technically sounding start point for early-stage researchers and developers willing to join leading-edge research on video coding, processing and multimedia transmission.




Image Analysis and Processing – ICIAP 2022


Book Description

The proceedings set LNCS 13231, 13232, and 13233 constitutes the refereed proceedings of the 21st International Conference on Image Analysis and Processing, ICIAP 2022, which was held during May 23-27, 2022, in Lecce, Italy, The 168 papers included in the proceedings were carefully reviewed and selected from 307 submissions. They deal with video analysis and understanding; pattern recognition and machine learning; deep learning; multi-view geometry and 3D computer vision; image analysis, detection and recognition; multimedia; biomedical and assistive technology; digital forensics and biometrics; image processing for cultural heritage; robot vision; etc.




Digital Forensics and Watermarking


Book Description

This book constitutes the refereed proceedings of the 16th International Workshop on Digital Forensics and Watermarking, IWDW 2017, held in Magdeburg, Germany, in August 2017. The 30 papers presented in this volume were carefully reviewed and selected from 48 submissions. The contributions are covering the state-of-the-art theoretical and practical developments in the fields of digital watermarking, steganography and steganalysis, forensics and anti-forensics, visual cryptography, and other multimedia-related security issues. Also included are the papers on two special sessions on biometric image tampering detection and on emerging threats of criminal use of information hiding : usage scenarios and detection approaches.




Image Processing: Concepts, Methodologies, Tools, and Applications


Book Description

Advancements in digital technology continue to expand the image science field through the tools and techniques utilized to process two-dimensional images and videos. Image Processing: Concepts, Methodologies, Tools, and Applications presents a collection of research on this multidisciplinary field and the operation of multi-dimensional signals with systems that range from simple digital circuits to computers. This reference source is essential for researchers, academics, and students in the computer science, computer vision, and electrical engineering fields.




Advances in Multimedia Information Processing -- PCM 2010, Part II


Book Description

The 2010 Pacific-Rim Conference on Multimedia (PCM 2010) was held in Shanghai at Fudan University, during September 21–24, 2010. Since its inauguration in 2000, PCM has been held in various places around the Pacific Rim, namely Sydney (PCM 2000), Beijing (PCM 2001), Hsinchu (PCM 2002), Singapore (PCM 2003), Tokyo (PCM 2004), Jeju (PCM 2005), Zhejiang (PCM 2006), Hong Kong (PCM 2007), Tainan (PCM 2008), and Bangkok (PCM 2009). PCM is a major annual international conference organized as a forum for the dissemination of state-of-the-art technological advances and research results in the fields of theoretical, experimental, and applied multimedia analysis and processing. PCM 2010 featured a comprehensive technical program which included 75 oral and 56 poster presentations selected from 261 submissions from Australia, Canada, China, France, Germany, Hong Kong, India, Iran, Italy, Japan, Korea, Myanmar, Norway, Singapore, Taiwan, Thailand, the UK, and the USA. Three distinguished researchers, Prof. Zhi-Hua Zhou from Nanjing University, Dr. Yong Rui from Microsoft, and Dr. Tie-Yan Liu from Microsoft Research Asia delivered three keynote talks to the conference. We are very grateful to the many people who helped to make this conference a s- cess. We would like to especially thank Hong Lu for local organization, Qi Zhang for handling the publication of the proceedings, and Cheng Jin for looking after the c- ference website and publicity. We thank Fei Wu for organizing the special session on large-scale multimedia search in the social network settings.




Multimedia Analysis, Processing and Communications


Book Description

This book has brought 24 groups of experts and active researchers around the world together in image processing and analysis, video processing and analysis, and communications related processing, to present their newest research results, exchange latest experiences and insights, and explore future directions in these important and rapidly evolving areas. It aims at increasing the synergy between academic and industry professionals working in the related field. It focuses on the state-of-the-art research in various essential areas related to emerging technologies, standards and applications on analysis, processing, computing, and communication of multimedia information. The target audience of this book is researchers and engineers as well as graduate students working in various disciplines linked to multimedia analysis, processing and communications, e.g., computer vision, pattern recognition, information technology, image processing, and artificial intelligence. The book is also meant to a broader audience including practicing professionals working in image/video applications such as image processing, video surveillance, multimedia indexing and retrieval, and so on. We hope that the researchers, engineers, students and other professionals who read this book would find it informative, useful and inspirational toward their own work in one way or another.