High Efficiency Video Coding (HEVC)


Book Description

This book provides developers, engineers, researchers and students with detailed knowledge about the High Efficiency Video Coding (HEVC) standard. HEVC is the successor to the widely successful H.264/AVC video compression standard, and it provides around twice as much compression as H.264/AVC for the same level of quality. The applications for HEVC will not only cover the space of the well-known current uses and capabilities of digital video – they will also include the deployment of new services and the delivery of enhanced video quality, such as ultra-high-definition television (UHDTV) and video with higher dynamic range, wider range of representable color, and greater representation precision than what is typically found today. HEVC is the next major generation of video coding design – a flexible, reliable and robust solution that will support the next decade of video applications and ease the burden of video on world-wide network traffic. This book provides a detailed explanation of the various parts of the standard, insight into how it was developed, and in-depth discussion of algorithms and architectures for its implementation.




High Efficiency Video Coding and Other Emerging Standards


Book Description

High Efficiency Video Coding and Other Emerging Standards provides an overview of high efficiency video coding (HEVC) and all its extensions and profiles. There are nearly 300 projects and problems included, and about 400 references related to HEVC alone. Next generation video coding (NGVC) beyond HEVC is also described. Other video coding standards such as AVS2, DAALA, THOR, VP9 (Google), DIRAC, VC1, and AV1 are addressed, and image coding standards such as JPEG, JPEG-LS, JPEG2000, JPEG XR, JPEG XS, JPEG XT and JPEG-Pleno are also listed.Understanding of these standards and their implementation is facilitated by overview papers, standards documents, reference software, software manuals, test sequences, source codes, tutorials, keynote speakers, panel discussions, reflector and ftp/web sites – all in the public domain. Access to these categories is also provided.




MultiMedia Modeling


Book Description

The two-volume set LNCS 11961 and 11962 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2020, held in Daejeon, South Korea, in January 2020. Of the 171 submitted full research papers, 40 papers were selected for oral presentation and 46 for poster presentation; 28 special session papers were selected for oral presentation and 8 for poster presentation; in addition, 9 demonstration papers and 6 papers for the Video Browser Showdown 2020 were accepted. The papers of LNCS 11961 are organized in the following topical sections: audio and signal processing; coding and HVS; color processing and art; detection and classification; face; image processing; learning and knowledge representation; video processing; poster papers; the papers of LNCS 11962 are organized in the following topical sections: poster papers; AI-powered 3D vision; multimedia analytics: perspectives, tools and applications; multimedia datasets for repeatable experimentation; multi-modal affective computing of large-scale multimedia data; multimedia and multimodal analytics in the medical domain and pervasive environments; intelligent multimedia security; demo papers; and VBS papers.




Algorithms for Efficient and Fast 3D-HEVC Depth Map Encoding


Book Description

This book describes and analyzes in detail the encoding effort and the encoding tool usage applied to 3D-HEVC depth map coding. Based on the analyzed information, the authors introduce efficient algorithms for accelerating the available encoding tools. The contributions discussed in this book include four algorithms for reducing intra-frame encoding effort and three algorithms for reducing inter-frame encoding effort. The presented results demonstrate several levels of encoding effort reduction with different impacts in the encoding efficiency, surpassing state-of-the-art solutions by more than 50% the encoding effort with only 0.3% encoding efficiency loss.




Fuzzy Systems and Data Mining V


Book Description

The Fuzzy Systems and Data Mining (FSDM) conference is an annual event encompassing four main themes: fuzzy theory, algorithms and systems, which includes topics like stability, foundations and control; fuzzy application, which covers different kinds of processing as well as hardware and architectures for big data and time series and has wide applicability; the interdisciplinary field of fuzzy logic and data mining, encompassing applications in electrical, industrial, chemical and engineering fields as well as management and environmental issues; and data mining, outlining new approaches to big data, massive data, scalable, parallel and distributed algorithms. The annual conference provides a platform for knowledge exchange between international experts, researchers, academics and delegates from industry. This book includes the papers accepted and presented at the 5th International Conference on Fuzzy Systems and Data Mining (FSDM 2019), held in Kitakyushu, Japan on 18-21 October 2019. This year, FSDM received 442 submissions. All papers were carefully reviewed by program committee members, taking account of the quality, novelty, soundness, breadth and depth of the research topics falling within the scope of FSDM. The committee finally decided to accept 137 papers, which represents an acceptance rate of about 30%. The papers presented here are arranged in two sections: Fuzzy Sets and Data Mining, and Communications and Networks. Providing an overview of the most recent scientific and technological advances in the fields of fuzzy systems and data mining, the book will be of interest to all those working in these fields.




Advances in Intelligent Information Hiding and Multimedia Signal Processing


Book Description

This volume includes papers presented at IIH-MSP 2017, the 13th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, held from 12 to 15 August 2017 in Matsue, Shimane, Japan. The conference addresses topics ranging from information hiding and security, and multimedia signal processing and networking, to bio-inspired multimedia technologies and systems. This volume of Smart Innovation, Systems and Technologies focuses on subjects related to massive image/video compression and transmission for emerging networks, advances in speech and language processing, information hiding and signal processing for audio and speech signals, intelligent distribution systems and applications, recent advances in security and privacy for multimodal network environments, multimedia signal processing, and machine learning. Updated with the latest research outcomes and findings, the papers presented appeal to researchers and students who are interested in the corresponding fields.




2013 International Conference on Computer Science and Artificial Intelligence


Book Description

The main objective of ICCSAI2013 is to provide a platform for the presentation of top and latest research results in global scientific areas. The conference aims to provide a high level international forum for researcher, engineers and practitioners to present and discuss recent advances and new techniques in computer science and artificial intelligence. It also serves to foster communications among researcher, engineers and practitioners working in a common interest in improving computer science, artificial intelligence and the related fields. We have received 325 numbers of papers through "Call for Paper", out of which 94 numbers of papers were accepted for publication in the conference proceedings through double blind review process. The conference is designed to stimulate the young minds including Research Scholars, Academicians, and Practitioners to contribute their ideas, thoughts and nobility in these two disciplines.




Advances on Digital Television and Wireless Multimedia Communications


Book Description

This book constitutes the refereed proceedings of the 9th International Forum on Digital TV and Wireless Multimedia Communication, IFTC 2012, Shanghai, China, November. The 69 revised full papers presented were carefully reviewed and selected from numerous submissions. The papers are organized in topical sections on image processing and pattern recognition; image and video analysis; image quality assessment; text image and speech processing; content retrieval and security; source coding; multimedia communication; new advances in broadband multimedia; human computer interface; 3D video.




The H.264 Advanced Video Compression Standard


Book Description

H.264 Advanced Video Coding or MPEG-4 Part 10 is fundamental to a growing range of markets such as high definition broadcasting, internet video sharing, mobile video and digital surveillance. This book reflects the growing importance and implementation of H.264 video technology. Offering a detailed overview of the system, it explains the syntax, tools and features of H.264 and equips readers with practical advice on how to get the most out of the standard. Packed with clear examples and illustrations to explain H.264 technology in an accessible and practical way. Covers basic video coding concepts, video formats and visual quality. Explains how to measure and optimise the performance of H.264 and how to balance bitrate, computation and video quality. Analyses recent work on scalable and multi-view versions of H.264, case studies of H.264 codecs and new technological developments such as the popular High Profile extensions. An invaluable companion for developers, broadcasters, system integrators, academics and students who want to master this burgeoning state-of-the-art technology. "[This book] unravels the mysteries behind the latest H.264 standard and delves deeper into each of the operations in the codec. The reader can implement (simulate, design, evaluate, optimize) the codec with all profiles and levels. The book ends with extensions and directions (such as SVC and MVC) for further research." Professor K. R. Rao, The University of Texas at Arlington, co-inventor of the Discrete Cosine Transform




Soft Computing and Signal Processing


Book Description

This book presents selected research papers on current developments in the fields of soft computing and signal processing from the Third International Conference on Soft Computing and Signal Processing (ICSCSP 2020). The book covers topics such as soft sets, rough sets, fuzzy logic, neural networks, genetic algorithms and machine learning and discusses various aspects of these topics, e.g., technological considerations, product implementation and application issues.