Parametric Time-Frequency Domain Spatial Audio


Book Description

A comprehensive guide that addresses the theory and practice of spatial audio This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in. Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming—covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book will teach readers the tools needed for such processing, and provides an overview to existing research. It also shows recent up-to-date projects and commercial applications built on top of the systems. Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies Includes contributions from leading researchers in the field Offers MATLAB codes with selected chapters An advanced book aimed at readers who are capable of digesting mathematical expressions about digital signal processing and sound field analysis, Parametric Time-frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.




Parametric Time-Frequency Domain Spatial Audio


Book Description

A comprehensive guide that addresses the theory and practice of spatial audio This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in. Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming—covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book will teach readers the tools needed for such processing, and provides an overview to existing research. It also shows recent up-to-date projects and commercial applications built on top of the systems. Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies Includes contributions from leading researchers in the field Offers MATLAB codes with selected chapters An advanced book aimed at readers who are capable of digesting mathematical expressions about digital signal processing and sound field analysis, Parametric Time-frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.




The Technology of Binaural Listening


Book Description

This book reports on the application of advanced models of the human binaural hearing system in modern technology, among others, in the following areas: binaural analysis of aural scenes, binaural de-reverberation, binaural quality assessment of audio channels, loudspeakers and performance spaces, binaural perceptual coding, binaural processing in hearing aids and cochlea implants, binaural systems in robots, binaural/tactile human-machine interfaces, speech-intelligibility prediction in rooms and/or multi-speaker scenarios. An introduction to binaural modeling and an outlook to the future are provided. Further, the book features a MATLAB toolbox to enable readers to construct their own dedicated binaural models on demand.




Ambisonics


Book Description

This open access book provides a concise explanation of the fundamentals and background of the surround sound recording and playback technology Ambisonics. It equips readers with the psychoacoustical, signal processing, acoustical, and mathematical knowledge needed to understand the inner workings of modern processing utilities, special equipment for recording, manipulation, and reproduction in the higher-order Ambisonic format. The book comes with various practical examples based on free software tools and open scientific data for reproducible research. The book’s introductory section offers a perspective on Ambisonics spanning from the origins of coincident recordings in the 1930s to the Ambisonic concepts of the 1970s, as well as classical ways of applying Ambisonics in first-order coincident sound scene recording and reproduction that have been practiced since the 1980s. As, from time to time, the underlying mathematics become quite involved, but should be comprehensive without sacrificing readability, the book includes an extensive mathematical appendix. The book offers readers a deeper understanding of Ambisonic technologies, and will especially benefit scientists, audio-system and audio-recording engineers. In the advanced sections of the book, fundamentals and modern techniques as higher-order Ambisonic decoding, 3D audio effects, and higher-order recording are explained. Those techniques are shown to be suitable to supply audience areas ranging from studio-sized to hundreds of listeners, or headphone-based playback, regardless whether it is live, interactive, or studio-produced 3D audio material.




Communication Acoustics


Book Description

In communication acoustics, the communication channel consists of a sound source, a channel (acoustic and/or electric) and finally the receiver: the human auditory system, a complex and intricate system that shapes the way sound is heard. Thus, when developing techniques in communication acoustics, such as in speech, audio and aided hearing, it is important to understand the time–frequency–space resolution of hearing. This book facilitates the reader’s understanding and development of speech and audio techniques based on our knowledge of the auditory perceptual mechanisms by introducing the physical, signal-processing and psychophysical background to communication acoustics. It then provides a detailed explanation of sound technologies where a human listener is involved, including audio and speech techniques, sound quality measurement, hearing aids and audiology. Key features: Explains perceptually-based audio: the authors take a detailed but accessible engineering perspective on sound and hearing with a focus on the human place in the audio communications signal chain, from psychoacoustics and audiology to optimizing digital signal processing for human listening. Presents a wide overview of speech, from the human production of speech sounds and basics of phonetics to major speech technologies, recognition and synthesis of speech and methods for speech quality evaluation. Includes MATLAB examples that serve as an excellent basis for the reader’s own investigations into communication acoustics interaction schemes which intuitively combine touch, vision and voice for lifelike interactions.




Time-Frequency Signal Analysis and Processing


Book Description

Time-Frequency Signal Analysis and Processing (TFSAP) is a collection of theory, techniques and algorithms used for the analysis and processing of non-stationary signals, as found in a wide range of applications including telecommunications, radar, and biomedical engineering. This book gives the university researcher and R&D engineer insights into how to use TFSAP methods to develop and implement the engineering application systems they require. New to this edition: - New sections on Efficient and Fast Algorithms; a "Getting Started" chapter enabling readers to start using the algorithms on simulated and real examples with the TFSAP toolbox, compare the results with the ones presented in the book and then insert the algorithms in their own applications and adapt them as needed. - Two new chapters and twenty three new sections, including updated references. - New topics including: efficient algorithms for optimal TFDs (with source code), the enhanced spectrogram, time-frequency modelling, more mathematical foundations, the relationships between QTFDs and Wavelet Transforms, new advanced applications such as cognitive radio, watermarking, noise reduction in the time-frequency domain, algorithms for Time-Frequency Image Processing, and Time-Frequency applications in neuroscience (new chapter). - A comprehensive tutorial introduction to Time-Frequency Signal Analysis and Processing (TFSAP), accessible to anyone who has taken a first course in signals - Key advances in theory, methodology and algorithms, are concisely presented by some of the leading authorities on the respective topics - Applications written by leading researchers showing how to use TFSAP methods




Principles and Applications of Spatial Hearing


Book Description

Section 3. Capturing and controlling the spatial sound field. A study on 3D sound image control by two loudspeakers located in the transverse plane / K. Iida, T. Ishii, and Y. Ishii. Selective listening point audio based on blind signal separation and 3D audio effect / T. Nishino [und weitere]. Selective listening point audio based on blind signal separation and 3D audio effect / T. Nishino. Sweet spot size in virtual sound reproduction : A temporal analysis / Y. Lacouture Parodi and P. Rubak. Psychoacoustic evaluation of different methods for creating individualized, headphone-presented virtual auditory space from B-format room impulse responses / A. Kan, C. Jin, and A. van Schaik. Effects of microphone arrangements on the accuracy of a spherical microphone array (SENZI) in acquiring high-definition 3D sound space information / J. Kodama [und weitere]. Perception-based reproduction of spatial sound with directional audio coding / V. Pulkki [und weitere]. Capturing and recreating auditory virtual reality / R. Duraiswami [und weitere]. Reconstructing sound source directivity in virtual acoustic environments / M. Noisternig, F. Zotter, and B.F.G. Katz. Implementation of real-time room auralization using a surrounding loudspeaker array / T. Okamoto [und weitere]. Spatialisation in audio augmented reality using finger snaps / H. Gamper and T. Lokki. Generation of sound ball : Its theory and implementation / Y.-H. Kim [und weitere]. Estimation of high-resolution sound properties for realizing an editable sound-space system / T. Okamoto, Y. Iwaya, and Y. Suzuki -- Section 4. Applying virtual sound techniques in the real world. Binaural hearing assistance system based on frequency domain binaural model / T. Usagawa and Y. Chisaki. A spatial auditory display for telematic music performances / J. Braasch [und weitere]. Auditory orientation training system developed for blind people using PC-based wide-range 3-D sound technology / Y. Seki [und weitere]. Mapping musical scales onto virtual 3D spaces / J. Villegas and M. Cohen. Sonifying head-related transfer unctions / D. Cabrera and W.L. Martens. Effects of spatial cues on detectability of alarm signals in noisy environments / N. Kuroda [und weitere]. Binaural technique for active noise control assessment / Y. Watanabe and H. Hamada




Analytic Methods of Sound Field Synthesis


Book Description

This book puts the focus on serving human listeners in the sound field synthesis although the approach can be also exploited in other applications such as underwater acoustics or ultrasonics. The author derives a fundamental formulation based on standard integral equations and the single-layer potential approach is identified as a useful tool in order to derive a general solution. He also proposes extensions to the single-layer potential approach which allow for a derivation of explicit solutions for circular, planar, and linear distributions of secondary sources. Based on above described formulation it is shown that the two established analytical approaches of Wave Field Synthesis and Near-field Compensated Higher Order Ambisonics constitute specific solutions to the general problem which are covered by the single-layer potential solution and its extensions.




Window Functions and Their Applications in Signal Processing


Book Description

Window functions—otherwise known as weighting functions, tapering functions, or apodization functions—are mathematical functions that are zero-valued outside the chosen interval. They are well established as a vital part of digital signal processing. Window Functions and their Applications in Signal Processing presents an exhaustive and detailed account of window functions and their applications in signal processing, focusing on the areas of digital spectral analysis, design of FIR filters, pulse compression radar, and speech signal processing. Comprehensively reviewing previous research and recent developments, this book: Provides suggestions on how to choose a window function for particular applications Discusses Fourier analysis techniques and pitfalls in the computation of the DFT Introduces window functions in the continuous-time and discrete-time domains Considers two implementation strategies of window functions in the time- and frequency domain Explores well-known applications of window functions in the fields of radar, sonar, biomedical signal analysis, audio processing, and synthetic aperture radar




Audio Source Separation and Speech Enhancement


Book Description

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: Consolidated perspective on audio source separation and speech enhancement. Both historical perspective and latest advances in the field, e.g. deep neural networks. Diverse disciplines: array processing, machine learning, and statistical signal processing. Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.