Measurement and Modelling of Head-related Transfer Function for Spatial Audio Synthesis


Book Description

There has been a growing interest in spatial sound generation arising from the development of new communications and media technologies. Binaural spatial sound systems are capable of encoding and rendering sound sources accurately in three dimensional space using only two recording/playback channels. This is based on the concept of the Head-Related Transfer Function (HRTF), which is a set of acoustic filters from the sound source to a listener's eardrums and contains all the listening cues used by the hearing mechanism for decoding spatial information encoded in binaural signals. The HRTF is usually obtained from acoustic measurements on different persons. In the case of discrete data and sets of measurements corresponding to different human subjects, it is desirable to have a continuous functional representation of the HRTF for efficiently rendering moving sounds in the virtual spatial audio systems; further this representation should be well-suited for customization to an individual listener. In this thesis, modal analysis is applied to examine the HRTF data structure, that is to employ the wave equation solutions to expand the HRTF with separable basis functions. This leads to a general representation of the HRTF into separated spatial and spectral components, where the spatial basis functions modes account for the HRTF spatial variations and the remaining HRTF spectral components provide a new means to examine the human body scattering behavior. The general model is further developed into the HRTF continuous functional representations. We use the normalized spatial modes to link near-field and far-field HRTFs directly, which provides a way to obtain the HRTFs at different ranges from measurements conducted at only a single range. The spatially invariant HRTF spectral components are represented continuously using an orthogonal series. Both spatial and spectral basis functions are well known functions, thus the developed analytical model can be used to easily examine the HRTF data feature-individualization. An important finding of this thesis is that the HRTF decomposition with the spatial basis functions can be well approximated by a finite number, which is defined as the HRTF spatial dimensionality. The dimensionality determines the least number of the HRTF measurements in space. We perform high resolution HRTF measurements on a KEMAR mannequin in a semi-anechoic acoustic chamber. Both signal processing aspects to extract HRTFs from the raw measurements and a practical high resolution spatial sampling scheme have been given in this thesis.




Anthropometric Individualization of Head-Related Transfer Functions Analysis and Modeling


Book Description

Human sound localization helps to pay attention to spatially separated speakers using interaural level and time differences as well as angle-dependent monaural spectral cues. In a monophonic teleconference, for instance, it is much more difficult to distinguish between different speakers due to missing binaural cues. Spatial positioning of the speakers by means of binaural reproduction methods using head-related transfer functions (HRTFs) enhances speech comprehension. These HRTFs are influenced by the torso, head and ear geometry as they describe the propagation path of the sound from a source to the ear canal entrance. Through this geometry-dependency, the HRTF is directional and subject-dependent. To enable a sufficient reproduction, individual HRTFs should be used. However, it is tremendously difficult to measure these HRTFs. For this reason this thesis proposes approaches to adapt the HRTFs applying individual anthropometric dimensions of a user. Since localization at low frequencies is mainly influenced by the interaural time difference, two models to adapt this difference are developed and compared with existing models. Furthermore, two approaches to adapt the spectral cues at higher frequencies are studied, improved and compared. Although the localization performance with individualized HRTFs is slightly worse than with individual HRTFs, it is nevertheless still better than with non-individual HRTFs, taking into account the measurement effort.




Head-Related Transfer Function and Acoustic Virtual Reality


Book Description

This book covers all aspects of head-related transfer function (HRTF), from the fundamentals through to the latest applications, such as 3D sound systems. An introductory chapter defines HRTF, describes the coordinate system used in the book, and presents the most recent research achievements in the field. HRTF and sound localization in the horizontal and median planes are then explained, followed by discussion of individual differences in HRTF, solutions to this individuality (personalization of HRTF), and methods of sound image control for an arbitrary 3D direction, encompassing both classic theory and state of the art data. The relations between HRTF and sound image distance and between HRTF and speech intelligibility are fully explored, and measurement and signal processing methods for HRTF are examined in depth. Here, supplementary material is provided to enable readers to measure and analyze HRTF by themselves. In addition, some typical HRTF databases are compared. The final two chapters are devoted to the principles and applications of acoustic virtual reality. This clearly written book will be ideal for all who wish to learn about HRTF and how to use it in their research.




Head-Related Transfer Function and Virtual Auditory Display


Book Description

This book systematically details the basic principles and applications of head-related transfer function (HRTF) and virtual auditory display (VAD), and reviews the latest developments in the field, especially those from the author’s own state-of-the-art research group. Head-Related Transfer Function and Virtual Auditory Display covers binaural hearing and the basic principles, experimental measurements, computation, physical characteristics analyses, filter design, and customization of HRTFs. It also details the principles and applications of VADs, including headphone and loudspeaker-based binaural reproduction, virtual reproduction of stereophonic and multi-channel surround sound, binaural room simulation, rendering systems for dynamic and real-time virtual auditory environments, psychoacoustic evaluation and validation of VADs, and a variety of applications of VADs. This guide provides all the necessary knowledge and latest results for researchers, graduate students, and engineers who work in the field of HRTF and VAD.




Fast Measurement of Individual Head-related Transfer Functions


Book Description

While binaural technology applications gained in popularity in recent years, the majority of applications still use non-individual Head-Related Transfer Functions (HRTFs) from artificial heads. However, certain applications, for example research of spatial hearing or hearing attention, require an physically exact and realistic binaural signal. The limiting factor that prohibits the widespread use of individual HRTFs is the acquisition time of such data. This time requirement has recently been reduced by the use of parallelization in the measurement signal which lead to the development of fast measurement systems capable of acquiring individual and spatially dense HRTF. This thesis provides a objective and subjective evaluation of such a system that is designed with the goal of little disturbance of the measurements in mind. The construction is detailed, followed by both an objective and subjective evaluation. A detailed investigation into additional distortion of the sound field introduced by the system itself is presented and it is shown that the system performs comparably to a conventional system in terms of sound source localization. Furthermore, a method is introduced and evaluated to further reduce the measurement time by using continuous rotation during the measurement. This method is used to reduced the measurement duration from eight minutes to three minutes without audible differences.




Spatial Audio Processing


Book Description

This book collects a wealth of information about spatial audio coding into one comprehensible volume. It is a thorough reference to the 3GPP and MPEG Parametric Stereo standards and the MPEG Surround multi-channel audio coding standard. It describes key developments in coding techniques, which is an important factor in the optimization of advanced entertainment, communications and signal processing applications. Until recently, technologies for coding audio signals, such as redundancy reduction and sophisticated source and receiver models did not incorporate spatial characteristics of source and receiving ends. Spatial audio coding achieves much higher compression ratios than conventional coders. It does this by representing multi-channel audio signals as a downmix signal plus side information that describes the perceptually-relevant spatial information. Written by experts in spatial audio coding, Spatial Audio Processing: reviews psychoacoustics (the relationship between physical measures of sound and the corresponding percepts) and spatial audio sound formats and reproduction systems; brings together the processing, acquisition, mixing, playback, and perception of spatial audio, with the latest coding techniques; analyses algorithms for the efficient manipulation of multiple, discrete and combined spatial audio channels, including both MP3 and MPEG Surround; shows how the same insights on source and receiver models can also be applied for manipulation of audio signals, such as the synthesis of virtual auditory scenes employing head-related transfer function (HRTF) processing and stereo to N-channel audio upmix. Audio processing research engineers and audio coding research and implementation engineers will find this an insightful guide. Academic audio and psychoacoustic researchers, including post-graduate and third/fourth year students taking courses in signal processing, audio and speech processing, and telecommunications, will also benefit from the information inside.




Principles And Applications Of Spatial Hearing


Book Description

Humans possess a remarkable ability to extract rich three-dimensional information about sound environments simply by analyzing the acoustic signals they receive at their two ears. Research in spatial hearing has evolved from a theoretical discipline studying the basic mechanisms of hearing to a technical discipline focused on designing and implementing increasingly sophisticated spatial auditory display systems. This book contains 39 chapters representing the current state-of-the-art in spatial audio research selected from papers presented in Sendai, Japan, at the First International Workshop on the Principles and Applications of Spatial Hearing.




Advances in Fundamental and Applied Research on Spatial Audio


Book Description

Spatial audio is a dynamic and rapidly evolving field, as it is closely linked to advances in computer technology and digital signal processing. The democratization of virtual reality hardware available as consumer devices has moved the field further out of traditional laboratory research, and directly into applied research targeting a wide range of consumers. Advances in Fundamental and Applied Research on Spatial Audio presents a collection of eight peer-reviewed chapters on this exciting area of research. The contributions are organized into three sections: “Acoustic Methodology”, “Perception”, and “Applications”, and cover a range of topics, addressing both headphone- and loudspeaker-based reproductions, offering both methodological overviews and specific case studies.







Virtual Reality


Book Description

This book constitutes the refereed proceedings of the Second International Conference on Virtual Reality, ICVR 2007, held in Beijing, China. It covers 3D rendering and visualization, interacting and navigating in virtual and augmented environments, industrial applications of virtual reality, as well as health, cultural, educational and entertainment applications.