Theory and Applications of Spherical Microphone Array Processing


Book Description

This book presents the signal processing algorithms that have been developed to process the signals acquired by a spherical microphone array. Spherical microphone arrays can be used to capture the sound field in three dimensions and have received significant interest from researchers and audio engineers. Algorithms for spherical array processing are different to corresponding algorithms already known in the literature of linear and planar arrays because the spherical geometry can be exploited to great beneficial effect. The authors aim to advance the field of spherical array processing by helping those new to the field to study it efficiently and from a single source, as well as by offering a way for more experienced researchers and engineers to consolidate their understanding, adding either or both of breadth and depth. The level of the presentation corresponds to graduate studies at MSc and PhD level. This book begins with a presentation of some of the essential mathematical and physical theory relevant to spherical microphone arrays, and of an acoustic impulse response simulation method, which can be used to comprehensively evaluate spherical array processing algorithms in reverberant environments. The chapter on acoustic parameter estimation describes the way in which useful descriptions of acoustic scenes can be parameterized, and the signal processing algorithms that can be used to estimate the parameter values using spherical microphone arrays. Subsequent chapters exploit these parameters including in particular measures of direction-of-arrival and of diffuseness of a sound field. The array processing algorithms are then classified into two main classes, each described in a separate chapter. These are signal-dependent and signal-independent beamforming algorithms. Although signal-dependent beamforming algorithms are in theory able to provide better performance compared to the signal-independent algorithms, they are currently rarely used in practice. The main reason for this is that the statistical information required by these algorithms is difficult to estimate. In a subsequent chapter it is shown how the estimated acoustic parameters can be used in the design of signal-dependent beamforming algorithms. This final step closes, at least in part, the gap between theory and practice.




Fundamentals of Spherical Array Processing


Book Description

This book provides a comprehensive introduction to the theory and practice of spherical microphone arrays, and was written for graduate students, researchers and engineers who work with spherical microphone arrays in a wide range of applications. The new edition includes additions and modifications, and references supplementary Matlab code to provide the reader with a straightforward start for own implementations. The book is also accompanied by a Matlab manual, which explains how to implement the examples and simulations presented in the book. The first two chapters provide the reader with the necessary mathematical and physical background, including an introduction to the spherical Fourier transform and the formulation of plane-wave sound fields in the spherical harmonic domain. In turn, the third chapter covers the theory of spatial sampling, employed when selecting the positions of microphones to sample sound pressure functions in space. Subsequent chapters highlight various spherical array configurations, including the popular rigid-sphere-based configuration. Beamforming (spatial filtering) in the spherical harmonics domain, including axis-symmetric beamforming, and the performance measures of directivity index and white noise gain are introduced, and a range of optimal beamformers for spherical arrays, including those that achieve maximum directivity and maximum robustness are developed, along with the Dolph–Chebyshev beamformer. The final chapter discusses more advanced beamformers, such as MVDR (minimum variance distortionless response) and LCMV (linearly constrained minimum variance) types, which are tailored to the measured sound field. Mathworks kindly distributes the Matlab sources for this book on https://www.mathworks.com/matlabcentral/fileexchange/68655-fundamentals-of-spherical-array-processing.




Modal Array Signal Processing: Principles and Applications of Acoustic Wavefield Decomposition


Book Description

This book deals with the problem of detecting and localizing multiple simultaneously active wideband acoustic sources by applying the notion of wavefield decomposition using circular and spherical microphone arrays. A rigorous derivation of modal array signal processing algorithms for unambiguous source detection and localization, as well as performance evaluations by means of measurements using an actual real-time capable implementation, are discussed.




Microphone Arrays


Book Description

This book explains the motivation for using microphone arrays as opposed to using a single sensor for sound acquisition. The book then goes on to summarize the most useful ideas, concepts, results, and new algorithms therein. The material presented in this work includes analysis of the advantages of using microphone arrays, including dimensionality reduction to remove the redundancy while preserving the variability of the array signals using the principal component analysis (PCA). The authors also discuss benefits such as beamforming with low-rank approximations, fixed, adaptive, and robust distortionless beamforming, differential beamforming, and a new form of binaural beamforming that takes advantage of both beamforming and human binaural hearing properties to improve speech intelligibility. The book makes the microphone array signal processing theory and applications available in a complete and self-contained text. The authors attempt to explain the main ideas in a clear and rigorous way so that the reader can easily capture the potentials, opportunities, challenges, and limitations of microphone array signal processing. This book is written for those who work on the topics of microphone arrays, noise reduction, speech enhancement, speech communication, and human-machine speech interfaces.




Intelligent Systems in Cybernetics and Automation Control Theory


Book Description

This book presents real-world problems and pioneering research that reflect novel approaches to cybernetics, algorithms and software engineering in the context of intelligent systems. It gathers the peer-reviewed proceedings of the 2nd Computational Methods in Systems and Software 2018 (CoMeSySo 2018), a conference that broke down traditional barriers by being held online. The goal of the event was to provide an international forum for discussing the latest high-quality research results.




Array Processing


Book Description

The focus of this book is on array processing and beamforming with Kronecker products. It considers a large family of sensor arrays that allow the steering vector to be decomposed as a Kronecker product of two steering vectors of smaller virtual arrays. Instead of directly designing a global beamformer for the original array, once the steering vector has been decomposed, smaller virtual beamformers are designed and separately optimized for each virtual array. This means the matrices that need to be inverted are smaller, which increases the robustness of the beamformers, and reduces the size of the observations. The book explains how to perform beamforming with Kronecker product filters using an unconventional approach. It shows how the Kronecker product formulation can be used to derive fixed, adaptive, and differential beamformers with remarkable flexibility. Furthermore, it demonstrates how fixed and adaptive beamformers can be intelligently combined, optimally exploiting the advantages of both. The problem of spatiotemporal signal enhancement is also addressed, and readers will learn how to perform Kronecker product filtering in this context.




Speech Processing in Modern Communication


Book Description

Modern communication devices, such as mobile phones, teleconferencing systems, VoIP, etc., are often used in noisy and reverberant environments. Therefore, signals picked up by the microphones from telecommunication devices contain not only the desired near-end speech signal, but also interferences such as the background noise, far-end echoes produced by the loudspeaker, and reverberations of the desired source. These interferences degrade the fidelity and intelligibility of the near-end speech in human-to-human telecommunications and decrease the performance of human-to-machine interfaces (i.e., automatic speech recognition systems). The proposed book deals with the fundamental challenges of speech processing in modern communication, including speech enhancement, interference suppression, acoustic echo cancellation, relative transfer function identification, source localization, dereverberation, and beamforming in reverberant environments. Enhancement of speech signals is necessary whenever the source signal is corrupted by noise. In highly non-stationary noise environments, noise transients, and interferences may be extremely annoying. Acoustic echo cancellation is used to eliminate the acoustic coupling between the loudspeaker and the microphone of a communication device. Identification of the relative transfer function between sensors in response to a desired speech signal enables to derive a reference noise signal for suppressing directional or coherent noise sources. Source localization, dereverberation, and beamforming in reverberant environments further enable to increase the intelligibility of the near-end speech signal.




Microphone Array Signal Processing


Book Description

In the past few years we have written and edited several books in the area of acousticandspeechsignalprocessing. Thereasonbehindthisendeavoristhat there were almost no books available in the literature when we ?rst started while there was (and still is) a real need to publish manuscripts summarizing the most useful ideas, concepts, results, and state-of-the-art algorithms in this important area of research. According to all the feedback we have received so far, we can say that we were right in doing this. Recently, several other researchers have followed us in this journey and have published interesting books with their own visions and perspectives. The idea of writing a book on Microphone Array Signal Processing comes from discussions we have had with many colleagues and friends. As a c- sequence of these discussions, we came up with the conclusion that, again, there is an urgent need for a monograph that carefully explains the theory and implementation of microphone arrays. While there are many manuscripts on antenna arrays from a narrowband perspective (narrowband signals and narrowband processing), the literature is quite scarce when it comes to s- sor arrays explained from a truly broadband perspective. Many algorithms for speech applications were simply borrowed from narrowband antenna - rays. However, a direct application of narrowband ideas to broadband speech processing may not be necessarily appropriate and can lead to many m- understandings.




Techniques for Noise Robustness in Automatic Speech Recognition


Book Description

Automatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice search system from a busy cafeteria or a street. This can result in degraded speech recordings and adversely affect the performance of speech recognition systems. As the use of ASR systems increases, knowledge of the state-of-the-art in techniques to deal with such problems becomes critical to system and application engineers and researchers who work with or on ASR technologies. This book presents a comprehensive survey of the state-of-the-art in techniques used to improve the robustness of speech recognition systems to these degrading external influences. Key features: Reviews all the main noise robust ASR approaches, including signal separation, voice activity detection, robust feature extraction, model compensation and adaptation, missing data techniques and recognition of reverberant speech. Acts as a timely exposition of the topic in light of more widespread use in the future of ASR technology in challenging environments. Addresses robustness issues and signal degradation which are both key requirements for practitioners of ASR. Includes contributions from top ASR researchers from leading research units in the field




Advances on P2P, Parallel, Grid, Cloud and Internet Computing


Book Description

This book aims to provide the latest research findings, innovative research results, methods and development techniques from both theoretical and practical perspectives related to P2P, Grid, Cloud and Internet computing as well as to reveal synergies among such large-scale computing paradigms. P2P, Grid, Cloud and Internet computing technologies have been very fast established as breakthrough paradigms for solving complex problems by enabling aggregation and sharing of an increasing variety of distributed computational resources at large scale. Grid Computing originated as a paradigm for high-performance computing, as an alternative to expensive supercomputers through different forms of large-scale distributed computing. P2P Computing emerged as a new paradigm after client–server and web-based computing and has shown useful to the development of social networking, B2B (Business to Business), B2C (Business to Consumer), B2G (Business to Government), B2E (Business to Employee), and so on. Cloud Computing has been defined as a “computing paradigm where the boundaries of computing are determined by economic rationale rather than technical limits”. Cloud computing has fast become the computing paradigm with applicability and adoption in all application domains and providing utility computing at large scale. Finally, Internet Computing is the basis of any large-scale distributed computing paradigms; it has very fast developed into a vast area of flourishing field with enormous impact on today’s information societies serving thus as a universal platform comprising a large variety of computing forms such as Grid, P2P, Cloud and Mobile computing.