Time-Domain Beamforming and Blind Source Separation


Book Description

This book addresses the problem of separating spontaneous multi-party speech by way of microphone arrays (beamformers) and adaptive signal processing techniques. It is written is a concise manner and an effort has been made such that all presented algorithms can be straightforwardly implemented by the reader. All experimental results have been obtained with real in-car microphone recordings involving simultaneous speech of the driver and the co-driver.




Audio Signal Processing for Next-Generation Multimedia Communication Systems


Book Description

Audio Signal Processing for Next-Generation Multimedia Communication Systems presents cutting-edge digital signal processing theory and implementation techniques for problems including speech acquisition and enhancement using microphone arrays, new adaptive filtering algorithms, multichannel acoustic echo cancellation, sound source tracking and separation, audio coding, and realistic sound stage reproduction. This book's focus is almost exclusively on the processing, transmission, and presentation of audio and acoustic signals in multimedia communications for telecollaboration where immersive acoustics will play a great role in the near future.




Filtering, Segmentation, and Depth


Book Description

"Computer vision seeks a process that starts with a noisy, ambiguous signal from a TV camera and ends with a high-level description of discrete objects located in 3-dimensional space and identified in a human classification. This book addresses the process at several levels. First to be treated are the low-level image-processing issues of noise removaland smoothing while preserving important lines and singularities in an image. At a slightly higher level, a robust contour tracing algorithm is described that produces a cartoon of the important lines in the image. Thirdis the high-level task of reconstructing the geometry of objects in the scene. The book has two aims: to give the computer vision community a new approach to early visual processing, in the form of image segmentation that incorporates occlusion at a low level, and to introduce real computer algorithms that do a better job than what most vision programmers use currently. The algorithms are: - a nonlinear filter that reduces noise and enhances edges, - an edge detector that also finds corners and produces smoothed contours rather than bitmaps, - an algorithm for filling gaps in contours."--PUBLISHER'S WEBSITE.




Blind Source Separation


Book Description

Blind Source Separation intends to report the new results of the efforts on the study of Blind Source Separation (BSS). The book collects novel research ideas and some training in BSS, independent component analysis (ICA), artificial intelligence and signal processing applications. Furthermore, the research results previously scattered in many journals and conferences worldwide are methodically edited and presented in a unified form. The book is likely to be of interest to university researchers, R&D engineers and graduate students in computer science and electronics who wish to learn the core principles, methods, algorithms and applications of BSS. Dr. Ganesh R. Naik works at University of Technology, Sydney, Australia; Dr. Wenwu Wang works at University of Surrey, UK.




Audio Source Separation and Speech Enhancement


Book Description

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: Consolidated perspective on audio source separation and speech enhancement. Both historical perspective and latest advances in the field, e.g. deep neural networks. Diverse disciplines: array processing, machine learning, and statistical signal processing. Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.




Blind Speech Separation


Book Description

This is the world’s first edited book on independent component analysis (ICA)-based blind source separation (BSS) of convolutive mixtures of speech. This book brings together a small number of leading researchers to provide tutorial-like and in-depth treatment on major ICA-based BSS topics, with the objective of becoming the definitive source for current, comprehensive, authoritative, and yet accessible treatment.




Springer Handbook of Speech Processing


Book Description

This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.




Speech Enhancement


Book Description

We live in a noisy world! In all applications (telecommunications, hands-free communications, recording, human-machine interfaces, etc.) that require at least one microphone, the signal of interest is usually contaminated by noise and reverberation. As a result, the microphone signal has to be "cleaned" with digital signal processing tools before it is played out, transmitted, or stored. This book is about speech enhancement. Different well-known and state-of-the-art methods for noise reduction, with one or multiple microphones, are discussed. By speech enhancement, we mean not only noise reduction but also dereverberation and separation of independent signals. These topics are also covered in this book. However, the general emphasis is on noise reduction because of the large number of applications that can benefit from this technology. The goal of this book is to provide a strong reference for researchers, engineers, and graduate students who are interested in the problem of signal and speech enhancement. To do so, we invited well-known experts to contribute chapters covering the state of the art in this focused field. TOC:Introduction.- Study of the Wiener Filter for Noise Reduction.- Statistical Methods for the Enhancement of Noisy Speech.- Single- und Multi-Microphone Spectral Amplitude Estimation Using a Super-Gaussian Speech Model.- From Volatility Modeling of Financial Time-Series to Stochastic Modeling and Enhancement of Speech Signals.- Single-Microphone Noise Suppression for 3G Handsets Based on Weighted Noise Estimation.- Signal Subspace Techniques for Speech Enhancement.- Speech Enhancement: Application of the Kalman Filter in the Estimate-Maximize (EM) Framework.- Speech Distortion Weighted Multichannel Wiener Filtering Techniques for Noise Reduction.- Adpative Microphone Arrays Employing Spatial Quadratic Soft Constraints and Spectral Shaping.- Single-Microphone Blind Dereverberation.- Separation and Dereverberation of Speech Signals with Multiple Microphones.- Frequency-Domain Blind Source Separation.- Subband Based Blind Source Separation.- Real-Time Blind Source Separation for Moving Speech Signals.- Separation of Speech by Computational Auditory Scene Analysis




Independent Component Analysis and Signal Separation


Book Description

This book constitutes the refereed proceedings of the 7th International Conference on Independent Component Analysis and Blind Source Separation, ICA 2007, held in London, UK, in September 2007. It covers algorithms and architectures, applications, medical applications, speech and signal processing, theory, and visual and sensory processing.




Independent Component Analysis and Blind Signal Separation


Book Description

In many situations found both in Nature and in human-built systems, a set of mixed signals is observed (frequently also with noise), and it is of great scientific and technological relevance to be able to isolate or separate them so that the information in each of the signals can be utilized. Blind source separation (BSS) research is one of the more interesting emerging fields now a days in the field of signal processing. It deals with the algorithms that allow the recovery of the original sources from a set of mixtures only. The adjective "blind" is applied because the purpose is to estimate the original sources without any a priori knowledge about either the sources or the mixing system. Most of the models employed in BSS assume the hypothesis about the independence of the original sources. Under this hypothesis, a BSS problem can be considered as a particular case of independent component analysis(ICA), a linear transformation technique that, starting from a multivariate representation of the data, minimizes the statistical dependence between the components of the representation. It can be claimed that most of the advances in ICA have been motivated by the search for solutions to the BSS problem and, the other way around, advances in ICA have been immediately applied to BSS. ICA and BSS algorithms start from a mixture model, whose parameters are estimated from the observed mixtures. Separation is achieved by applying the inverse mixture model to the observed signals(separating or unmixing model). Mixturem- els usually fall into three broad categories: instantaneous linear models, convolutive models and nonlinear models, the?rstone being the simplest but, in general, not near realistic applications. The development and test of the algorithms can be accomplished through synthetic data or with real-world data. Obviously, the most important aim(and most difficult) is the separation of real-world mixtures. BSS and ICA have strong relations also, apart from signal processing, with other fields such as statistics and artificial neural networks. As long as we can find a system that emits signals propagated through a mean, andthosesignalsarereceivedbyasetofsensorsandthereisaninterestinrecovering the original sources, we have a potential field of application for BSS and ICA. Inside that wide range of applications we can find, for instance: noise reduction applications, biomedical applications, audio systems, telecommunications, and many others. This volume comes out just 20 years after the first contributions in ICA and BSS 1 appeared . Therein after, the number of research groups working in ICA and BSS has been constantly growing, so that nowadays we can estimate that far more than 100 groups are researching in these fields. As proof of the recognition among the scientific community of ICA and BSS developments there have been numerous special sessions and special issues in several well- 1 J. Herault, B. Ans, "Circuits neuronaux à synapses modi?ables: décodage de messages composites para apprentissage non supervise", C.R. de l'Académie des Sciences, vol. 299, no. III-13,pp.525-528,1984