Sound Capture and Processing


Book Description

Provides state-of-the-art algorithms for sound capture, processing and enhancement Sound Capture and Processing: Practical Approaches covers the digital signal processing algorithms and devices for capturing sounds, mostly human speech. It explores the devices and technologies used to capture, enhance and process sound for the needs of communication and speech recognition in modern computers and communication devices. This book gives a comprehensive introduction to basic acoustics and microphones, with coverage of algorithms for noise reduction, acoustic echo cancellation, dereverberation and microphone arrays; charting the progress of such technologies from their evolution to present day standard. Sound Capture and Processing: Practical Approaches Brings together the state-of-the-art algorithms for sound capture, processing and enhancement in one easily accessible volume Provides invaluable implementation techniques required to process algorithms for real life applications and devices Covers a number of advanced sound processing techniques, such as multichannel acoustic echo cancellation, dereverberation and source separation Generously illustrated with figures and charts to demonstrate how sound capture and audio processing systems work An accompanying website containing Matlab code to illustrate the algorithms This invaluable guide will provide audio, R&D and software engineers in the industry of building systems or computer peripherals for speech enhancement with a comprehensive overview of the technologies, devices and algorithms required for modern computers and communication devices. Graduate students studying electrical engineering and computer science, and researchers in multimedia, cell-phones, interactive systems and acousticians will also benefit from this book.




Recording and Voice Processing, Volume 1


Book Description

Capturing, recording and broadcasting the voice is often difficult. Many factors must be taken into account and achieving a true representation is much more complex than one might think. The capture devices such as the position of the singer(s) or narrator(s), the acoustics, atmosphere and equipment are just some of the physical aspects that need to be mastered. Then there is the passage through the analog or digital channel, which disrupts the audio signal, as well as the processes that are often required to enrich, improve or even transform the vocal timbre and tessitura. While in the past these processes were purely material, today digital technologies and software produce surprising results that every professional in recording and broadcasting should know how to master. Recording and Voice Processing 1 addresses some general theoretical concepts. A history of recording and the physiology of the vocal apparatus are detailed in order to give the reader an understanding of the fundamental aspects of the subject. This volume also includes an advanced study of microphones, addressing their characteristics and typologies. The acoustic environment and its treatment are also considered in terms of the location of the sound capture - whether in a home studio, recording studio, live or natural environment - in order to achieve a satisfactory sound recording.




Spatial Audio Processing


Book Description

This book collects a wealth of information about spatial audio coding into one comprehensible volume. It is a thorough reference to the 3GPP and MPEG Parametric Stereo standards and the MPEG Surround multi-channel audio coding standard. It describes key developments in coding techniques, which is an important factor in the optimization of advanced entertainment, communications and signal processing applications. Until recently, technologies for coding audio signals, such as redundancy reduction and sophisticated source and receiver models did not incorporate spatial characteristics of source and receiving ends. Spatial audio coding achieves much higher compression ratios than conventional coders. It does this by representing multi-channel audio signals as a downmix signal plus side information that describes the perceptually-relevant spatial information. Written by experts in spatial audio coding, Spatial Audio Processing: reviews psychoacoustics (the relationship between physical measures of sound and the corresponding percepts) and spatial audio sound formats and reproduction systems; brings together the processing, acquisition, mixing, playback, and perception of spatial audio, with the latest coding techniques; analyses algorithms for the efficient manipulation of multiple, discrete and combined spatial audio channels, including both MP3 and MPEG Surround; shows how the same insights on source and receiver models can also be applied for manipulation of audio signals, such as the synthesis of virtual auditory scenes employing head-related transfer function (HRTF) processing and stereo to N-channel audio upmix. Audio processing research engineers and audio coding research and implementation engineers will find this an insightful guide. Academic audio and psychoacoustic researchers, including post-graduate and third/fourth year students taking courses in signal processing, audio and speech processing, and telecommunications, will also benefit from the information inside.




Parametric Time-Frequency Domain Spatial Audio


Book Description

A comprehensive guide that addresses the theory and practice of spatial audio This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in. Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming—covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book will teach readers the tools needed for such processing, and provides an overview to existing research. It also shows recent up-to-date projects and commercial applications built on top of the systems. Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies Includes contributions from leading researchers in the field Offers MATLAB codes with selected chapters An advanced book aimed at readers who are capable of digesting mathematical expressions about digital signal processing and sound field analysis, Parametric Time-frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.




Speech and Audio Processing


Book Description

With this comprehensive and accessible introduction to the field, you will gain all the skills and knowledge needed to work with current and future audio, speech, and hearing processing technologies. Topics covered include mobile telephony, human-computer interfacing through speech, medical applications of speech and hearing technology, electronic music, audio compression and reproduction, big data audio systems and the analysis of sounds in the environment. All of this is supported by numerous practical illustrations, exercises, and hands-on MATLAB® examples on topics as diverse as psychoacoustics (including some auditory illusions), voice changers, speech compression, signal analysis and visualisation, stereo processing, low-frequency ultrasonic scanning, and machine learning techniques for big data. With its pragmatic and application driven focus, and concise explanations, this is an essential resource for anyone who wants to rapidly gain a practical understanding of speech and audio processing and technology.




Integrated Devices for Artificial Intelligence and VLSI


Book Description

With its in-depth exploration of the close connection between microelectronics, AI, and VLSI technology, this book offers valuable insights into the cutting-edge techniques and tools used in VLSI design automation, making it an essential resource for anyone seeking to stay ahead in the rapidly evolving field of VLSI design. Very large-scale integration (VLSI) is the inter-disciplinary science of utilizing advanced semiconductor technology to create various functions of computer system. This book addresses the close link of microelectronics and artificial intelligence (AI). By combining VLSI technology, a very powerful computer architecture confinement is possible. To overcome problems at different design stages, researchers introduced artificial intelligent (AI) techniques in VLSI design automation. AI techniques, such as knowledge-based and expert systems, first try to define the problem and then choose the best solution from the domain of possible solutions. These days, several CAD technologies, such as Synopsys and Mentor Graphics, are specifically created to increase the automation of VLSI design. When a task is completed using the appropriate tool, each stage of the task design produces outcomes that are more productive than typical. However, combining all of these tools into a single package offer has drawbacks. We can’t really use every outlook without sacrificing the efficiency and usefulness of our output. The researchers decided to include AI approaches into VLSI design automation in order to get around these obstacles. AI is one of the fastest growing tools in the world of technology and innovation that helps to make computers more reliable and easy to use. Artificial Intelligence in VLSI design has provided high-end and more feasible solutions to the difficulties faced by the VLSI industry. Physical design, RTL design, STA, etc. are some of the most in-demand courses to enter the VLSI industry. These courses help develop a better understanding of the many tools like Synopsis. With each new dawn, artificial intelligence in VLSI design is continually evolving, and new opportunities are being investigated.




Speech and Audio Processing in Adverse Environments


Book Description

Users of signal processing systems are never satis?ed with the system they currently use. They are constantly asking for higher quality, faster perf- mance, more comfort and lower prices. Researchers and developers should be appreciative for this attitude. It justi?es their constant e?ort for improved systems. Better knowledge about biological and physical interrelations c- ing along with more powerful technologies are their engines on the endless road to perfect systems. This book is an impressive image of this process. After “Acoustic Echo 1 and Noise Control” published in 2004 many new results lead to “Topics in 2 Acoustic Echo and Noise Control” edited in 2006 . Today – in 2008 – even morenew?ndingsandsystemscouldbecollectedinthisbook.Comparingthe contributions in both edited volumes progress in knowledge and technology becomesclearlyvisible:Blindmethodsandmultiinputsystemsreplace“h- ble” low complexity systems. The functionality of new systems is less and less limited by the processing power available under economic constraints. The editors have to thank all the authors for their contributions. They cooperated readily in our e?ort to unify the layout of the chapters, the ter- nology, and the symbols used. It was a pleasure to work with all of them. Furthermore, it is the editors concern to thank Christoph Baumann and the Springer Publishing Company for the encouragement and help in publi- ing this book.




Intelligent Technologies for Interactive Entertainment


Book Description

This book constitutes the proceedings of the 5th International Conference on Intelligent Technologies for Interactive Entertainment, INTETAIN 2013. The 23 full papers presented were carefully selected from numerous submissions. The conference aims at enhancing the understanding of recent and anticipated advances in interactive technologies, and their applications to entertainment, education, culture, and the arts. The papers are grouped in topical sections on linked media, gaming technologies, and technologies for live entertainment.




Microphone Array Signal Processing


Book Description

In the past few years we have written and edited several books in the area of acousticandspeechsignalprocessing. Thereasonbehindthisendeavoristhat there were almost no books available in the literature when we ?rst started while there was (and still is) a real need to publish manuscripts summarizing the most useful ideas, concepts, results, and state-of-the-art algorithms in this important area of research. According to all the feedback we have received so far, we can say that we were right in doing this. Recently, several other researchers have followed us in this journey and have published interesting books with their own visions and perspectives. The idea of writing a book on Microphone Array Signal Processing comes from discussions we have had with many colleagues and friends. As a c- sequence of these discussions, we came up with the conclusion that, again, there is an urgent need for a monograph that carefully explains the theory and implementation of microphone arrays. While there are many manuscripts on antenna arrays from a narrowband perspective (narrowband signals and narrowband processing), the literature is quite scarce when it comes to s- sor arrays explained from a truly broadband perspective. Many algorithms for speech applications were simply borrowed from narrowband antenna - rays. However, a direct application of narrowband ideas to broadband speech processing may not be necessarily appropriate and can lead to many m- understandings.




The Art and Science of 3D Audio Recording


Book Description

This professional book offers a unique, comprehensive and timely guide on 3D audio recording. Intended for sound engineers and professionals, and summarizing more than twenty-year research on this topic, it includes extensive information and details on various microphone techniques and loudspeaker layouts, such as Auro-3D®, Dolby® AtmosTM, DTS:X®, MMAD, SONY 360 Reality Audio and Ambisonics. It presents a rich set of results obtained from both objective measurements and subjective listening tests, and a number of case studies for 3D recording, ranging from solo-instrument techniques to full symphony orchestra, and microphone systems for virtual reality applications. Further, it includes a chapter on spatial hearing discussing issues of 3D audio sound reproduction. All in all, this book offers extensive, practical information for sound engineers and professionals.