Sound Capture for Human / Machine Interfaces


Book Description

With a continuously increasing desire for natural and comfortable human/machine interaction, the acoustic interface of any terminal for multimedia or telecommunication services is challenged to allow seamless and hands-free audio communication. Sound Capture for Human-Machine Interfaces introduces the practical aspects of microphone array signal processing and presents various combinations of beamforming and acoustic echo cancellation.




Sound Capture and Processing


Book Description

Provides state-of-the-art algorithms for sound capture, processing and enhancement.
Sound Capture and Processing: Practical Approaches covers the digital signal processing algorithms and devices for capturing sounds, mostly human speech. It explores the devices and technologies used to capture, enhance and process sound for the needs of communication and speech recognition in modern computers and communication devices. This book gives a comprehensive introduction to basic acoustics and microphones, with coverage of algorithms for noise reduction, acoustic echo cancellation, dereverberation and microphone arrays, charting the progress of these technologies from their origins to the present-day standard.
Sound Capture and Processing: Practical Approaches:
- Brings together the state-of-the-art algorithms for sound capture, processing and enhancement in one easily accessible volume
- Provides invaluable implementation techniques required to process algorithms for real-life applications and devices
- Covers a number of advanced sound processing techniques, such as multichannel acoustic echo cancellation, dereverberation and source separation
- Is generously illustrated with figures and charts to demonstrate how sound capture and audio processing systems work
- Has an accompanying website containing Matlab code to illustrate the algorithms
This invaluable guide will provide audio, R&D and software engineers in the industry of building systems or computer peripherals for speech enhancement with a comprehensive overview of the technologies, devices and algorithms required for modern computers and communication devices. Graduate students studying electrical engineering and computer science, researchers in multimedia, cell phones and interactive systems, and acousticians will also benefit from this book.




Fundamentals of Signal Enhancement and Array Signal Processing


Book Description

A comprehensive guide to the theory and practice of signal enhancement and array signal processing, including Matlab codes, exercises, and instructor and solution manuals.
- Systematically introduces the fundamental principles, theory and applications of signal enhancement and array signal processing in an accessible manner
- Offers an updated and relevant treatment of array signal processing with rigor and concision
- Features a companion website that includes presentation files with lecture notes, homework exercises, course projects, solution manuals, instructor manuals, and Matlab codes for the examples in the book




Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction


Book Description

This book is dedicated to the dreamers, their dreams, and their perseverance in research work. This volume brings together the selected and peer-reviewed contributions of the participants at the COST 2102 International Conference on Verbal and Nonverbal Features of Human–Human and Human–Machine Interaction, held in Patras, Greece, October 29–31, 2007, hosted by the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008). The conference was sponsored by COST (European Cooperation in the Field of Scientific and Technical Research, www.cost.esf.org) in the domain of Information and Communication Technologies (ICT) for disseminating the advances of the research activity developed within COST Action 2102: “Cross-Modal Analysis of Verbal and Nonverbal Communication” (www.cost2102.eu). COST Action 2102 is a network of about 60 European and 6 overseas laboratories whose aim is to develop “an advanced acoustical, perceptual and psychological analysis of verbal and non-verbal communication signals originating in spontaneous face-to-face interaction, in order to identify algorithms and automatic procedures capable of identifying the human emotional states. Particular care is devoted to the recognition of emotional states, gestures, speech and facial expressions, in anticipation of the implementation of intelligent avatars and interactive dialogue systems that could be exploited to improve user access to future telecommunication services” (see COST 2102 Memorandum of Understanding (MoU) www.cost2102.eu).




Theory and Applications of Spherical Microphone Array Processing


Book Description

This book presents the signal processing algorithms that have been developed to process the signals acquired by a spherical microphone array. Spherical microphone arrays can be used to capture the sound field in three dimensions and have received significant interest from researchers and audio engineers. Algorithms for spherical array processing are different to corresponding algorithms already known in the literature of linear and planar arrays because the spherical geometry can be exploited to great beneficial effect. The authors aim to advance the field of spherical array processing by helping those new to the field to study it efficiently and from a single source, as well as by offering a way for more experienced researchers and engineers to consolidate their understanding, adding either or both of breadth and depth. The level of the presentation corresponds to graduate studies at MSc and PhD level. This book begins with a presentation of some of the essential mathematical and physical theory relevant to spherical microphone arrays, and of an acoustic impulse response simulation method, which can be used to comprehensively evaluate spherical array processing algorithms in reverberant environments. The chapter on acoustic parameter estimation describes the way in which useful descriptions of acoustic scenes can be parameterized, and the signal processing algorithms that can be used to estimate the parameter values using spherical microphone arrays. Subsequent chapters exploit these parameters, including in particular measures of direction-of-arrival and of diffuseness of a sound field. The array processing algorithms are then classified into two main classes, each described in a separate chapter. These are signal-dependent and signal-independent beamforming algorithms. Although signal-dependent beamforming algorithms are in theory able to provide better performance compared to the signal-independent algorithms, they are currently rarely used in practice. The main reason for this is that the statistical information required by these algorithms is difficult to estimate. In a subsequent chapter it is shown how the estimated acoustic parameters can be used in the design of signal-dependent beamforming algorithms. This final step closes, at least in part, the gap between theory and practice.




Handbook of Virtual Environments


Book Description

This Handbook, with contributions from leading experts in the field, provides a comprehensive, state-of-the-art account of virtual environments (VE). It serves as an invaluable source of reference for practitioners, researchers, and students in this rapidly evolving discipline. It also provides practitioners with a reference source to guide




The Democratization of Artificial Intelligence


Book Description

After a long time of neglect, Artificial Intelligence is once again at the center of most of our political, economic, and socio-cultural debates. Recent advances in the field of Artificial Neural Networks have led to a renaissance of dystopian and utopian speculations on an AI-rendered future. Algorithmic technologies are deployed for identifying potential terrorists through vast surveillance networks, for producing sentencing guidelines and recidivism risk profiles in criminal justice systems, for demographic and psychographic targeting of bodies for advertising or propaganda, and more generally for automating the analysis of language, text, and images. Against this background, the aim of this book is to discuss the heterogeneous conditions, implications, and effects of modern AI and Internet technologies in terms of their political dimension: What does it mean to critically investigate efforts of net politics in the age of machine learning algorithms?




Advanced Strategies in Control Systems with Input and Output Constraints


Book Description

Physical, safety and technological constraints suggest that control actuators can provide neither unlimited amplitude signals nor unlimited speed of reaction. The techniques described in this book are useful for industrial applications in aeronautical or space domains, and in the context of biological systems. Such methods are well suited for the development of tools that help engineers to solve analysis and synthesis problems of control systems with input and output constraints.




Artificial Intelligence and Multimodal Signal Processing in Human-Machine Interaction


Book Description

Artificial Intelligence and Multimodal Signal Processing in Human-Machine Interaction presents an overview of an emerging field that is concerned with exploiting multiple modalities of communication in both Artificial Intelligence and Human-Machine Interaction. The book not only provides cross-disciplinary research in the fields of multimodal signal acquisition and sensing, analysis, IoT (Internet of Things), Artificial Intelligence, and system architectures, it also evaluates the role of Artificial Intelligence in relation to the realization of contemporary Human-Machine Interaction (HMI) systems. Readers are introduced to multimodal signals; the role of these signals in identifying the intended subject's mental state and in realizing HMI systems is explored; and the applications of signal processing and machine/ensemble/deep learning for HMIs are assessed. A description of proposed methodologies is provided, and related works are also presented. This is a valuable resource for researchers, health professionals, postgraduate students, postdoctoral researchers and faculty members in the fields of HMIs, Brain-Computer Interface (BCI), Prosthesis, Computer vision, and Mental state estimation, and all those who wish to broaden their knowledge in the allied field.
- Covers advances in multimodal signal processing and artificial intelligence assistive HMIs
- Presents theories, algorithms, realizations, applications, approaches, and challenges that will have their impact and contribution in the design and development of modern and effective HMI systems
- Presents different aspects of multimodal signals, from sensing to analysis using hardware/software, and making use of machine/ensemble/deep learning in the intended problem-solving




Human-Machine Interface


Book Description

The book contains the latest advances in healthcare and presents them in the frame of the Human-Machine Interface (HMI). The Human-Machine Interface (HMI) industry has witnessed the evolution from a simple push button to a modern touch-screen display. HMI is a user interface that allows humans to operate controllers for machines, systems, or instruments. Most medical procedures are improved by HMI systems, from calling an ambulance to ensuring that a patient receives adequate treatment on time. This book describes the scenario of biomedical technologies in the context of the advanced HMI, with a focus on direct brain-computer connection. The book describes several HMI tools and related techniques for analyzing, creating, controlling, and upgrading healthcare delivery systems, and provides details regarding how advancements in technology, particularly HMI, ensure ethical and fair use in patient care.
Audience: The target audience for this book is medical personnel and policymakers in healthcare and pharmaceutical professionals, as well as engineers and researchers in computer science and artificial intelligence.