Speech and Computer


Book Description

This book constitutes the refereed proceedings of the 16th International Conference on Speech and Computer, SPECOM 2014, held in Novi Sad, Serbia. The 56 revised full papers presented together with 3 invited talks were carefully reviewed and selected from 100 initial submissions. It is a conference with long tradition that attracts researchers in the area of computer speech processing (recognition, synthesis, understanding etc.) and related domains (including signal processing, language and text processing, multi-modal speech processing or human-computer interaction for instance).




Embedded Systems


Book Description

Nowadays, embedded systems - computer systems that are embedded in various kinds of devices and play an important role of specific control functions, have permeated various scenes of industry. Therefore, we can hardly discuss our life or society from now onwards without referring to embedded systems. For wide-ranging embedded systems to continue their growth, a number of high-quality fundamental and applied researches are indispensable. This book contains 13 excellent chapters and addresses a wide spectrum of research topics of embedded systems, including parallel computing, communication architecture, application-specific systems, and embedded systems projects. Embedded systems can be made only after fusing miscellaneous technologies together. Various technologies condensed in this book as well as in the complementary book "Embedded Systems - Theory and Design Methodology", will be helpful to researchers and engineers around the world.




Handbook of Signal Processing in Acoustics


Book Description

The Handbook of Signal Processing in Acoustics brings together a wide range of perspectives from over 100 authors to reveal the interdisciplinary nature of the subject. It brings the key issues from both acoustics and signal processing into perspective and is a unique resource for experts and practitioners alike to find new ideas and techniques within the diversity of signal processing in acoustics.




Introduction to Audio Analysis


Book Description

Introduction to Audio Analysis serves as a standalone introduction to audio analysis, providing theoretical background to many state-of-the-art techniques. It covers the essential theory necessary to develop audio engineering applications, but also uses programming techniques, notably MATLAB®, to take a more applied approach to the topic. Basic theory and reproducible experiments are combined to demonstrate theoretical concepts from a practical point of view and provide a solid foundation in the field of audio analysis. Audio feature extraction, audio classification, audio segmentation, and music information retrieval are all addressed in detail, along with material on basic audio processing and frequency domain representations and filtering. Throughout the text, reproducible MATLAB® examples are accompanied by theoretical descriptions, illustrating how concepts and equations can be applied to the development of audio analysis systems and components. A blend of reproducible MATLAB® code and essential theory provides enable the reader to delve into the world of audio signals and develop real-world audio applications in various domains. - Practical approach to signal processing: The first book to focus on audio analysis from a signal processing perspective, demonstrating practical implementation alongside theoretical concepts - Bridge the gap between theory and practice: The authors demonstrate how to apply equations to real-life code examples and resources, giving you the technical skills to develop real-world applications - Library of MATLAB code: The book is accompanied by a well-documented library of MATLAB functions and reproducible experiments




Digital Signal Processing with Kernel Methods


Book Description

A realistic and comprehensive review of joint approaches to machine learning and signal processing algorithms, with application to communications, multimedia, and biomedical engineering systems Digital Signal Processing with Kernel Methods reviews the milestones in the mixing of classical digital signal processing models and advanced kernel machines statistical learning tools. It explains the fundamental concepts from both fields of machine learning and signal processing so that readers can quickly get up to speed in order to begin developing the concepts and application software in their own research. Digital Signal Processing with Kernel Methods provides a comprehensive overview of kernel methods in signal processing, without restriction to any application field. It also offers example applications and detailed benchmarking experiments with real and synthetic datasets throughout. Readers can find further worked examples with Matlab source code on a website developed by the authors: http://github.com/DSPKM • Presents the necessary basic ideas from both digital signal processing and machine learning concepts • Reviews the state-of-the-art in SVM algorithms for classification and detection problems in the context of signal processing • Surveys advances in kernel signal processing beyond SVM algorithms to present other highly relevant kernel methods for digital signal processing An excellent book for signal processing researchers and practitioners, Digital Signal Processing with Kernel Methods will also appeal to those involved in machine learning and pattern recognition.




Fundamentals of Speaker Recognition


Book Description

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation. "Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System. Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists. Please click on the link under "Additional Information" to view supplemental information including the Table of Contents and Index.




Pattern Recognition, Machine Intelligence and Biometrics


Book Description

"Pattern Recognition, Machine Intelligence and Biometrics" covers the most recent developments in Pattern Recognition and its applications, using artificial intelligence technologies within an increasingly critical field. It covers topics such as: image analysis and fingerprint recognition; facial expressions and emotions; handwriting and signatures; iris recognition; hand-palm gestures; and multimodal based research. The applications span many fields, from engineering, scientific studies and experiments, to biomedical and diagnostic applications, to personal identification and homeland security. In addition, computer modeling and simulations of human behaviors are addressed in this collection of 31 chapters by top-ranked professionals from all over the world in the field of PR/AI/Biometrics. The book is intended for researchers and graduate students in Computer and Information Science, and in Communication and Control Engineering. Dr. Patrick S. P. Wang is a Professor Emeritus at the College of Computer and Information Science, Northeastern University, USA, Zijiang Chair of ECNU, Shanghai, and NSC Visiting Chair Professor of NTUST, Taipei.




Analysis and Application of Natural Language and Speech Processing


Book Description

This book presents recent advances in NLP and speech technology, a topic attracting increasing interest in a variety of fields through its myriad applications, such as the demand for speech guided touchless technology during the Covid-19 pandemic. The authors present results of recent experimental research that provides contributions and solutions to different issues related to speech technology and speech in industry. Technologies include natural language processing, automatic speech recognition (for under-resourced dialects) and speech synthesis that are useful for applications such as intelligent virtual assistants, among others. Applications cover areas such as sentiment analysis and opinion mining, Arabic named entity recognition, and language modelling. This book is relevant for anyone interested in the latest in language and speech technology.




Multimedia Watermarking Techniques and Applications


Book Description

Intellectual property owners must continually exploit new ways of reproducing, distributing, and marketing their products. However, the threat of piracy looms as a major problem with digital distribution and storage technologies. Multimedia Watermarking Techniques and Applications covers all current and future trends in the design of modern