Speech and Computer


Book Description

This book constitutes the refereed proceedings of the 16th International Conference on Speech and Computer, SPECOM 2014, held in Novi Sad, Serbia. The 56 revised full papers presented together with 3 invited talks were carefully reviewed and selected from 100 initial submissions. It is a conference with long tradition that attracts researchers in the area of computer speech processing (recognition, synthesis, understanding etc.) and related domains (including signal processing, language and text processing, multi-modal speech processing or human-computer interaction for instance).




Automatic Speech Recognition and Translation for Low Resource Languages


Book Description

AUTOMATIC SPEECH RECOGNITION and TRANSLATION for LOW-RESOURCE LANGUAGES This book is a comprehensive exploration into the cutting-edge research, methodologies, and advancements in addressing the unique challenges associated with ASR and translation for low-resource languages. Automatic Speech Recognition and Translation for Low Resource Languages contains groundbreaking research from experts and researchers sharing innovative solutions that address language challenges in low-resource environments. The book begins by delving into the fundamental concepts of ASR and translation, providing readers with a solid foundation for understanding the subsequent chapters. It then explores the intricacies of low-resource languages, analyzing the factors that contribute to their challenges and the significance of developing tailored solutions to overcome them. The chapters encompass a wide range of topics, ranging from both the theoretical and practical aspects of ASR and translation for low-resource languages. The book discusses data augmentation techniques, transfer learning, and multilingual training approaches that leverage the power of existing linguistic resources to improve accuracy and performance. Additionally, it investigates the possibilities offered by unsupervised and semi-supervised learning, as well as the benefits of active learning and crowdsourcing in enriching the training data. Throughout the book, emphasis is placed on the importance of considering the cultural and linguistic context of low-resource languages, recognizing the unique nuances and intricacies that influence accurate ASR and translation. Furthermore, the book explores the potential impact of these technologies in various domains, such as healthcare, education, and commerce, empowering individuals and communities by breaking down language barriers. Audience The book targets researchers and professionals in the fields of natural language processing, computational linguistics, and speech technology. It will also be of interest to engineers, linguists, and individuals in industries and organizations working on cross-lingual communication, accessibility, and global connectivity.




Fundamentals of Speaker Recognition


Book Description

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation. "Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System. Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists. Please click on the link under "Additional Information" to view supplemental information including the Table of Contents and Index.




Embedded Systems


Book Description

Nowadays, embedded systems - computer systems that are embedded in various kinds of devices and play an important role of specific control functions, have permeated various scenes of industry. Therefore, we can hardly discuss our life or society from now onwards without referring to embedded systems. For wide-ranging embedded systems to continue their growth, a number of high-quality fundamental and applied researches are indispensable. This book contains 13 excellent chapters and addresses a wide spectrum of research topics of embedded systems, including parallel computing, communication architecture, application-specific systems, and embedded systems projects. Embedded systems can be made only after fusing miscellaneous technologies together. Various technologies condensed in this book as well as in the complementary book "Embedded Systems - Theory and Design Methodology", will be helpful to researchers and engineers around the world.




Communications and Networking


Book Description

This book "Communications and Networking" focuses on the issues at the lowest two layers of communications and networking and provides recent research results on some of these issues. In particular, it first introduces recent research results on many important issues at the physical layer and data link layer of communications and networking and then briefly shows some results on some other important topics such as security and the application of wireless networks. In summary, this book covers a wide range of interesting topics of communications and networking. The introductions, data, and references in this book will help the readers know more abut this topic and help them explore this exciting and fast-evolving field.




Applications in Time-Frequency Signal Processing


Book Description

Because most real-world signals, including speech, sonar, communication, and biological signals, are non-stationary, traditional signal analysis tools such as Fourier transforms are of limited use because they do not provide easily accessible information about the localization of a given frequency component. A more suitable approach for those studying non-stationary signals is the use of time frequency representations that are functions of both time and frequency. Applications in Time-Frequency Signal Processing investigates the use of various time-frequency representations, such as the Wigner distribution and the spectrogram, in diverse application areas. Other books tend to focus on theoretical development. This book differs by highlighting particular applications of time-frequency representations and demonstrating how to use them. It also provides pseudo-code of the computational algorithms for these representations so that you can apply them to your own specific problems. Written by leaders in the field, this book offers the opportunity to learn from experts. Time-Frequency Representation (TFR) algorithms are simplified, enabling you to understand the complex theories behind TFRs and easily implement them. The numerous examples and figures, review of concepts, and extensive references allow for easy learning and application of the various time-frequency representations.




Advances in Biometrics


Book Description

This book constitutes the refereed proceedings of the International Conference on Biometrics, ICB 2007, held in Seoul, Korea, August 2007. Biometric criteria covered by the papers are assigned to face, fingerprint, iris, speech and signature, biometric fusion and performance evaluation, gait, keystrokes, and others. In addition, the volume also announces the results of the Face Authentication Competition, FAC 2006.




Applications of Space-Time Adaptive Processing


Book Description

This text discusses various applications of space-time adaptive processing, including applications in OTH-radar, ground target tracking, STAP in real world clutter environments, jammer cancellation, superresolution, active sonar, seismics and communications. It is divided into two parts: the first dealing with the classical adaptive suppression of airborne and spacebased radar clutter, and the second comprising of miscellaneous applications in other fields such as communications, underwater sound and seismics.




Smart Antennas


Book Description

Smart Antennas—State of the Art brings together the broad expertise of 41 European experts in smart antennas. They provide a comprehensive review and an extensive analysis of the recent progress and new results generated during the last years in almost all fields of smart antennas and MIMO (multiple-input multiple-output) transmission. The following represents a summarized table of content.Receiver: space-time processing, antenna combining, reduced rank processing, robust beamforming, subspace methods, synchronization, equalization, multiuser detection, iterative methods Channel: propagation, measurements and sounding, modelling, channel estimation, direction-of-arrival estimation, subscriber location estimation Transmitter: space-time block coding, channel side information, unified design of linear transceivers, ill-conditioned channels, MIMO-MAC strategies Network Theory: channel capacity, network capacity, multihop networks Technology: antenna design, transceivers, demonstrators and testbeds, future air interfaces Applications and Systems: 3G system and link level aspects, MIMO HSDPA, MIMO-WLAN/UMTS implementation issues This book serves as a reference for scientists and engineers who need to be aware of the leading edge research in multiple-antenna communications, an essential technology for emerging broadband wireless systems.




Robust Speech


Book Description

This book on Robust Speech Recognition and Understanding brings together many different aspects of the current research on automatic speech recognition and language understanding. The first four chapters address the task of voice activity detection which is considered an important issue for all speech recognition systems. The next chapters give several extensions to state-of-the-art HMM methods. Furthermore, a number of chapters particularly address the task of robust ASR under noisy conditions. Two chapters on the automatic recognition of a speaker's emotional state highlight the importance of natural speech understanding and interpretation in voice-driven systems. The last chapters of the book address the application of conversational systems on robots, as well as the autonomous acquisition of vocalization skills.