ICASSP 86


Proceedings of the 2012 International Conference on Information Technology and Software Engineering


Book Description

Proceedings of the 2012 International Conference on Information Technology and Software Engineering presents selected articles from this major event, which was held in Beijing, December 8-10, 2012. This book presents the latest research trends, methods and experimental results in the fields of information technology and software engineering, covering various state-of-the-art research theories and approaches. The subjects range from intelligent computing to information processing, software engineering, Web, unified modeling language (UML), multimedia, communication technologies, system identification, graphics and visualizing, etc. The proceedings provide a major interdisciplinary forum for researchers and engineers to present the most innovative studies and advances, which can serve as an excellent reference work for researchers and graduate students working on information technology and software engineering. Prof. Wei Lu, Dr. Guoqiang Cai, Prof. Weibin Liu and Dr. Weiwei Xing all work at Beijing Jiaotong University.







Language and Speech Processing


Book Description

Speech processing addresses various scientific and technological areas. It includes speech analysis and variable rate coding, in order to store or transmit speech. It also covers speech synthesis, especially from text, speech recognition, including speaker and language identification, and spoken language understanding. This book covers the following topics: how to realize speech production and perception systems, and how to synthesize and understand speech using state-of-the-art methods in signal processing, pattern recognition, stochastic modelling, computational linguistics and human factor studies.




Audio Processing and Speech Recognition


Book Description

This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and the classical information retrieval problem, and presents two major indexing techniques, namely Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses various components of an automatic speech recognition (ASR) system. Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification. By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals to a wide audience, from researchers and postgraduate students to those new to the field.




Speaker Classification I


Book Description

This volume and its companion volume LNAI 4441 constitute a state-of-the-art survey in the field of speaker classification. Together they address such intriguing issues as how speaker characteristics are manifested in voice and speaking behavior. The nineteen contributions in this volume are organized into topical sections covering fundamentals, characteristics, applications, methods, and evaluation.




Fundamentals of Speaker Recognition


Book Description

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation. "Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System. Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists.




Interactive Displays


Book Description

How we interface and interact with computing, communications and entertainment devices is going through revolutionary changes, with natural user inputs based on touch, voice, and vision replacing or augmenting the use of traditional interfaces based on the keyboard, mouse, joysticks, etc. As a result, displays are morphing from one-way interface devices that merely show visual content to two-way interaction devices that provide more engaging and immersive experiences. This book provides in-depth coverage of the technologies, applications, and trends in the rapidly emerging field of interactive displays enabled by natural human interfaces.

Key features:
- Provides a definitive reference on all the touch technologies used in interactive displays, including their advantages, limitations, and future trends.
- Covers the fundamentals and applications of speech input, processing and recognition techniques enabling voice-based interactions.
- Offers a detailed review of the emerging vision-based sensing technologies, and of user interactions using gestures of the hands, body, and face, and eye gaze.
- Discusses multi-modal natural user interface schemes which intuitively combine touch, voice, and vision for life-like interactions.
- Examines the requirements and technology status towards realizing “true” 3D immersive and interactive displays.




Voice and Audio Compression for Wireless Communications


Book Description

Voice communications remains the most important facet of mobile radio services, which may be delivered over conventional fixed links, the Internet or wireless channels. This all-encompassing volume reports on the entire 50-year history of voice compression, on recent audio compression techniques and the protection as well as transmission of these signals in hostile wireless propagation environments. Audio and Voice Compression for Wireless and Wireline Communications, Second Edition is divided into four parts with Part I covering the basics, while Part II outlines the design of analysis-by-synthesis coding, including a 100-page chapter on virtually all existing standardised speech codecs. The focus of Part III is on wideband and audio coding as well as transmission. Finally, Part IV concludes the book with a range of very low rate encoding techniques, scanning a range of research-oriented topics.

- Fully updated and revised second edition of “Voice Compression and Communications”, expanded to cover Audio features
- Includes two new chapters, on narrowband and wideband AMR coding, and MPEG audio coding
- Addresses the new developments in the field of wideband speech and audio compression
- Covers compression, error resilience and error correction coding, as well as transmission aspects, including cutting-edge turbo transceivers
- Presents both the historic and current view of speech compression and communications

Covering fundamental concepts in a non-mathematical way before moving to detailed discussions of theoretical principles, future concepts and solutions to various specific wireless voice communication problems, this book will appeal to both advanced readers and those with a background knowledge of signal processing and communications.




Academic Press Library in Signal Processing


Book Description

This fourth volume, edited and authored by world leading experts, gives a review of the principles, methods and techniques of important and emerging research topics and technologies in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing.

With this reference source you will:
- Quickly grasp a new area of research
- Understand the underlying principles of a topic and its application
- Ascertain how a topic relates to other areas and learn of the research issues yet to be resolved

Key features:
- Quick tutorial reviews of important and emerging topics of research in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing
- Presents core principles and shows their application
- Reference content on core principles, technologies, algorithms and applications
- Comprehensive references to journal articles and other literature on which to build further, more specific and detailed knowledge
- Edited by leading people in the field who, through their reputation, have been able to commission experts to write on a particular topic