ICASSP 86


Book Description







Speaker Classification I


Book Description

This volume and its companion volume LNAI 4441 constitute a state-of-the-art survey in the field of speaker classification. Together they address such intriguing issues as how speaker characteristics are manifested in voice and speaking behavior. The nineteen contributions in this volume are organized into topical sections covering fundamentals, characteristics, applications, methods, and evaluation.




Language and Speech Processing


Book Description

Speech processing addresses various scientific and technological areas. It includes speech analysis and variable rate coding, in order to store or transmit speech. It also covers speech synthesis, especially from text, speech recognition, including speaker and language identification, and spoken language understanding. This book covers the following topics: how to realize speech production and perception systems, how to synthesize and understand speech using state-of-the-art methods in signal processing, pattern recognition, stochastic modelling computational linguistics and human factor studies.




Audio Processing and Speech Recognition


Book Description

This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and classical information retrieval problem and presents two major indexing techniques, namely Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses various components of an automatic speech recognition (ASR) system. Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification. By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals a wide audience, from researchers and postgraduate students to those new to the field.




Proceedings of the 2012 International Conference on Information Technology and Software Engineering


Book Description

Proceedings of the 2012 International Conference on Information Technology and Software Engineering presents selected articles from this major event, which was held in Beijing, December 8-10, 2012. This book presents the latest research trends, methods and experimental results in the fields of information technology and software engineering, covering various state-of-the-art research theories and approaches. The subjects range from intelligent computing to information processing, software engineering, Web, unified modeling language (UML), multimedia, communication technologies, system identification, graphics and visualizing, etc. The proceedings provide a major interdisciplinary forum for researchers and engineers to present the most innovative studies and advances, which can serve as an excellent reference work for researchers and graduate students working on information technology and software engineering. Prof. Wei Lu, Dr. Guoqiang Cai, Prof. Weibin Liu and Dr. Weiwei Xing all work at Beijing Jiaotong University.




Fundamentals of Speaker Recognition


Book Description

An emerging technology, Speaker Recognition is becoming well-known for providing voice authentication over the telephone for helpdesks, call centres and other enterprise businesses for business process automation. "Fundamentals of Speaker Recognition" introduces Speaker Identification, Speaker Verification, Speaker (Audio Event) Classification, Speaker Detection, Speaker Tracking and more. The technical problems are rigorously defined, and a complete picture is made of the relevance of the discussed algorithms and their usage in building a comprehensive Speaker Recognition System. Designed as a textbook with examples and exercises at the end of each chapter, "Fundamentals of Speaker Recognition" is suitable for advanced-level students in computer science and engineering, concentrating on biometrics, speech recognition, pattern recognition, signal processing and, specifically, speaker recognition. It is also a valuable reference for developers of commercial technology and for speech scientists. Please click on the link under "Additional Information" to view supplemental information including the Table of Contents and Index.




Interactive Displays


Book Description

How we interface and interact with computing, communications and entertainment devices is going through revolutionary changes, with natural user inputs based on touch, voice, and vision replacing or augmenting the use of traditional interfaces based on the keyboard, mouse, joysticks, etc. As a result, displays are morphing from one-way interface devices that merely show visual content to two-way interaction devices that provide more engaging and immersive experiences. This book provides an in-depth coverage of the technologies, applications, and trends in the rapidly emerging field of interactive displays enabled by natural human-interfaces. Key features: Provides a definitive reference reading on all the touch technologies used in interactive displays, including their advantages, limitations, and future trends. Covers the fundamentals and applications of speech input, processing and recognition techniques enabling voice-based interactions. Offers a detailed review of the emerging vision-based sensing technologies, and user interactions using gestures of hands, body, face, and eye gazes. Discusses multi-modal natural user interface schemes which intuitively combine touch, voice, and vision for life-like interactions. Examines the requirements and technology status towards realizing “true” 3D immersive and interactive displays.




Intelligent Computing


Book Description

The book, “Intelligent Computing - Proceedings of the 2022 Computing Conference”, is a comprehensive collection of chapters focusing on the core areas of computing and their further applications in the real world. Each chapter is a paper presented at the Computing Conference 2022 held on July 14–15, 2022. Computing 2022 attracted a total of 498 submissions which underwent a double-blind peer-review process. Of those 498 submissions, 179 submissions have been selected to be included in this book. The goal of this conference is to give a platform to researchers with fundamental contributions and to be a premier venue for academic and industry practitioners to share new ideas and development experiences. We hope that readers find this book interesting and valuable as it provides the state-of-the-art intelligent methods and techniques for solving real-world problems. We also expect that the conference and its publications will be a trigger for further related research and technology improvements in this important subject.




Academic Press Library in Signal Processing


Book Description

This fourth volume, edited and authored by world leading experts, gives a review of the principles, methods and techniques of important and emerging research topics and technologies in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing. With this reference source you will: - Quickly grasp a new area of research - Understand the underlying principles of a topic and its application - Ascertain how a topic relates to other areas and learn of the research issues yet to be resolved - Quick tutorial reviews of important and emerging topics of research in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing - Presents core principles and shows their application - Reference content on core principles, technologies, algorithms and applications - Comprehensive references to journal articles and other literature on which to build further, more specific and detailed knowledge - Edited by leading people in the field who, through their reputation, have been able to commission experts to write on a particular topic