Spoken Language Processing


Book Description

Remarkable progress is being made in spoken language processing, but many powerful techniques have remained hidden in conference proceedings and academic papers, inaccessible to most practitioners. In this book, the leaders of the Speech Technology Group at Microsoft Research share these advances -- presenting not just the latest theory, but practical techniques for building commercially viable products.KEY TOPICS: Spoken Language Processing draws upon the latest advances and techniques from multiple fields: acoustics, phonology, phonetics, linguistics, semantics, pragmatics, computer science, electrical engineering, mathematics, syntax, psychology, and beyond. The book begins by presenting essential background on speech production and perception, probability and information theory, and pattern recognition. The authors demonstrate how to extract useful information from the speech signal; then present a variety of contemporary speech recognition techniques, including hidden Markov models, acoustic and language modeling, and techniques for improving resistance to environmental noise. Coverage includes decoders, search algorithms, large vocabulary speech recognition techniques, text-to-speech, spoken language dialog management, user interfaces, and interaction with non-speech interface modalities. The authors also present detailed case studies based on Microsoft's advanced prototypes, including the Whisper speech recognizer, Whistler text-to-speech system, and MiPad handheld computer.MARKET: For anyone involved with planning, designing, building, or purchasing spoken language technology.




Spoken Language Understanding


Book Description

Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/ machine and human/ human communication by leveraging technologies from signal processing, pattern recognition, machine learning and artificial intelligence. SLU systems are designed to extract the meaning from speech utterances and its applications are vast, from voice search in mobile devices to meeting summarization, attracting interest from both commercial and academic sectors. Both human/machine and human/human communications can benefit from the application of SLU, using differing tasks and approaches to better understand and utilize such communications. This book covers the state-of-the-art approaches for the most popular SLU tasks with chapters written by well-known researchers in the respective fields. Key features include: Presents a fully integrated view of the two distinct disciplines of speech processing and language processing for SLU tasks. Defines what is possible today for SLU as an enabling technology for enterprise (e.g., customer care centers or company meetings), and consumer (e.g., entertainment, mobile, car, robot, or smart environments) applications and outlines the key research areas. Provides a unique source of distilled information on methods for computer modeling of semantic information in human/machine and human/human conversations. This book can be successfully used for graduate courses in electronics engineering, computer science or computational linguistics. Moreover, technologists interested in processing spoken communications will find it a useful source of collated information of the topic drawn from the two distinct disciplines of speech processing and language processing under the new area of SLU.







Dialogue Processing in Spoken Language Systems


Book Description

This book constitutes the strictly refereed post-workshop documentation of the ECAI'96 Workshop on Dialogue Processing in Spoken Language Systems, held in Budapest, Hungary, in August 1996, during ECAI'96. The volume presents 16 revised full papers including a detailed introduction and survey paper by the volume editors. The papers are organized in sections on foundations of spoken language dialogue systems, dialogue systems and prosodic aspects of spoken dialogue processing, spoken dialogue systems-design and implementation, and evaluation of systems. The book reports on work being pursued both in academia and in industry as a crucial issue in speech processing.







Handbook of Multimodal and Spoken Dialogue Systems


Book Description

Dictation systems, read-aloud software for the blind, speech control of machinery, geographical information systems with speech input and output, and educational software with `talking head' artificial tutorial agents are already on the market. The field is expanding rapidly, and new methods and applications emerge almost daily. But good sources of systematic information have not kept pace with the body of information needed for development and evaluation of these systems. Much of this information is widely scattered through speech and acoustic engineering, linguistics, phonetics, and experimental psychology. The Handbook of Multimodal and Spoken Dialogue Systems presents current and developing best practice in resource creation for speech input/output software and hardware. This volume brings experts in these fields together to give detailed `how to' information and recommendations on planning spoken dialogue systems, designing and evaluating audiovisual and multimodal systems, and evaluating consumer off-the-shelf products. In addition to standard terminology in the field, the following topics are covered in depth: How to collect high quality data for designing, training, and evaluating multimodal and speech dialogue systems; How to evaluate real-life computer systems with speech input and output; How to describe and model human-computer dialogue precisely and in depth. Also included: The first systematic medium-scale compendium of terminology with definitions. This handbook has been especially designed for the needs of development engineers, decision-makers, researchers, and advanced level students in the fields of speech technology, multimodal interfaces, multimedia, computational linguistics, and phonetics.




Spoken Language Systems


Book Description

Speech processing research in Japan started in the 1940s. This book provides a compendium of the prominent studies on spoken language systems developed in Japan. It offers a comprehensive introduction to the major works conducted at Japanese research institutes that are developing spoken language systems.




Spontaneous Spoken Language


Book Description

Jim Miller and Regina Weinert investigate syntactic structure and the organization of discourse in spontaneous spoken language. Using data from English, German, and Russian, they develop a systematic analysis of spoken English and highlight properties that hold across languages. The authors argue that the differences in syntax and the construction of discourse between spontaneous speech and written language bear on various areas of linguistic theory, apart from having obvious implications for syntactic analysis. In particular, they bear on typology, Chomskyan theories of first language acquisition, and the perennial problem of language in education. In current typological practice written and spontaneous spoken texts are often compared; the authors show convincingly that typological research should compare like with like. The consequences for Chomskyan, and indeed all, theories of first language acquisition flow from the central fact that children acquire spoken language but learn written language.




The Discovery of Spoken Language


Book Description

The Discovery of Spoken Language marks one of the first efforts to integrate the field of infant speech perception research into the general study of language acquisition. It fills in a key part of the acquisition story by providing an extensive review of research on the acquisition of language during the first year of life, focusing primarily on how normally developing infants learn the organization of native language sound patterns. Peter Jusczyk examines the initial capacities that infants possess for discriminating and categorizing speech sounds and how these capacities evolve as infants gain experience with native language input. Jusczyk also looks at how infants' growing knowledge of native language sound patterns may facilitate the acquisition of other aspects of language organization and discusses the relationship between the learner's developing capacities for perceiving and producing speech.




The Spoken Language Translator


Book Description

This book describes the Spoken Language Translator (SLT), one of the first major projects in the area of automatic speech translation.