Computational Models of American Speech


Book Description

A new perspective on phonetic variation is achieved in this volume through the construction of a series of models of spoken American English. In the past, computer theorists and programmers investigating pronunciation have often relied on their own knowledge of the language or on limited transcription data. Speech recognition researchers, on the other hand, have drawn on a great deal of data but without examining in detail the information about pronunciation the data contains. The authors combine the best of each approach to develop probabilistic and rule-based computational models of transcription data. An ongoing controversy in studies of phonetic variation is the existence and proper definition of a phonetic unit. The authors argue that assumptions about the units of spoken language are critical to a computational model. Their computational models employ suprasegmental elements such as syllable boundaries, stress, and position in a unit called a metrical foot. The use of such elements in modeling data enables the creation of better computational models for both recognition and synthesis technology. This book should be of interest to speech engineers, linguists, and anyone who wishes to understand symbolic systems of communication.




Computational Models of Speech Pattern Processing


Book Description

Proceedings of the NATO Advanced Study Institute on Computational Models of Speech Pattern Processing, held in St. Helier, Jersey, UK, July 7-18, 1997




Computing PROSODY


Book Description

The prosody of spontaneous speech - A typology of spontaneous speech - Prosody, models, and spontaneous speech - On the analysis of prosody in interaction - Prosody and the structure of the message - Integrating prosodic and discourse modelling - Prosodic features of utterances in task-oriented dialogues - Variation of accent prominence within the phrase : models and spontaneous speech data - Predicting the intonation of discourse segments from examples in dialogue speech - Effects of focus on duration and vowel formant frequency in Japanese - Prosody in speech synthesis - Synthesizing spontaneous speech - Modelling prosody in spontaneous speech - Comparison of FO control rules derived from multiple speech databases - Segmental duration and speech timing - Measuring temporal compensation effect in speech perception - Prediction of major phrase boundary location and pause insertion using a stochastic context-free grammar - Prosody in speech recognition - A multi-level model for recognit ...




Cognitive Models of Speech Processing


Book Description

Cognitive Models of Speech Processing presents extensive reviews of current thinking on psycholinguistic and computational topics in speech recognition and natural-language processing, along with a substantial body of new experimental data and computational simulations. Topics range from lexical access and the recognition of words in continuous speech to syntactic processing and the relationship between syntactic and intonational structure. A Bradford Book. ACL-MIT Press Series in Natural Language Processing







Computational Modeling of Human Language Acquisition


Book Description

In doing so, computational modeling provides insight into the plausible mechanisms involved in human language acquisition, and inspires the development of better language models and techniques. This book provides an overview of the main research quesetions in the field of human language acquisition. It reviews the most commonly used computational frameworks, methodologies and resources for modeling child language learning, and the evaluation techniques used for assessing these computational models. The book is aimed at cognitive scientists who want to become familiar with the available computational methods for investigating problems related to human language acquisition, as well as computational linguists who are interested in applying their skills to the study of child language acquisition.







The Cambridge Handbook of Computational Psychology


Book Description

A cutting-edge reference source for the interdisciplinary field of computational cognitive modeling.




Dynamic Speech Models


Book Description

Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation of a linguistic message in a speaker's brain and ends with the arrival of the message in a listener's brain. Given the intricacy of the dynamic speech process and its fundamental importance in human communication, this monograph is intended to provide a comprehensive material on mathematical models of speech dynamics and to address the following issues: How do we make sense of the complex speech process in terms of its functional role of speech communication? How do we quantify the special role of speech timing? How do the dynamics relate to the variability of speech that has often been said to seriously hamper automatic speech recognition? How do we put the dynamic process of speech into a quantitative form to enable detailed analyses? And finally, how can we incorporate the knowledge of speech dynamics into computerized speech analysis and recognition algorithms? The answers to all these questions require building and applying computational models for the dynamic speech process. What are the compelling reasons for carrying out dynamic speech modeling? We provide the answer in two related aspects. First, scientific inquiry into the human speech code has been relentlessly pursued for several decades. As an essential carrier of human intelligence and knowledge, speech is the most natural form of human communication. Embedded in the speech code are linguistic (as well as para-linguistic) messages, which are conveyed through four levels of the speech chain. Underlying the robust encoding and transmission of the linguistic messages are the speech dynamics at all the four levels. Mathematical modeling of speech dynamics provides an effective tool in the scientific methods of studying the speech chain. Such scientific studies help understand why humans speak as they do and how humans exploit redundancy and variability by way of multitiered dynamic processes to enhance the efficiency and effectiveness of human speech communication. Second, advancement of human language technology, especially that in automatic recognition of natural-style human speech is also expected to benefit from comprehensive computational modeling of speech dynamics. The limitations of current speech recognition technology are serious and are well known. A commonly acknowledged and frequently discussed weakness of the statistical model underlying current speech recognition technology is the lack of adequate dynamic modeling schemes to provide correlation structure across the temporal speech observation sequence. Unfortunately, due to a variety of reasons, the majority of current research activities in this area favor only incremental modifications and improvements to the existing HMM-based state-of-the-art. For example, while the dynamic and correlation modeling is known to be an important topic, most of the systems nevertheless employ only an ultra-weak form of speech dynamics; e.g., differential or delta parameters. Strong-form dynamic speech modeling, which is the focus of this monograph, may serve as an ultimate solution to this problem. After the introduction chapter, the main body of this monograph consists of four chapters. They cover various aspects of theory, algorithms, and applications of dynamic speech models, and provide a comprehensive survey of the research work in this area spanning over past 20~years. This monograph is intended as advanced materials of speech and signal processing for graudate-level teaching, for professionals and engineering practioners, as well as for seasoned researchers and engineers specialized in speech processing