The Waveform Model of Vowel Perception and Production


Book Description

This book presents a new model of vowel perception and production derived from visual cues identified in waveform displays. In addition to describing waveform displays of vowels beyond previous descriptions, included in the book are descriptions of experimental evidence supporting near 100% vowel identification accuracy across 20 male talkers using the concepts in the model. The book content will be of interest to several academic fields including Cognitive Science, Psychology, Linguistics, Speech and Hearing, Language Acquisition, Neurolinguistics, Phonetics, and areas within Physics and Mathematics. Beyond these academic fields, the new model of vowel perception presented here could possibly be used to improve accuracy and speed within existing speech recognition systems, or it could be used to generate a new speech recognition program. Many speech recognition programs are based on simple statistical programs like Hidden Markov Models that ignore any theoretical basis to speech recognition. The Waveform Model differs from the HMM approaches since it has a theoretical basis rooted in articulation and that has potentially more promise than these simple HMM models that just take overall similarities in waveforms and try to match them to phonemes and words. Furthermore, many of the speech recognition programs use extensive training by a single user (in quiet conditions) in order to attain over 90% accuracy, which is still a relatively poor performance. The Waveform Model requires no training, can be used across talkers, and has accuracy above reported speech recognition performance (specific to vowels). In summary, the Waveform Model is innovative, and new to the literature and research communities.




Dynamics of Speech Production and Perception


Book Description

The idea that speech is a dynamic process is a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events-positions of the articulatory apparatus, waveform segments, and phonemes. Although this perspective has been mockingly referred to as "beads on a string", from the time of Henry Sweet's 19th century treatise almost up to our days specialists of speech science and speech technology have continued to conceptualize the speech signal as a sequence of static states interleaved with transitional elements reflecting the quasi-continuous nature of vocal production. This book, a collection of papers of which each looks at speech as a dynamic process and highlights one of its particularities, is dedicated to the memory of Ludmilla Andreevna Chistovich. At the outset, it was planned to be a Chistovich festschrift but, sadly, she passed away a few months before the book went to press. The 24 chapters of this volume testify to the enormous influence that she and her colleagues have had over the four decades since the publication of their 1965 monograph.




Vowel Perception and Production


Book Description

The last 50 years have witnessed a rapid growth in the understanding of the articulation and the acoustics of vowels. Contemporary theories of speech perception have concentrated on consonant perception, and this volume is intended as a balance to such bias. The authors propose a computational theory of auditory vowel perception, accounting for vowel identification in the face of acoustic differences between speakers and speaking rate and stress. This work lays the foundation for future experimental and computational studies of vowel perception.




Technological Resources for Second Language Pronunciation Learning and Teaching


Book Description

Second language (L2) pronunciation has become increasingly visible as an important area of L2 teaching and research. Despite the growing number of resources available focused on L2 pronunciation, technology in L2 pronunciation has received much less attention. While technology has been an enduring strand of L2 pronunciation research, it has also been somewhat inconspicuous. Indeed, research has examined a wide variety of technologies such as language-learning platforms, speech visualization software, and Automatic Speech Recognition. Despite the abundance of research, it can be difficult to gain a full sense of work in this area given the lack of a comprehensive and consolidated resource or reference. This book endeavors to fill that gap and make L2 pronunciation technologies more visible by providing teachers and researchers an introduction to research in a wide variety of technologies that can support pronunciation learning. While working to introduce practitioners to numerous technologies available, it also dives into the research-basis for their use, providing new studies and data featuring a wide variety of languages and learning contexts.




Speech Production and Perception


Book Description

This book aims to develop a framework for a fully explanatory theory of speech production and speech perception. It emphasises the difference between static models (primarily descriptive) and dynamic models that attempt to show how the basic linguistics and phonetics are related in an actual human speaker/listener.




Handbook of Vowels and Vowel Disorders


Book Description

In the general study of speech and phonetics, vowels have stood in second place to consonants. But what vowels are, how they differ from one another, how they vary among speakers, and how they are subject to disorder, are questions that require a closer examination. This Handbook presents a comprehensive, cogent, and up-to-date analysis of the vowel, including its typical development in children's speech, description by perceptual and instrumental methods, cross-linguistic and sociolinguistic aspects, and disorders of its production and use. It approaches the problems of vowel production and perception from the viewpoints of physiology, physics, psychology, linguistics, phonetics, phonology, and speech-language pathology. The chapters are logically complementary, and the major sections of the book are like key dimensions of understanding, each adding a perspective and base of knowledge on vowels. The sum total of the chapters is a synthesis of information on vowels that has no precedent.




Perception and Production of Fluent Speech


Book Description

Originally published in 1980, this title looks at the mental processes involved in producing and understanding spoken language. Although there had been several edited volumes on speech in the previous ten years, this volume was unique in that it deals exclusively with perception and production of fluent speech. The chapters in this volume, contributed to by distinguished scientists from psychology, linguistics and computer science, deal with such questions as: How are ideas encoded into sound? How does a speaker plan an utterance? How are words recognized? What is the role of knowledge in speech perception? In short, how do people communicate with each other using speech?




Second Language Speech


Book Description

Second language acquisition has rapidly grown as a field over the past decade, as our knowledge of the ways in which children and adults learn and use a second language has become crucial for effective language teaching. In addition to this important 'applied' function, research into second language acquisition has also informed the fields of linguistics and psychology in general, as it has shed light on the differences between native and non-native models of human language and cognition. The focus of this accessible new book is second language speech - that is, how speakers perceive, process, understand and pronounce the sounds of a second language. Each chapter includes review questions, and most chapters include 'tutorial' and 'lab' sections with practical exercises based on the University of Toronto Romance Phonetics Database (available online for free). The book also has a companion website, containing illustrated answers to the exercises, scripts for running acoustic analyses and useful weblinks.




Speech Physiology, Speech Perception, and Acoustic Phonetics


Book Description

This analysis of speech ranges from clarifying physiological, biological and neurological bases of speech through defining the principles of electrical and computer models of speech production.




Discrete-Time Speech Signal Processing


Book Description

Essential principles, practical examples, current applications, and leading-edge research. In this book, Thomas F. Quatieri presents the field's most intensive, up-to-date tutorial and reference on discrete-time speech signal processing. Building on his MIT graduate course, he introduces key principles, essential applications, and state-of-the-art research, and he identifies limitations that point the way to new research opportunities. Quatieri provides an excellent balance of theory and application, beginning with a complete framework for understanding discrete-time speech signal processing. Along the way, he presents important advances never before covered in a speech signal processing text book, including sinusoidal speech processing, advanced time-frequency analysis, and nonlinear aeroacoustic speech production modeling. Coverage includes: Speech production and speech perception: a dual view Crucial distinctions between stochastic and deterministic problems Pole-zero speech models Homomorphic signal processing Short-time Fourier transform analysis/synthesis Filter-bank and wavelet analysis/synthesis Nonlinear measurement and modeling techniques The book's in-depth applications coverage includes speech coding, enhancement, and modification; speaker recognition; noise reduction; signal restoration; dynamic range compression, and more. Principles of Discrete-Time Speech Processing also contains an exceptionally complete series of examples and Matlab exercises, all carefully integrated into the book's coverage of theory and applications.