Speech Processing in the Auditory System


Book Description

Although speech is the primary behavioral medium by which humans communicate, its auditory basis is poorly understood, having profound implications on efforts to ameliorate the behavioral consequences of hearing impairment and on the development of robust algorithms for computer speech recognition. In this volume, the authors provide an up-to-date synthesis of recent research in the area of speech processing in the auditory system, bringing together a diverse range of scientists to present the subject from an interdisciplinary perspective. Of particular concern is the ability to understand speech in uncertain, potentially adverse acoustic environments, currently the bane of both hearing aid and speech recognition technology. There is increasing evidence that the perceptual stability characteristic of speech understanding is due, at least in part, to elegant transformations of the acoustic signal performed by auditory mechanisms. As a comprehensive review of speech's auditory basis, this book will interest physiologists, anatomists, psychologists, phoneticians, computer scientists, biomedical and electrical engineers, and clinicians.




Signals and Systems for Speech and Hearing


Book Description

This book introduces speech and hearing sciences students to the principles of "signal" and "system" analysis. Beginning with an examination of what signals and systems are, the book develops a thorough background from which many of the most important issues in speech and hearing can be tackled. Numerous illustrations.




The Speech Chain


Book Description

Originally published in 1963, The Speech Chain has been regarded as the classic, easy-to-read introduction to the fundamentals and complexities of speech communication. It provides a foundation for understanding the essential aspects of linguistics, acoustics and anatomy, and explores research and development into digital processing of speech and the use of computers for the generation of artificial speech and speech recognition. This interdisciplinary account will prove invaluable to students with little or no previous exposure to the study of language.




The Production of Speech


Book Description

This monograph arose from a conference on the Production of Speech held at the University of Texas at Austin on April 28-30, 1981. It was sponsored by the Center for Cognitive Science, the College of Liberal Arts, and the Linguistics and Psychology Departments. The conference was the second in a series of conferences on human experimental psychology: the first, held to commemorate the 50th anniversary of the founding of the Psychology Department, resulted in publication of the monograph Neural Mechanisms in Behavior, D. McFadden (Ed.), Springer-Verlag, 1980. The choice of the particular topic of the second conference was motivated by the belief that the state of knowledge of speech production had recently reached a critical mass, and that a good deal was to be gained from bringing together the foremost researchers in this field. The benefits were the opportunity for the participants to compare notes on their common problems, the publication of a monograph giving a comprehensive state-of-the-art picture of this research area, and the provision of enormous intellectual stimulus for local students of this topic.




Automatic Speech Recognition


Book Description

Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates; slow response time (minutes to hours) to instantaneous response time. These characteristics taken together increase the computational complexity of the problem by several orders of magnitude. Further, speech provides a challenging task domain which embodies many of the requirements of intelligent behavior: operate in real time; exploit vast amounts of knowledge, tolerate errorful, unexpected unknown input; use symbols and abstractions; communicate in natural language and learn from the environment. Voice input to computers offers a number of advantages. It provides a natural, fast, hands free, eyes free, location free input medium. However, there are many as yet unsolved problems that prevent routine use of speech as an input device by non-experts. These include cost, real time response, speaker independence, robustness to variations such as noise, microphone, speech rate and loudness, and the ability to handle non-grammatical speech. Satisfactory solutions to each of these problems can be expected within the next decade. Recognition of unrestricted spontaneous continuous speech appears unsolvable at present. However, by the addition of simple constraints, such as clarification dialog to resolve ambiguity, we believe it will be possible to develop systems capable of accepting very large vocabulary continuous speechdictation.




Neural Control of Speech


Book Description

A comprehensive and unified account of the neural computations underlying speech production, offering a theoretical framework bridging the behavioral and the neurological literatures. In this book, Frank Guenther offers a comprehensive, unified account of the neural computations underlying speech production, with an emphasis on speech motor control rather than linguistic content. Guenther focuses on the brain mechanisms responsible for commanding the musculature of the vocal tract to produce articulations that result in an acoustic signal conveying a desired string of syllables. Guenther provides neuroanatomical and neurophysiological descriptions of the primary brain structures involved in speech production, looking particularly at the cerebral cortex and its interactions with the cerebellum and basal ganglia, using basic concepts of control theory (accompanied by nontechnical explanations) to explore the computations performed by these brain regions. Guenther offers a detailed theoretical framework to account for a broad range of both behavioral and neurological data on the production of speech. He discusses such topics as the goals of the neural controller of speech; neural mechanisms involved in producing both short and long utterances; and disorders of the speech system, including apraxia of speech and stuttering. Offering a bridge between the neurological and behavioral literatures on speech production, the book will be a valuable resource for researchers in both fields.




Intelligent Speech Signal Processing


Book Description

Intelligent Speech Signal Processing investigates the utilization of speech analytics across several systems and real-world activities, including sharing data analytics, creating collaboration networks between several participants, and implementing video-conferencing in different application areas. Chapters focus on the latest applications of speech data analysis and management tools across different recording systems. The book emphasizes the multidisciplinary nature of the field, presenting different applications and challenges with extensive studies on the design, development and management of intelligent systems, neural networks and related machine learning techniques for speech signal processing.




Multilingual Speech Processing


Book Description

Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech processing from a multilingual perspective. By taking this all-inclusive approach to speech processing, the editors have included theories, algorithms, and techniques that are required to support spoken input and output in a large variety of languages. Multilingual Speech Processing presents a comprehensive introduction to research problems and solutions, both from a theoretical as well as a practical perspective, and highlights technology that incorporates the increasing necessity for multilingual applications in our global community. Current challenges of speech processing and the feasibility of sharing data and system components across different languages guide contributors in their discussions of trends, prognoses and open research issues. This includes automatic speech recognition and speech synthesis, but also speech-to-speech translation, dialog systems, automatic language identification, and handling non-native speech. The book is complemented by an overview of multilingual resources, important research trends, and actual speech processing systems that are being deployed in multilingual human-human and human-machine interfaces. Researchers and developers in industry and academia with different backgrounds but a common interest in multilingual speech processing will find an excellent overview of research problems and solutions detailed from theoretical and practical perspectives. - State-of-the-art research with a global perspective by authors from the USA, Asia, Europe, and South Africa - The only comprehensive introduction to multilingual speech processing currently available - Detailed presentation of technological advances integral to security, financial, cellular and commercial applications




Articulation and Phonological Disorders


Book Description

A classic in the field, Articulation and Phonological Disorders: Speech Sound Disorders in Children, 7e, presents the most up-to-date perspectives on the nature, assessment, and treatment of speech sound disorders. A must-have reference, this classic book delivers exceptional coverage of clinical literature and focuses on speech disorders of unknown causes. Offering a range of perspectives, it covers the normal aspects of speech sound articulation, normal speech sound acquisition, the classification of and factors related to the presence of phonological disorders, the assessment and remediation of speech sound disorders, and phonology as it relates to language and dialectal variations. This edition features twelve manageable chapters, including a new chapter on the classification of speech sound disorders, an expanded discussion of childhood apraxia of speech, additional coverage of evidence-based practices, and a look at both motor-based and linguistically-based treatment approaches.




Designing Interactive Speech Systems


Book Description

A description of the design and implementation of spoken language dialogue within the context of spoken language dialogue systems development. Using an applications-oriented SLDS developed through the Danish Dialogue project, the authors describe the complete process involved; and in so doing present several innovative practical tools, such as dialogue design guidelines, in-depth evaluation methodologies, and speech functionality analysis. Their approach is firmly applications-oriented, describing the results applicable to industry and showing how the development of advanced applications drives research rather than vice versa. For everyone working on the R&D of spoken language services, especially in the area of telecommunications.