Speech Analysis Synthesis and Perception


Book Description

The first edition of this book has enjoyed a gratifying existence. 1s sued in 1965, it found its intended place as a research reference and as a graduate-Ievel text. Research laboratories and universities reported broad use. Published reviews-some twenty-five in number-were universally kind. Subsequently the book was translated and published in Russian (Svyaz; Moscow, 1968) and Spanish (Gredos, S.A.; Madrid, 1972). Copies of the first edition have been exhausted for several years, but demand for the material continues. At the behest of the publisher, and with the encouragement of numerous colleagues, a second edition was begun in 1970. The aim was to retain the original format, but to expand the content, especially in the areas of digital communications and com puter techniques for speech signal processing. As before, the intended audience is the graduate-Ievel engineer and physicist, but the psycho physicist, phonetician, speech scientist and linguist should find material of interest.




Analysis, Synthesis, and Perception of Musical Sounds


Book Description

This book contains a complete and accurate mathematical treatment of the sounds of music with an emphasis on musical timbre. The book spans the range from tutorial introduction to advanced research and application to speculative assessment of its various techniques. All the contributors use a generalized additive sine wave model for describing musical timbre which gives a conceptual unity, but is of sufficient utility to be adapted to many different tasks.







Speech and Audio Signal Processing


Book Description

When Speech and Audio Signal Processing published in 1999, it stood out from its competition in its breadth of coverage and its accessible, intutiont-based style. This book was aimed at individual students and engineers excited about the broad span of audio processing and curious to understand the available techniques. Since then, with the advent of the iPod in 2001, the field of digital audio and music has exploded, leading to a much greater interest in the technical aspects of audio processing. This Second Edition will update and revise the original book to augment it with new material describing both the enabling technologies of digital music distribution (most significantly the MP3) and a range of exciting new research areas in automatic music content processing (such as automatic transcription, music similarity, etc.) that have emerged in the past five years, driven by the digital music revolution. New chapter topics include: Psychoacoustic Audio Coding, describing MP3 and related audio coding schemes based on psychoacoustic masking of quantization noise Music Transcription, including automatically deriving notes, beats, and chords from music signals. Music Information Retrieval, primarily focusing on audio-based genre classification, artist/style identification, and similarity estimation. Audio Source Separation, including multi-microphone beamforming, blind source separation, and the perception-inspired techniques usually referred to as Computational Auditory Scene Analysis (CASA).




Speech Physiology, Speech Perception, and Acoustic Phonetics


Book Description

This analysis of speech ranges from clarifying physiological, biological and neurological bases of speech through defining the principles of electrical and computer models of speech production.




Progress in Speech Synthesis


Book Description

For a machine to convert text into sounds that humans can understand as speech requires an enormous range of components, from abstract analysis of discourse structure to synthesis and modulation of the acoustic output. Work in the field is thus inherently interdisciplinary, involving linguistics, computer science, acoustics, and psychology. This collection of articles by leading researchers in each of the fields involved in text-to-speech synthesis provides a picture of recent work in laboratories throughout the world and of the problems and challenges that remain. By providing samples of synthesized speech as well as video demonstrations for several of the synthesizers discussed, the book will also allow the reader to judge what all the work adds up to -- that is, how good is the synthetic speech we can now produce? Topics covered include: Signal processing and source modeling Linguistic analysis Articulatory synthesis and visual speech Concatenative synthesis and automated segmentation Prosodic analysis of natural speech Synthesis of prosody Evaluation and perception Systems and applications.




Dynamics of Speech Production and Perception


Book Description

The idea that speech is a dynamic process is a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events-positions of the articulatory apparatus, waveform segments, and phonemes. Although this perspective has been mockingly referred to as "beads on a string", from the time of Henry Sweet's 19th century treatise almost up to our days specialists of speech science and speech technology have continued to conceptualize the speech signal as a sequence of static states interleaved with transitional elements reflecting the quasi-continuous nature of vocal production. This book, a collection of papers of which each looks at speech as a dynamic process and highlights one of its particularities, is dedicated to the memory of Ludmilla Andreevna Chistovich. At the outset, it was planned to be a Chistovich festschrift but, sadly, she passed away a few months before the book went to press. The 24 chapters of this volume testify to the enormous influence that she and her colleagues have had over the four decades since the publication of their 1965 monograph.




The Handbook of Speech Perception


Book Description

The Handbook of Speech Perception is a collection of forward-looking articles that offer a summary of the technical and theoretical accomplishments in this vital area of research on language. Now available in paperback, this uniquely comprehensive companion brings together in one volume the latest research conducted in speech perception Contains original contributions by leading researchers in the field Illustrates technical and theoretical accomplishments and challenges across the field of research and language Adds to a growing understanding of the far-reaching relevance of speech perception in the fields of phonetics, audiology and speech science, cognitive science, experimental psychology, behavioral neuroscience, computer science, and electrical engineering, among others.




Auditory Perception


Book Description

This revised and updated third edition describes the nature of sound, how sound is analyzed by the auditory system, and the rules and principles governing our interpretation of auditory input. It covers many topics including sound and the auditory system, locating sound sources, the basis for loudness judgments, perception of acoustic sequences, perceptual restoration of obliterated sounds, speech production and perception, and the relation of hearing to perception in general. Whilst keeping the consistent style of the previous editions, many new features have been added, including suggestions for further reading at the end of each chapter, a section on functional imaging of the brain, expanded information on pitch and infrapitch, and additional coverage of speech processing. Advanced undergraduate and graduate students interested in auditory perception, behavioral sciences, psychology, neurobiology, architectural acoustics, and the hearing sciences will find this book an excellent guide.




Introduction to Digital Speech Processing


Book Description

Provides the reader with a practical introduction to the wide range of important concepts that comprise the field of digital speech processing. Students of speech research and researchers working in the field can use this as a reference guide.