Time Domain Representation of Speech Sounds


Book Description

The book presents the history of time-domain representation and the extent of its development along with that of spectral domain representation in the cognitive and technology domains. It discusses all the cognitive experiments related to this development, along with details of technological developments related to both automatic speech recognition (ASR) and text-to-speech synthesis (TTS), and introduces a viable time-domain representation for both objective and subjective analysis, as an alternative to the well-known spectral representation. The book also includes a new cohort study on the use of lexical knowledge in ASR. India has numerous official dialects, and spoken-language technology development is a burgeoning area. In fact, TTS and ASR taken together constitute the most important technology for empowering people. As such, the book describes time-domain representation in such a way that it can be easily and seamlessly incorporated into ASR and TTS research and development. In short, it is a valuable guidebook for the development of ASR and TTS in all the Indian Standard Dialects using signal domain parameters.




Explorations in Time-Frequency Analysis


Book Description

Understand the methods of modern non-stationary signal processing with authoritative insights from a leader in the field.




Introduction to Digital Speech Processing


Book Description

Provides the reader with a practical introduction to the wide range of important concepts that comprise the field of digital speech processing. Students of speech research and researchers working in the field can use this as a reference guide.




Hearing Loss


Book Description

Millions of Americans experience some degree of hearing loss. The Social Security Administration (SSA) operates programs that provide cash disability benefits to people with permanent impairments like hearing loss, if they can show that their impairments meet stringent SSA criteria and their earnings are below an SSA threshold. The National Research Council convened an expert committee at the request of the SSA to study the issues related to disability determination for people with hearing loss. This volume is the product of that study. Hearing Loss: Determining Eligibility for Social Security Benefits reviews current knowledge about hearing loss and its measurement and treatment, and provides an evaluation of the strengths and weaknesses of the current processes and criteria. It recommends changes to strengthen the disability determination process and ensure its reliability and fairness. The book addresses criteria for selection of pure tone and speech tests, guidelines for test administration, testing of hearing in noise, special issues related to testing children, and the difficulty of predicting work capacity from clinical hearing test results. It should be useful to audiologists, otolaryngologists, disability advocates, and others who are concerned with people who have hearing loss.




Dynamics of Speech Production and Perception


Book Description

The idea that speech is a dynamic process is a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events: positions of the articulatory apparatus, waveform segments, and phonemes. Although this perspective has been mockingly referred to as "beads on a string", from the time of Henry Sweet's 19th-century treatise almost up to our days specialists of speech science and speech technology have continued to conceptualize the speech signal as a sequence of static states interleaved with transitional elements reflecting the quasi-continuous nature of vocal production. This book, a collection of papers, each of which looks at speech as a dynamic process and highlights one of its particularities, is dedicated to the memory of Ludmilla Andreevna Chistovich. At the outset, it was planned to be a Chistovich festschrift but, sadly, she passed away a few months before the book went to press. The 24 chapters of this volume testify to the enormous influence that she and her colleagues have had over the four decades since the publication of their 1965 monograph.




Soft Computing in Acoustics


Book Description

Applications of some selected soft computing methods to acoustics and sound engineering are presented in this book. The aim of this research study is the application of soft computing methods to musical signal analysis and to the recognition of musical sounds and phrases. Accordingly, some methods based on such learning algorithms as neural networks, rough sets and fuzzy logic were conceived, implemented and tested. Additionally, the above-mentioned methods were applied to the analysis and verification of subjective testing results. The last problem discussed within the framework of this book was the problem of fuzzy control of the classical pipe organ instrument. The obtained results show that computational intelligence and soft computing may be used for solving some vital problems in both musical and architectural acoustics.




Speech, Sound and Music Processing: Embracing Research in India


Book Description

This book constitutes the thoroughly refereed post-proceedings of the 8th International Symposium on Computer Music Modeling and Retrieval, CMMR 2011, and the 20th International Symposium on Frontiers of Research in Speech and Music, FRSM 2011. This year the two conferences merged for the first time and were held in Bhubaneswar, India, in March 2011. The 17 revised full papers presented were specially reviewed and revised for inclusion in this proceedings volume. The book is divided into four main chapters which reflect the high quality of the sessions of CMMR 2011, the collaboration with FRSM 2011 and the Indian influence, in the topics of Indian Music, Music Information Retrieval, Sound analysis, synthesis and perception, and Speech processing of Indian languages.




Speech Processing in Embedded Systems


Book Description

Speech Processing has rapidly emerged as one of the most widespread and well-understood application areas in the broader discipline of Digital Signal Processing. Besides the telecommunications applications that have hitherto been the largest users of speech processing algorithms, several non-traditional embedded processor applications are enhancing their functionality and user interfaces by utilizing various aspects of speech processing. "Speech Processing in Embedded Systems" describes several areas of speech processing, and the various algorithms and industry standards that address each of these areas. The topics covered include different types of Speech Compression, Echo Cancellation, Noise Suppression, Speech Recognition and Speech Synthesis. In addition, this book explores various issues and considerations related to efficient implementation of these algorithms on real-time embedded systems, including the role played by processor CPU and peripheral functionality.




Forensic Speaker Identification


Book Description

A voice is much more than just a string of words. Voices, unlike fingerprints, are inherently complex. They signal a great deal of information in addition to the intended message: the speakers' sex, for example, or their emotional state, or age. Although evidence from DNA analysis grabs the headlines, DNA can't talk. It can't be recorded planning,




Predicting Prosody from Text for Text-to-Speech Synthesis


Book Description

Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.