Speech Spectrum Analysis


Book Description

The accurate determination of the speech spectrum, particularly for short frames, is commonly pursued in diverse areas including speech processing, recognition, and acoustic phonetics. With this book the author makes the subject of spectrum analysis understandable to a wide audience, including those with a solid background in general signal processing and those without such background. In keeping with these goals, this is not a book that replaces or attempts to cover the material found in a general signal processing textbook. Some essential signal processing concepts are presented in the first chapter, but even there the concepts are presented in a generally understandable fashion as far as is possible. Throughout the book, the focus is on applications to speech analysis; mathematical theory is provided for completeness, but these developments are set off in boxes for the benefit of those readers with sufficient background. Other readers may proceed through the main text, where the key results and applications are presented in general heuristic terms and illustrated with software routines and practical "show-and-tell" discussions of the results. At some points, the book refers to and uses the implementations in the Praat speech analysis software package, which has the advantages that it is used by many scientists around the world and that it is free and open source software. At other points, special software routines have been developed and made available to complement the book, and these are provided in the Matlab programming language. If the reader has the basic Matlab package, he/she will be able to immediately implement the programs in that platform; no extra "toolboxes" are required.




Signal Analysis and Prediction


Book Description

Methods of signal analysis represent a broad research topic with applications in many disciplines, including engineering, technology, biomedicine, seismography, econometrics, and many others based upon the processing of observed variables. Even though these applications are widely different, the mathematical background behind them is similar and includes the use of the discrete Fourier transform and z-transform for signal analysis, and both linear and non-linear methods for signal identification, modelling, prediction, segmentation, and classification. These methods are in many cases closely related to optimization problems, statistical methods, and artificial neural networks. This book incorporates a collection of research papers based upon selected contributions presented at the First European Conference on Signal Analysis and Prediction (ECSAP-97) in Prague, Czech Republic, held June 24-27, 1997 at the Strahov Monastery. Even though the Conference was intended as a European Conference, at first initiated by the European Association for Signal Processing (EURASIP), it was very gratifying that it also drew significant support from other important scientific societies, including the IEE, Signal Processing Society of IEEE, and the Acoustical Society of America. The organizing committee was pleased that the response from the academic community to participate at this Conference was very large; 128 summaries written by 242 authors from 36 countries were received. In addition, the Conference qualified under the Continuing Professional Development Scheme to provide PD units for participants and contributors.




The Acoustic Analysis of Speech


Book Description

The Acoustic Analysis Of Speech presents essential information on modern methods for the acoustic analysis of speech. It assumes only a modest technical background and is intended for the reader who wants to know the basic issues in speech analysis but does not have an extensive background in engineering, physics or mathematics. The book discusses the basic methods for the acoustic analysis of speech in relation to (a) the acoustic theory of speech production and (b) measures of primary interest to speech scientists, speech-language pathologists, linguists, psychologists or others who are interested in the acoustic signal of speech. Readers will gain an understanding of theory, methods and databases pertaining to speech acoustics. The book offers a simple and straightforward explanation of all aspects of acoustic analysis, from recording the signal, to analysis methods, to sources of data on phonetic and suprasegmental aspects of speech. It includes references to acoustic data for several languages in addition to English. The book is written at a general introductory level for courses in Speech Science, Speech Acoustics, Experimental Phonetics, and Laboratory Instrumentation for Speech and Hearing.




Introduction to Digital Speech Processing


Book Description

Provides the reader with a practical introduction to the wide range of important concepts that comprise the field of digital speech processing. Students of speech research and researchers working in the field can use this as a reference guide.




New Spectral Methods for Analysis of Source/filter Characteristics of Speech Signals


Book Description

This study proposes a new spectral representation called the Zeros of Z-Transform (ZZT), which is an all-zero representation of the z-transform of the signal. In addition, new chirp group delay processing techniques are developed for analysis of resonances of a signal. The combination of the ZZT representation with the chirp group delay processing algorithms provides a useful domain to study resonance characteristics of source and filter components of speech. Using the two representations, effective algorithms are developed for: source-tract decomposition of speech, glottal flow parameter estimation, formant tracking and feature extraction for speech recognition. The ZZT representation is mainly important for theoretical studies. Studying the ZZT of a signal is essential to be able to develop effective chirp group delay processing methods. Therefore, first the ZZT representation of the source-filter model of speech is studied for providing a theoretical background. We confirm through ZZT representation that anti-causality of the glottal flow signal introduces mixed-phase characteristics in speech signals. The ZZT of windowed speech signals is also studied since windowing cannot be avoided in practical signal processing algorithms and the effect of windowing on ZZT representation is drastic. We show that separate patterns exist in ZZT representations of windowed speech signals for the glottal flow and the vocal tract contributions. A decomposition method for source-tract separation is developed based on these patterns in ZZT. We define chirp group delay as group delay calculated on a circle other than the unit circle in z-plane. The need to compute group delay on a circle other than the unit circle comes from the fact that group delay spectra are often very noisy and cannot be easily processed for formant tracking purposes (the reasons are explained through ZZT representation). 
In this thesis, we propose methods to avoid such problems by modifying the ZZT of a signal and further computing the chirp group delay spectrum. New algorithms based on processing of the chirp group delay spectrum are developed for formant tracking and feature estimation for speech recognition. The proposed algorithms are compared to state-of-the-art techniques. Equivalent or higher efficiency is obtained for all proposed algorithms. The theoretical parts of the thesis further discuss a mixed-phase model for speech and phase processing problems in detail. Index Terms—spectral representation, source-filter separation, glottal flow estimation, formant tracking, zeros of z-transform, group delay processing, phase processing.
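
The two definitions in the abstract above can be illustrated with a minimal sketch. It is not the thesis's algorithm: it only shows, under the usual finite-frame assumptions, that the ZZT of a frame is the set of roots of the polynomial formed by its samples, and that group delay on a circle of radius rho can be obtained from the DFT of the exponentially weighted frame. The function names `zzt` and `chirp_group_delay`, the parameters `rho` and `n_fft`, and the two-sample test frame are hypothetical choices made for illustration.

```python
import numpy as np


def zzt(frame):
    """Return the zeros of the z-transform of a finite frame x[0..N-1].

    X(z) = sum_n x[n] z^(-n) = z^(-(N-1)) * (x[0] z^(N-1) + ... + x[N-1]),
    so the zeros are the roots of the polynomial whose coefficients are the
    samples themselves, highest power first.
    """
    return np.roots(np.asarray(frame, dtype=float))


def chirp_group_delay(frame, rho=1.0, n_fft=512):
    """Group delay of X(z) sampled on the circle |z| = rho.

    X(rho * e^{jw}) is the DFT of y[n] = x[n] * rho^(-n), and the group delay
    is Re{ DFT(n * y[n]) / DFT(y[n]) }. rho = 1 gives the ordinary group delay.
    The result is undefined where X(z) has a zero exactly on the chosen circle.
    """
    x = np.asarray(frame, dtype=float)
    n = np.arange(len(x))
    y = x * rho ** (-n.astype(float))  # may overflow or underflow for long frames
    spectrum = np.fft.fft(y, n_fft)
    ramp_spectrum = np.fft.fft(n * y, n_fft)
    return np.real(ramp_spectrum / spectrum)


if __name__ == "__main__":
    # Hypothetical frame with a single zero just inside the unit circle.
    frame = [1.0, -0.99]
    print("zeros:", zzt(frame))
    print("group delay at w=0, unit circle:", chirp_group_delay(frame, 1.0, 8)[0])
    print("group delay at w=0, rho=1.2:", chirp_group_delay(frame, 1.2, 8)[0])
```

In this toy case the zero lies very close to the unit circle, so the ordinary group delay has a large spike at zero frequency, while evaluating on a circle further from the zero gives a much smaller value. This is the kind of behaviour the abstract points to when it says that group delay spectra on the unit circle are often too noisy to process directly.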




Intelligent Integrated Media Communication Techniques


Book Description

This volume contains many examples and applied methods explaining the basic architecture of mobile terminals. It includes sufficient introductory material to enable even non-expert readers to understand the topics and to take a step towards system integration of complex future applications.




Technical Abstract Bulletin


Book Description







Evolution in Computational Intelligence


Book Description

This book presents the proceedings of the 9th International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA 2021), held at NIT Mizoram, Aizawl, Mizoram, India, during June 25 – 26, 2021. The FICTA conference aims to bring together researchers, scientists, engineers, and practitioners to exchange their new ideas and experiences in the domain of intelligent computing theories with prospective applications to various engineering disciplines. This volume covers broad areas of Evolution in Computational Intelligence. The conference papers included herein present both theoretical and practical aspects of different areas such as ANN and genetic algorithms, human-computer interaction, intelligent control optimization, evolutionary computing, intelligent e-learning systems, machine learning, mobile computing, multi-agent systems, etc. The volume will also serve as a knowledge centre for post-graduate students in various engineering disciplines.




Maximum-Entropy and Bayesian Methods in Inverse Problems


Book Description

This volume contains the text of the twenty-five papers presented at two workshops entitled Maximum-Entropy and Bayesian Methods in Applied Statistics, which were held at the University of Wyoming from June 8 to 10, 1981, and from August 9 to 11, 1982. The workshops were organized to bring together researchers from different fields to critically examine maximum-entropy and Bayesian methods in science, engineering, medicine, oceanography, economics, and other disciplines. An effort was made to maintain an informal environment where ideas could be easily exchanged. That the workshops were at least partially successful is borne out by the fact that there have been two succeeding workshops, and the upcoming Fifth Workshop promises to be the largest of all. These workshops and their proceedings could not have been brought to their final form without the substantial help of a number of people. The support of David Hofmann, the past chairman, and Glen Rebka, Jr., the present chairman of the Physics Department of the University of Wyoming, has been strong and essential. Glen has taken a special interest in seeing that the proceedings have received the support required for their completion. The financial support of the Office of University Research Funds, University of Wyoming, is gratefully acknowledged. The secretarial staff, in particular Evelyn Haskell, Janice Gasaway, and Marce Mitchum, of the University of Wyoming Physics Department has contributed a great number of hours in helping C. Ray Smith organize and direct the workshops.