Book Description
An introduction to recent results of Japanese research and development in the field of speech synthesis.
Author : Jeffry H. Shirai
Publisher : CRC Press
Page : 110 pages
File Size : 44,39 MB
Release : 2000-08-10
Category : Technology & Engineering
ISBN : 1482287374
An introduction to recent results of Japanese research and development in the field of speech synthesis.
Author : Jeffry H. Shirai
Publisher : CRC Press
Page : 116 pages
File Size : 35,45 MB
Release : 2000-08-10
Category : Technology & Engineering
ISBN : 9789056990954
4.2.2. Voice Conversion Based on Piecewise Linear Conversion Rules of Formant Frequency [Mizuno-95] -- Making Formant Frequency Conversion Rules (off-line procedures) -- Voice Conversion Algorithm (on-line procedures) -- 4.2.3. Performance Evaluation -- References -- Index
Author : Akira Kurematsu
Publisher : CRC Press
Page : 132 pages
File Size : 42,55 MB
Release : 2023-03-31
Category : Technology & Engineering
ISBN : 1000657868
Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and speech recognition. Topics covered include: fundamental concepts of speech recognition; speech pattern representation; phoneme-based HMM phoneme recognition; continuous speech recognition; speaker adaptation; speaker-independent speech recognition; utterance analysis, utterance transfer, utterance generation; contextual processing; speech synthesis and an experimental system of speech translation. This book presents the complicated technological aspects of machine translation and speech recognition, and outlines the future directions of this rapidly developing area of technology.
Author : Chieko Aoki
Publisher :
Page : 22 pages
File Size : 40,58 MB
Release : 1986
Category : Japanese language
ISBN :
Author : Shuzo Saito
Publisher : IOS Press
Page : 402 pages
File Size : 28,94 MB
Release : 1992
Category : Acoustics
ISBN : 9784274075810
Author : Jan P.H. van Santen
Publisher : Springer Science & Business Media
Page : 591 pages
File Size : 22,84 MB
Release : 2013-06-29
Category : Technology & Engineering
ISBN : 1461218942
For a machine to convert text into sounds that humans can understand as speech requires an enormous range of components, from abstract analysis of discourse structure to synthesis and modulation of the acoustic output. Work in the field is thus inherently interdisciplinary, involving linguistics, computer science, acoustics, and psychology. This collection of articles by leading researchers in each of the fields involved in text-to-speech synthesis provides a picture of recent work in laboratories throughout the world and of the problems and challenges that remain. By providing samples of synthesized speech as well as video demonstrations for several of the synthesizers discussed, the book will also allow the reader to judge what all the work adds up to -- that is, how good is the synthetic speech we can now produce? Topics covered include: signal processing and source modeling; linguistic analysis; articulatory synthesis and visual speech; concatenative synthesis and automated segmentation; prosodic analysis of natural speech; synthesis of prosody; evaluation and perception; systems and applications.
Author : Ikuko Patricia Yuasa
Publisher : Equinox Publishing
Page : 184 pages
File Size : 26,93 MB
Release : 2008
Category : Language Arts & Disciplines
ISBN :
The major task of this book is a sociophonetic exploration of voice pitch characteristics of speakers across the cultures of Japan and America. This volume makes a cogent argument for the socio-cultural role of voice pitch in the expression of emotion and politeness and how culture and gender can intersect with each other. The book tenders acoustic phonetic evidence (as well as discourse analyses) in construing how an individual's voice pitch modulation utilized in conversational speech is reflected in this intersection as it demonstrates several methodological innovations crucial for sociophonetic research. Observations of people's voice pitch commonly made impressionistically not only contribute to this prosodic feature's perceptual stereotypes, but also inform us about our attitudes towards certain voice pitch characteristics. This volume includes an extensive review of these impressionistic remarks and acoustic phonetic investigations of voice pitch initiated in the early 20th century in the two nations, the latter of which contributed to both confirming and reconsidering the former. The volume further alludes to how attitudinal differences between these cultures were found to surface in the acoustically measured voice pitch modulation patterns obtained for this volume, stressing that voice pitch is capable of revealing various socio-cultural aspects of human behaviors.
Author : Yutaka Kidawara
Publisher : Springer Nature
Page : 103 pages
File Size : 21,17 MB
Release : 2019-11-22
Category : Computers
ISBN : 9811505950
This book gives readers retrospective and prospective views of speech-to-speech translation (S2S), with detailed explanations of its component technologies: speech recognition, language translation, and speech synthesis. An S2S system breaks down language barriers, enabling communication between any pair of people on the globe, which is one of humankind's greatest dreams. People, society, and the economy connected by S2S will demonstrate explosive growth without exception. In 1986, Japan initiated basic research on S2S; the idea then spread worldwide and was explored in depth by researchers over three decades. S2S applications now run on smartphones and tablets around the world. Computational resources such as processors, memory, and wireless communication accelerate these computation-intensive systems, and the accumulation of digital speech and language data encourages recent approaches based on machine learning. Through field experiments following long research in laboratories, S2S systems have become well developed and are now ready to be used in daily life. A unique chapter of this book is the end-to-end evaluation, which compares the system’s performance with human competence; the score of this evaluation indicates the effectiveness of the system. The book ends by noting that one of the next focuses of S2S will be simultaneous interpretation technology for lectures, broadcast news, and so on.
Author :
Publisher :
Page : 0 pages
File Size : 34,22 MB
Release : 2023
Category :
ISBN : 9780429333385
Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and speech recognition. Topics covered include: fundamental concepts of speech recognition; speech pattern representation; phoneme-based HMM phoneme recognition; continuous speech recognition; speaker adaptation; speaker-independent speech recognition; utterance analysis, utterance transfer, utterance generation; contextual processing; speech synthesis and an experimental system of speech translation. This book presents the complicated technological aspects of machine translation and speech recognition, and outlines the future directions of this rapidly developing area of technology.
Author : Keikichi Hirose
Publisher : Springer
Page : 212 pages
File Size : 39,12 MB
Release : 2015-02-25
Category : Language Arts & Disciplines
ISBN : 3662452588
The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already been achieved in speech synthesis by using selection-based methods with segments of human speech. Although the method enables synthetic speech with various voice qualities and speaking styles, it requires large speech corpora with targeted quality and style. Accordingly, speech conversion techniques are now of growing interest among researchers. HMM/GMM-based methods are widely used, but entail several major problems when viewed from the prosody perspective; prosodic features cover a wider time span than segmental features and their frame-by-frame processing is not always appropriate. The book offers a good overview of state-of-the-art studies on prosody in speech synthesis.