A model of sonority based on pitch intelligibility


Book Description

Sonority is a central notion in phonetics and phonology and it is essential for generalizations related to syllabic organization. However, to date there is no clear consensus on the phonetic basis of sonority, neither in perception nor in production. The widely used Sonority Sequencing Principle (SSP) represents the speech signal as a sequence of discrete units, where phonological processes are modeled as symbol manipulating rules that lack a temporal dimension and are devoid of inherent links to perceptual, motoric or cognitive processes. The current work aims to change this by outlining a novel approach for the extraction of continuous entities from acoustic space in order to model dynamic aspects of phonological perception. It is used here to advance a functional understanding of sonority as a universal aspect of prosody that requires pitch-bearing syllables as the building blocks of speech.This book argues that sonority is best understood as a measurement of pitch intelligibility in perception, which is closely linked to periodic energy in acoustics. It presents a novel principle for sonority-based determinations of well-formedness – the Nucleus Attraction Principle (NAP). Two complementary NAP models independently account for symbolic and continuous representations and they mostly outperform SSP-based models, demonstrated here with experimental perception studies and with a corpus study of Modern Hebrew nouns. This work also includes a description of ProPer (Prosodic Analysis with Periodic Energy). The ProPer toolbox further exploits the proposal that periodic energy reflects sonority in order to cover major topics in prosodic research, such as prominence, intonation and speech rate. The book is finally concluded with brief discussions on selected topics: (i) the phonotactic division of labor with respect to /s/-stop clusters; (ii) the debate about the universality of sonority; and (iii) the fate of the classic phonetics–phonology dichotomy as it relates to continuity and dynamics in phonology.




Conversation and intonation in autism: A multi-dimensional analysis


Book Description

This book provides an in-depth, multi-dimensional analysis of conversations between autistic adults. The investigation is focussed on intonation style, turn-taking and the use of backchannels, filled pauses and silent pauses. Previous findings on intonation style in the context of autism spectrum disorder (ASD) are contradictory, with claims ranging from characteristically monotonous to characteristically melodic intonation. A novel methodology for quantifying intonation style is used, and it is revealed that autistic speakers tended towards a more melodic intonation style compared to control speakers in the data set under investigation. Research on turn-taking (the organisation of who speaks when in conversation) in ASD is limited, with most studies claiming a tendency for longer silent gaps in ASD. No clear overall difference in turn-timing between the ASD and the control group was found in the data under study. There was, however, a clear difference between groups specifically in the earliest stages of dialogue, where ASD dyads produced considerably longer silent gaps than controls. Backchannels (listener signals such as mmhm or okay) have barely been investigated in ASD to date. The current analysis shows that autistic speakers produced fewer backchannels per minute (particularly in the early stages of dialogue), and that backchannels were less diverse prosodically and lexically. Filled pauses (hesitation signals such as uhm and uh) in ASD have been the subject of a handful of previous studies, most of which claim that autistic speakers produced fewer uhm tokens (specifically). It is shown that filled pauses were produced at an identical rate in both groups and that there was an equivalent preference of uhm over uh. ASD speakers differed only in the prosodic realisation of filled pauses. It is further shown that autistic speakers produced more long silent (within-speaker) pauses than controls. The analyses presented in this book provide new insights into conversation strategies and intonation styles in ASD, as reviewed in a summary analysis. The findings are discussed in the context of previous research, general characteristics of cognition in ASD, and the importance of studying communication in interaction and across neurotypes.




Second Language Prosody and Computer Modeling


Book Description

This volume presents an interdisciplinary approach to the study of second language prosody and computer modeling. It addresses the importance of prosody’s role in communication, bridging the gap between applied linguistics and computer science. The book illustrates the growing importance of the relationship between automated speech recognition systems and language learning assessment in light of new technologies and showcases how the study of prosody in this context in particular can offer innovative insights into the computerized process of natural discourse. The book offers detailed accounts of different methods of analysis and computer models used and demonstrates how these models can be applied to L2 discourse analysis toward predicting real-world language use. Kang, Johnson, and Kermad also use these frameworks as a jumping-off point from which to propose new models of second language prosody and future directions for prosodic computer modeling more generally. Making the case for the use of naturalistic data for real-world applications in empirical research, this volume will foster interdisciplinary dialogues across students and researchers in applied linguistics, speech communication, speech science, and computer engineering.







Relevant Acoustic Phonetics of L2 English


Book Description

This book applies four relevant concepts in acoustic phonetics and proposes a new approach for assessing the intelligibility of second language pronunciation instrumentally.




The Power of Sound


Book Description




The Journal of the Acoustical Society of Japan (E).


Book Description

Contains English abstracts of original papers and letters to the editor that appear in the Japanese edition.




Speech Processing in the Auditory System


Book Description

Although speech is the primary behavioral medium by which humans communicate, its auditory basis is poorly understood, having profound implications on efforts to ameliorate the behavioral consequences of hearing impairment and on the development of robust algorithms for computer speech recognition. In this volume, the authors provide an up-to-date synthesis of recent research in the area of speech processing in the auditory system, bringing together a diverse range of scientists to present the subject from an interdisciplinary perspective. Of particular concern is the ability to understand speech in uncertain, potentially adverse acoustic environments, currently the bane of both hearing aid and speech recognition technology. There is increasing evidence that the perceptual stability characteristic of speech understanding is due, at least in part, to elegant transformations of the acoustic signal performed by auditory mechanisms. As a comprehensive review of speech's auditory basis, this book will interest physiologists, anatomists, psychologists, phoneticians, computer scientists, biomedical and electrical engineers, and clinicians.




Wagner's Lexical Tonality


Book Description

The four scores of the Ring dramas are analyzed bar-by-bar to derive a complete linear harmonic analysis-based readout of each of its keys and claimed lexical references. The chapters on Parsif al discuss its mediaeval sources as suggested by Wagner's prose writings, letters, and religious discourse to argue for the Gnostic and alchemist basis of its libretto imagery, lexical tonality, and anti-Semitism.