Spoken, Multilingual and Multimodal Dialogue Systems


Book Description

Dialogue systems are a very appealing technology with an extraordinary future. Spoken, Multilingual and Multimodal Dialogues Systems: Development and Assessment addresses the great demand for information about the development of advanced dialogue systems combining speech with other modalities under a multilingual framework. It aims to give a systematic overview of dialogue systems and recent advances in the practical application of spoken dialogue systems. Spoken Dialogue Systems are computer-based systems developed to provide information and carry out simple tasks using speech as the interaction mode. Examples include travel information and reservation, weather forecast information, directory information and product order. Multimodal Dialogue Systems aim to overcome the limitations of spoken dialogue systems which use speech as the only communication means, while Multilingual Systems allow interaction with users that speak different languages. Presents a clear snapshot of the structure of a standard dialogue system, by addressing its key components in the context of multilingual and multimodal interaction and the assessment of spoken, multilingual and multimodal systems In addition to the fundamentals of the technologies employed, the development and evaluation of these systems are described Highlights recent advances in the practical application of spoken dialogue systems This comprehensive overview is a must for graduate students and academics in the fields of speech recognition, speech synthesis, speech processing, language, and human–computer interaction technolgy. It will also prove to be a valuable resource to system developers working in these areas.




Handbook of Multimodal and Spoken Dialogue Systems


Book Description

Dictation systems, read-aloud software for the blind, speech control of machinery, geographical information systems with speech input and output, and educational software with `talking head' artificial tutorial agents are already on the market. The field is expanding rapidly, and new methods and applications emerge almost daily. But good sources of systematic information have not kept pace with the body of information needed for development and evaluation of these systems. Much of this information is widely scattered through speech and acoustic engineering, linguistics, phonetics, and experimental psychology. The Handbook of Multimodal and Spoken Dialogue Systems presents current and developing best practice in resource creation for speech input/output software and hardware. This volume brings experts in these fields together to give detailed `how to' information and recommendations on planning spoken dialogue systems, designing and evaluating audiovisual and multimodal systems, and evaluating consumer off-the-shelf products. In addition to standard terminology in the field, the following topics are covered in depth: How to collect high quality data for designing, training, and evaluating multimodal and speech dialogue systems; How to evaluate real-life computer systems with speech input and output; How to describe and model human-computer dialogue precisely and in depth. Also included: The first systematic medium-scale compendium of terminology with definitions. This handbook has been especially designed for the needs of development engineers, decision-makers, researchers, and advanced level students in the fields of speech technology, multimodal interfaces, multimedia, computational linguistics, and phonetics.




Conversational AI


Book Description

This book provides a comprehensive introduction to Conversational AI. While the idea of interacting with a computer using voice or text goes back a long way, it is only in recent years that this idea has become a reality with the emergence of digital personal assistants, smart speakers, and chatbots. Advances in AI, particularly in deep learning, along with the availability of massive computing power and vast amounts of data, have led to a new generation of dialogue systems and conversational interfaces. Current research in Conversational AI focuses mainly on the application of machine learning and statistical data-driven approaches to the development of dialogue systems. However, it is important to be aware of previous achievements in dialogue technology and to consider to what extent they might be relevant to current research and development. Three main approaches to the development of dialogue systems are reviewed: rule-based systems that are handcrafted using best practice guidelines; statistical data-driven systems based on machine learning; and neural dialogue systems based on end-to-end learning. Evaluating the performance and usability of dialogue systems has become an important topic in its own right, and a variety of evaluation metrics and frameworks are described. Finally, a number of challenges for future research are considered, including: multimodality in dialogue systems, visual dialogue; data efficient dialogue model learning; using knowledge graphs; discourse and dialogue phenomena; hybrid approaches to dialogue systems development; dialogue with social robots and in the Internet of Things; and social and ethical issues.




The Conversational Interface


Book Description

This book provides a comprehensive introduction to the conversational interface, which is becoming the main mode of interaction with virtual personal assistants, smart devices, various types of wearable, and social robots. The book consists of four parts. Part I presents the background to conversational interfaces, examining past and present work on spoken language interaction with computers. Part II covers the various technologies that are required to build a conversational interface along with practical chapters and exercises using open source tools. Part III looks at interactions with smart devices, wearables, and robots, and discusses the role of emotion and personality in the conversational interface. Part IV examines methods for evaluating conversational interfaces and discusses future directions.




Emotion, Affect and Personality in Speech


Book Description

This book explores the various categories of speech variation and works to draw a line between linguistic and paralinguistic phenomenon of speech. Paralinguistic contrast is crucial to human speech but has proven to be one of the most difficult tasks in speech systems. In the quest for solutions to speech technology and sciences, this book narrows down the gap between speech technologists and phoneticians and emphasizes the imperative efforts required to accomplish the goal of paralinguistic control in speech technology applications and the acute need for a multidisciplinary categorization system. This interdisciplinary work on paralanguage will not only serve as a source of information but also a theoretical model for linguists, sociologists, psychologists, phoneticians and speech researchers.




Immersive Environments, Augmented Realities, and Virtual Worlds: Assessing Future Trends in Education


Book Description

Technology has had direct impact on education in increasing the way that society continues to learn. Applications of immersive environments, virtual worlds, and augmented reality have significant implications for how teaching and learning are achieved in contemporary education. Immersive Environments, Augmented Realities and Virtual Worlds: Assessing Future Trends in Education brings together current research and performance in trends in education. While examining cyber behavior and the use of virtual worlds, immersive technologies and augmented realities aim to improve teaching and enhancing learning.




Perception in Multimodal Dialogue Systems


Book Description

This book constitutes the refereed proceedings of the 4th IEEE Tutorial and Research Workshop on Perception and Interactive Technologies for Speech-Based Systems, PIT 2008, held in Kloster Irsee, Germany, in June 2008. The 37 revised full papers presented together with 1 invited keynote lecture were carefully selected from numerous submissions for inclusion in the book. The papers are organized in topical sections on multimodal and spoken dialogue systems, classification of dialogue acts and sound, recognition of eye gaze, head poses, mimics and speech as well as combinations of modalities, vocal emotion recognition, human-like and social dialogue systems, and evaluation methods for multimodal dialogue systems.




Speech Technology


Book Description

This book gives an overview of the research and application of speech technologies in different areas. One of the special characteristics of the book is that the authors take a broad view of the multiple research areas and take the multidisciplinary approach to the topics. One of the goals in this book is to emphasize the application. User experience, human factors and usability issues are the focus in this book.




Your Virtual Butler


Book Description

In most scenarios of the future a personalized virtual butler appears. This butler not only performs communication and coordination tasks but also gives recommendations on how to handle everyday problems. The aim of this book is to explore the prerequisites of such a personalized virtual butler by asking: what is known about the capacities and the needs of aging people; which information and communication technologies have been used in assisting/conversing with persons, especially older ones, and what were the results; what are the advantages/disadvantages of virtual butlers as mainly software programs compared robots as butlers; and which methods, especially in artificial intelligence, have to be developed further and in which direction in order to create a virtual butler in the foreseeable future?




The Oxford Handbook of Psycholinguistics


Book Description

The ability to communicate through spoken and written language is one of the defining characteristics of the human race, yet it remains a deeply mysterious process. The young science of psycholinguistics attempts to uncover the mechanisms and representations underlying human language. This interdisciplinary field has seen massive developments over the past decade, with a broad expansion of the research base, and the incorporation of new experimental techniques such as brain imaging and computational modelling. The result is that real progress is being made in the understanding of the key components of language in the mind. The Oxford Handbook of Psycholinguistics brings together the views of 75 leading researchers in psycholinguistics to provide a comprehensive and authoritative review of the current state of the art in psycholinguistics. With almost 50 chapters written by experts in the field, the range and depth of coverage is unequalled. The contributors are eminent in a wide range of fields, including psychology, linguistics, human memory, cognitive neuroscience, bilingualism, genetics, development and neuropsychology. Their contributions are organised into six themed sections, covering word recognition, the mental lexicon, comprehension and discourse, language production, language development, and perspectives on psycholinguistics. The breadth of coverage, coupled with the accessibility of the short chapter format should make the handbook essential reading for both students and researchers in the fields of psychology, linguistics and neuroscience.