Speech, Sound and Music Processing: Embracing Research in India


Book Description

This book constitutes the thoroughly refereed post-proceedings of the 8th International Symposium on Computer Music Modeling and Retrieval, CMMR 2011, and the 20th International Symposium on Frontiers of Research in Speech and Music, FRSM 2011. This year the two conferences merged for the first time and were held in Bhubaneswar, India, in March 2011. The 17 revised full papers presented were specially reviewed and revised for inclusion in this proceedings volume. The book is divided into four main chapters, which reflect the high quality of the sessions of CMMR 2011, the collaboration with FRSM 2011 and the Indian influence, on the topics of Indian Music; Music Information Retrieval; Sound Analysis, Synthesis and Perception; and Speech Processing of Indian Languages.




Acoustics of Bangla Speech Sounds


Book Description

This book presents the consolidated acoustic data for all phones in Standard Colloquial Bengali (SCB), commonly known as Bangla, a Bengali language used by 350 million people in India, Bangladesh, and the Bengali diaspora. The book analyzes the real speech of selected native speakers of the Bangla dialect to ensure that a proper acoustical database is available for the development of speech technologies. The acoustic data presented consists of averages and their normal spread, represented by the standard deviations of the necessary acoustic parameters, including, for example, formant information for multiple native speakers of both sexes. The study employs two important speech technologies: (1) text-to-speech synthesis (TTS) and (2) automatic speech recognition (ASR). The procedures, particularly those related to the use of technologies, are described in sufficient detail to enable researchers to use them to create technical acoustic databases for any other Indian dialect. The book offers a unique resource for scientists and industrial practitioners who are interested in the acoustic analysis and processing of Indian dialects to develop similar dialect databases of their own.




Time Domain Representation of Speech Sounds


Book Description

The book presents the history of time-domain representation and the extent of its development along with that of spectral-domain representation in the cognitive and technology domains. It discusses all the cognitive experiments related to this development, along with details of technological developments related to both automatic speech recognition (ASR) and text-to-speech synthesis (TTS), and introduces a viable time-domain representation for both objective and subjective analysis, as an alternative to the well-known spectral representation. The book also includes a new cohort study on the use of lexical knowledge in ASR. India has numerous official dialects, and spoken-language technology development is a burgeoning area. In fact, TTS and ASR taken together constitute the most important technology for empowering people. As such, the book describes time-domain representation in such a way that it can be easily and seamlessly incorporated into ASR and TTS research and development. In short, it is a valuable guidebook for the development of ASR and TTS in all the Indian Standard Dialects using signal domain parameters.




Advances in Speech and Music Technology


Book Description

This book presents advances in speech and music in the domain of audio signal processing. The book begins with introductory chapters on the basics of speech and music, and then proceeds to computational aspects of speech and music, including music information retrieval and spoken language processing. The authors discuss the intersection of computer science, musicology and speech analysis, and how the multifaceted nature of speech and music information processing requires unique algorithms and systems that use sophisticated signal processing and machine learning techniques to better extract useful information. The authors discuss how a deep understanding of both speech and music in terms of perception, emotion, mood, gesture and cognition is essential for successful application. Also discussed is the overwhelming amount of data generated across the world, which requires efficient processing for better maintenance, retrieval, indexing and querying, and how machine learning and artificial intelligence are well suited to these computational tasks. The book provides both technological knowledge and a comprehensive treatment of essential topics in speech and music processing.




Intelligent Methods and Big Data in Industrial Applications


Book Description

The inspiration for this book came from the Industrial Session of the ISMIS 2017 Conference in Warsaw. It covers numerous applications of intelligent technologies in various branches of industry. Intelligent computational methods and big data foster innovation and enable industry to overcome technological limitations and explore new frontiers. It is therefore necessary for scientists and practitioners to cooperate and inspire each other, and to use the latest research findings to create new designs and products. As such, the contributions cover solutions to the problems experienced by practitioners in the areas of artificial intelligence, complex systems, data mining, medical applications and bioinformatics, as well as multimedia and text processing. Further, the book shows new directions for cooperation between science and industry and facilitates efficient transfer of knowledge in the area of intelligent information systems.




Musicality of Human Brain through Fractal Analytics


Book Description

This book provides a comprehensive overview of how fractal analytics can lead to the extraction of interesting features from the complex electroencephalograph (EEG) signals generated by Hindustani classical music. It particularly focuses on how the brain responds to the emotional attributes of Hindustani classical music, which have long been a source of discussion for musicologists and psychologists. Using robust scientific techniques that are capable of looking into the most intricate dynamics of the complex EEG signals, it deciphers the human brain’s response to different ragas of Hindustani classical music, shedding new light on what happens inside the performer’s brain when they are mentally composing the imagery of a particular raga. It also explores the much-debated issue in the musical fraternity of whether there are any universal cues in music that make it identifiable for people throughout the world and, if so, what neural correlates are associated with those universal cues. This book is of interest to researchers and scholars of music and the brain, nonlinear science, music cognition, music signal processing and music information retrieval. In addition, researchers in the field of nonlinear biomedical signal processing and music signal analysis will benefit from this book.




Auditory Interfaces


Book Description

Auditory Interfaces explores how human-computer interactions can be significantly enhanced through the improved use of the audio channel. Providing historical, theoretical and practical perspectives, the book begins with an introductory overview before presenting cutting-edge research, with chapters on embodied music recognition, nonspeech audio, and user interfaces. This book will be of interest to advanced students, researchers and professionals working in a range of fields, from audio sound systems to human-computer interaction and computer science.




Recommender Systems for Medicine and Music


Book Description

Music recommendation systems are becoming more and more popular. The increasing amount of personal data left by users on social media contributes to more accurate inference of the user’s musical preferences and thereby to the quality of personalized systems. Health recommendation systems have become indispensable tools in decision-making processes in the healthcare sector. Their main objective is to ensure the availability of valuable information at the right time by ensuring information quality, trustworthiness, and authentication, and by addressing privacy concerns. Medical doctors deal with various kinds of diseases in which music therapy helps to improve symptoms. Listening to music may improve heart rate, respiratory rate, and blood pressure in people with heart disease. Sound healing therapy uses aspects of music to improve physical and emotional health and well-being. The book presents a variety of approaches useful for creating recommendation systems in healthcare, music, and music therapy.




Modern Advances in Applied Intelligence


Book Description

The two-volume set LNAI 8481 and 8482 constitutes the refereed conference proceedings of the 27th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2014, held in Kaohsiung, Taiwan, in June 2014. A total of 106 papers were carefully reviewed and selected from various submissions for inclusion in the proceedings. The papers deal with a wide range of topics, from applications of applied intelligent systems to solving real-life problems in all areas, including engineering, science, industry, automation and robotics, business and finance, medicine and biomedicine, bioinformatics, cyberspace and human-machine interaction.




Pervasive Computing and Social Networking


Book Description

The book features original papers from the International Conference on Pervasive Computing and Social Networking (ICPCSN 2021), organized by NSIT, Salem, India, during 19-20 March 2021. It covers research works on conceptual, constructive, empirical, theoretical and practical implementations of pervasive computing and social networking methods for developing more novel ideas and innovations in the growing field of information and communication technologies.