Speech Recognition Using Articulatory and Excitation Source Features


Book Description

This book discusses the contribution of articulatory and excitation source information in discriminating sound units. The authors focus on the excitation source component of speech -- and the dynamics of various articulators during speech production -- for enhancement of speech recognition (SR) performance. Speech recognition is analyzed for read, extempore, and conversation modes of speech. Five groups of articulatory features (AFs) are explored for speech recognition, in addition to conventional spectral features. Each chapter provides the motivation for exploring the specific feature for the SR task, discusses the methods to extract those features, and finally suggests appropriate models to capture the sound-unit-specific knowledge from the proposed features. The authors close by discussing various combinations of spectral, articulatory and source features, and the desired models to enhance the performance of SR systems.




Multilingual Phone Recognition in Indian Languages


Book Description

The book presents current research and developments in multilingual speech recognition. The author presents a Multilingual Phone Recognition System (Multi-PRS), developed using a common multilingual phone-set derived from the International Phonetic Alphabet (IPA) based transcription of six Indian languages: Kannada, Telugu, Bengali, Odia, Urdu, and Assamese. The author shows how the performance of Multi-PRS can be improved using tandem features. The book compares Monolingual Phone Recognition Systems (Mono-PRS) with Multi-PRS, and baseline systems with tandem systems. Methods are proposed to predict Articulatory Features (AFs) from spectral features using Deep Neural Networks (DNNs). Multitask learning is explored to improve the prediction accuracy of AFs. The AFs are then used to improve the performance of Multi-PRS through two methods of combination: lattice rescoring and tandem. The author goes on to develop and evaluate Language Identification followed by Monolingual phone recognition (LID-Mono) and common multilingual phone-set based multilingual phone recognition systems.




Smart Computing Paradigms: New Progresses and Challenges


Book Description

This two-volume book focuses on both theory and applications in the broad areas of communication technology, computer science and information security. It brings together contributions from scientists, professors, scholars and students, and presents essential information on computing, networking, and informatics. It also discusses the practical challenges encountered and the solutions used to overcome them, the goal being to promote the “translation” of basic research into applied research, and of applied research into practice. The works presented here will also demonstrate the importance of basic scientific research in a range of fields.




The Stationary Bionic Wavelet Transform and its Applications for ECG and Speech Processing


Book Description

This book first details a proposed Stationary Bionic Wavelet Transform (SBWT) for use in speech processing. The author then details the proposed techniques based on SBWT. These techniques are relevant to speech enhancement, speech recognition, and ECG de-noising. The techniques are then evaluated by comparing them with a number of existing methods from the literature. For this evaluation, the proposed techniques are applied to different speech and ECG signals, and their performance is justified by results obtained using objective criteria such as SNR, SSNR, PSNR, PESQ, MAE, MSE and more.




Intelligent Computing and Optimization


Book Description

This book from Springer Nature is further proof of Springer’s outstanding greatness on the lively interface of Holistic Computational Optimization, Green IoTs, Smart Modeling, and Deep Learning! It is a masterpiece of what our community of academics and experts can provide when an interconnected approach of joint, mutual, and meta-learning is supported by the advanced operational research and experience of the world leader Springer Nature! The 6th edition of the International Conference on Intelligent Computing and Optimization took place at G Hua Hin Resort & Mall on April 27–28, 2023, with tremendous support from research scholars across the globe. The objective is to celebrate “Research Novelty with Compassion and Wisdom” with researchers, scholars, experts, and investigators in Intelligent Computing and Optimization worldwide, and to share knowledge, experience, and innovation: a marvelous opportunity for discourse and mutuality through novel research, invention, and creativity. This proceedings book of the 6th ICO’2023 is published by Springer Nature, Quality Label of Enlightenment.




Workshops at 18th International Conference on Intelligent Environments (IE2022)


Book Description

The term Intelligent Environments (IEs) refers to physical spaces in which information and communication technologies are interwoven with sensing technologies, innovative user interfaces, robotics and artificial intelligence to create interactive spaces which increase the awareness and enhance the experience of those occupying them. The growing IE community is rooted in academia, but increasingly involves practitioners. It explores the core ideas of IEs as well as the factors necessary to make them a reality, such as energy efficiency, the computational constraints of edge devices and privacy issues. This book presents papers from Workshops held during the 18th International Conference on Intelligent Environments, IE2022, held as a hybrid conference in Biarritz, France, from 20 to 23 June 2022. The conference is now recognized as a major annual venue in the field of IE. It offers a truly international forum for the exchange of information and ideas, and welcomes contributions from all technically active regions of the planet. Included here are 35 papers from the 1st International Workshop on Sentiment Analysis and Emotion Recognition for Social Robots (SENTIRobots’22); 1st International Workshop on Edge AI for Smart Agriculture (EAISA’22); 2nd International Workshop on Artificial Intelligence and Machine Learning for Emerging Topics (ALLEGET’22); 11th International Workshop on the Reliability of Intelligent Environments (WoRIE’22); 2nd International Workshop on Self-Learning in Intelligent Environments (SeLIE’22); 5th Workshop on Citizen Centric Smart Cities Solutions (CCSCS’22); and 11th International Workshop on Intelligent Environments Supporting Healthcare and Well-being (WISHWell’22). Exploring some of the latest research and developments in the field, the book will be of interest to all those working with intelligent environments and their associated technologies.




Marine Navigation


Book Description

The 12th International Conference on Marine Navigation and Safety of Sea Transportation (TransNav 2017) will take place on June 21-23 in Gdynia, Poland. Main themes of this conference include: electronic navigation, route planning, mathematical models, methods and algorithms, ship manoeuvring, navigational risks, Global Navigation Satellite Systems (GNSS), Automatic Identification System (AIS), marine radar, anti-collision, dynamic positioning, visualization of data, hydrometeorological aspects and weather routing, safety at sea, inland navigation, autonomous water transport, communications and the Global Maritime Distress and Safety System (GMDSS), optimum location of ports and routes, and magnetic compasses.




Speech Synthesis and Recognition


Book Description

With the growing impact of information technology on daily life, speech is becoming increasingly important for providing a natural means of communication between humans and machines. This extensively reworked and updated new edition of Speech Synthesis and Recognition is an easy-to-read introduction to current speech technology. Aimed at advanced undergraduates and graduates in electronic engineering, computer science and information technology, the book is also relevant to professional engineers who need to understand enough about speech technology to be able to apply it successfully and to work effectively with speech experts. No advanced mathematical ability is required and no specialist prior knowledge of phonetics or of the properties of speech signals is assumed.







Neural Networks and Statistical Learning


Book Description

This book provides a broad yet detailed introduction to neural networks and machine learning in a statistical framework. A single, comprehensive resource for study and further research, it explores the major popular neural network models and statistical learning approaches with examples and exercises and allows readers to gain a practical working understanding of the content. This updated new edition presents recently published results and includes six new chapters that correspond to the recent advances in computational learning theory, sparse coding, deep learning, big data and cloud computing. Each chapter features state-of-the-art descriptions and significant research findings. The topics covered include: • multilayer perceptron; • the Hopfield network; • associative memory models; • clustering models and algorithms; • the radial basis function network; • recurrent neural networks; • nonnegative matrix factorization; • independent component analysis; • probabilistic and Bayesian networks; and • fuzzy sets and logic. Focusing on the prominent accomplishments and their practical aspects, this book provides academic and technical staff, as well as graduate students and researchers, with a solid foundation and comprehensive reference on the fields of neural networks, pattern recognition, signal processing, and machine learning.