Hardware Accelerator Systems for Artificial Intelligence and Machine Learning


Book Description

Hardware Accelerator Systems for Artificial Intelligence and Machine Learning, Volume 122 delves into arti?cial Intelligence and the growth it has seen with the advent of Deep Neural Networks (DNNs) and Machine Learning. Updates in this release include chapters on Hardware accelerator systems for artificial intelligence and machine learning, Introduction to Hardware Accelerator Systems for Artificial Intelligence and Machine Learning, Deep Learning with GPUs, Edge Computing Optimization of Deep Learning Models for Specialized Tensor Processing Architectures, Architecture of NPU for DNN, Hardware Architecture for Convolutional Neural Network for Image Processing, FPGA based Neural Network Accelerators, and much more. - Updates on new information on the architecture of GPU, NPU and DNN - Discusses In-memory computing, Machine intelligence and Quantum computing - Includes sections on Hardware Accelerator Systems to improve processing efficiency and performance




Data Orchestration in Deep Learning Accelerators


Book Description

This Synthesis Lecture focuses on techniques for efficient data orchestration within DNN accelerators. The End of Moore's Law, coupled with the increasing growth in deep learning and other AI applications has led to the emergence of custom Deep Neural Network (DNN) accelerators for energy-efficient inference on edge devices. Modern DNNs have millions of hyper parameters and involve billions of computations; this necessitates extensive data movement from memory to on-chip processing engines. It is well known that the cost of data movement today surpasses the cost of the actual computation; therefore, DNN accelerators require careful orchestration of data across on-chip compute, network, and memory elements to minimize the number of accesses to external DRAM. The book covers DNN dataflows, data reuse, buffer hierarchies, networks-on-chip, and automated design-space exploration. It concludes with data orchestration challenges with compressed and sparse DNNs and future trends. The target audience is students, engineers, and researchers interested in designing high-performance and low-energy accelerators for DNN inference.










Deep Learning for Computer Architects


Book Description

Machine learning, and specifically deep learning, has been hugely disruptive in many fields of computer science. The success of deep learning techniques in solving notoriously difficult classification and regression problems has resulted in their rapid adoption in solving real-world problems. The emergence of deep learning is widely attributed to a virtuous cycle whereby fundamental advancements in training deeper models were enabled by the availability of massive datasets and high-performance computer hardware. This text serves as a primer for computer architects in a new and rapidly evolving field. We review how machine learning has evolved since its inception in the 1960s and track the key developments leading up to the emergence of the powerful deep learning techniques that emerged in the last decade. Next we review representative workloads, including the most commonly used datasets and seminal networks across a variety of domains. In addition to discussing the workloads themselves, we also detail the most popular deep learning tools and show how aspiring practitioners can use the tools with the workloads to characterize and optimize DNNs. The remainder of the book is dedicated to the design and optimization of hardware and architectures for machine learning. As high-performance hardware was so instrumental in the success of machine learning becoming a practical solution, this chapter recounts a variety of optimizations proposed recently to further improve future designs. Finally, we present a review of recent research published in the area as well as a taxonomy to help readers understand how various contributions fall in context.




Towards Heterogeneous Multi-core Systems-on-Chip for Edge Machine Learning


Book Description

This book explores and motivates the need for building homogeneous and heterogeneous multi-core systems for machine learning to enable flexibility and energy-efficiency. Coverage focuses on a key aspect of the challenges of (extreme-)edge-computing, i.e., design of energy-efficient and flexible hardware architectures, and hardware-software co-optimization strategies to enable early design space exploration of hardware architectures. The authors investigate possible design solutions for building single-core specialized hardware accelerators for machine learning and motivates the need for building homogeneous and heterogeneous multi-core systems to enable flexibility and energy-efficiency. The advantages of scaling to heterogeneous multi-core systems are shown through the implementation of multiple test chips and architectural optimizations.




Sparse Arrays for Radar, Sonar, and Communications


Book Description

Specialized resource providing detailed coverage of recent advances in theory and applications of sparse arrays Sparse Arrays for Radar, Sonar, and Communications discusses various design approaches of sparse arrays, including those seeking to increase the corresponding one-dimensional and two-dimensional virtual array apertures, as well as others that configure the arrays based on solutions of constrained minimization problems. The latter includes statistical bounds and signal-to-interference and noise ratio; in this respect, the book utilizes the recent strides made in convex optimizations and machine learning for sparse array configurability in both fixed and dynamic environments. Similar ideas are presented for sparse array-waveform design. The book also discusses the role of sparse arrays in improving target detection and resolution in radar, improving channel capacity in massive MIMO, and improving underwater target localization in sonar. It covers different sparse array topologies, and provides various approaches that deliver the optimum and semi-optimum sparse array transceivers. . Edited by a world-leading expert in Radar and Signal Processing and contributed to by world-class researchers in their respective fields, Sparse Arrays for Radar, Sonar, and Communications covers topics including: Utilizing sparse arrays in emerging technologies and showing their offerings in various sensing and communications applications Applying sparse arrays to different environments and obtain superior performances over conventional uniform arrays Solving the localization, beamforming, and direction-finding problems using non-uniform array structures for narrowband and wideband signals Designing sparse array structures for both stationary and moving platforms that produce physical and synthesized array apertures. Using deep neural networks that learn the underlying complex nonlinear model and output the sparse array configuration using representations of the input data spatio-temporal observations Solving for optimum sparse array configurations and beamforming coefficients in sensing using iterative convex optimization methods Providing complete coverage of the recent considerable progress in sparse array design and configurations, Sparse Arrays for Radar, Sonar, and Communications is an essential resource on the subject for graduate students and engineers pursuing research and applications in the broad areas of active/passive sensing and communications.




Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing


Book Description

This book presents recent advances towards the goal of enabling efficient implementation of machine learning models on resource-constrained systems, covering different application domains. The focus is on presenting interesting and new use cases of applying machine learning to innovative application domains, exploring the efficient hardware design of efficient machine learning accelerators, memory optimization techniques, illustrating model compression and neural architecture search techniques for energy-efficient and fast execution on resource-constrained hardware platforms, and understanding hardware-software codesign techniques for achieving even greater energy, reliability, and performance benefits.




Deep Learning in Computer Vision


Book Description

Deep learning algorithms have brought a revolution to the computer vision community by introducing non-traditional and efficient solutions to several image-related problems that had long remained unsolved or partially addressed. This book presents a collection of eleven chapters where each individual chapter explains the deep learning principles of a specific topic, introduces reviews of up-to-date techniques, and presents research findings to the computer vision community. The book covers a broad scope of topics in deep learning concepts and applications such as accelerating the convolutional neural network inference on field-programmable gate arrays, fire detection in surveillance applications, face recognition, action and activity recognition, semantic segmentation for autonomous driving, aerial imagery registration, robot vision, tumor detection, and skin lesion segmentation as well as skin melanoma classification. The content of this book has been organized such that each chapter can be read independently from the others. The book is a valuable companion for researchers, for postgraduate and possibly senior undergraduate students who are taking an advanced course in related topics, and for those who are interested in deep learning with applications in computer vision, image processing, and pattern recognition.




Intelligence Science IV


Book Description

This book constitutes the refereed proceedings of the 5th International Conference on Intelligence Science, ICIS 2022, held in Xi'an, China, in August 2022. The 41 full and 5 short papers presented in this book were carefully reviewed and selected from 85 submissions. They were organized in topical sections as follows: Brain cognition; machine learning; data intelligence; language cognition; remote sensing images; perceptual intelligence; wireless sensor; and medical artificial intelligence.