Stream Processing with Apache Spark


Book Description

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs. Authors Gerard Maas and François Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API. Learn fundamental stream processing concepts and examine different streaming architectures Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams




Stream Processing with Apache Spark


Book Description

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You’ll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs. Authors Gerard Maas and François Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API. Learn fundamental stream processing concepts and examine different streaming architectures Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams




Adaptive Health Management Information Systems: Concepts, Cases, and Practical Applications


Book Description

Adaptive Health Management Information Systems, Fourth Edition is a thorough resource for a broad range of healthcare professionals–from informaticians, physicians and nurses, to pharmacists, public health and allied health professionals–who need to keep pace the digital transformation of health care. Wholly revised, updated, and expanded in scope, the fourth edition covers the latest developments in the field of health management information systems (HMIS) including big data analytics and machine learning in health care; precision medicine; digital health commercialization; supply chain management; informatics for pharmacy and public health; digital health leadership; cybersecurity; and social media analytics.




Big Data Analytics in Cybersecurity


Book Description

Big data is presenting challenges to cybersecurity. For an example, the Internet of Things (IoT) will reportedly soon generate a staggering 400 zettabytes (ZB) of data a year. Self-driving cars are predicted to churn out 4000 GB of data per hour of driving. Big data analytics, as an emerging analytical technology, offers the capability to collect, store, process, and visualize these vast amounts of data. Big Data Analytics in Cybersecurity examines security challenges surrounding big data and provides actionable insights that can be used to improve the current practices of network operators and administrators. Applying big data analytics in cybersecurity is critical. By exploiting data from the networks and computers, analysts can discover useful network information from data. Decision makers can make more informative decisions by using this analysis, including what actions need to be performed, and improvement recommendations to policies, guidelines, procedures, tools, and other aspects of the network processes. Bringing together experts from academia, government laboratories, and industry, the book provides insight to both new and more experienced security professionals, as well as data analytics professionals who have varying levels of cybersecurity expertise. It covers a wide range of topics in cybersecurity, which include: Network forensics Threat analysis Vulnerability assessment Visualization Cyber training. In addition, emerging security domains such as the IoT, cloud computing, fog computing, mobile computing, and cyber-social networks are examined. The book first focuses on how big data analytics can be used in different aspects of cybersecurity including network forensics, root-cause analysis, and security training. Next it discusses big data challenges and solutions in such emerging cybersecurity domains as fog computing, IoT, and mobile app security. The book concludes by presenting the tools and datasets for future cybersecurity research.




Mastering Data Engineering: Advanced Techniques with Apache Hadoop and Hive


Book Description

Immerse yourself in the realm of big data with "Mastering Data Engineering: Advanced Techniques with Apache Hadoop and Hive," your definitive guide to mastering two of the most potent technologies in the data engineering landscape. This book provides comprehensive insights into the complexities of Apache Hadoop and Hive, equipping you with the expertise to store, manage, and analyze vast amounts of data with precision. From setting up your initial Hadoop cluster to performing sophisticated data analytics with HiveQL, each chapter methodically builds on the previous one, ensuring a robust understanding of both fundamental concepts and advanced methodologies. Discover how to harness HDFS for scalable and reliable storage, utilize MapReduce for intricate data processing, and fully exploit data warehousing capabilities with Hive. Targeted at data engineers, analysts, and IT professionals striving to advance their proficiency in big data technologies, this book is an indispensable resource. Through a blend of theoretical insights, practical knowledge, and real-world examples, you will master data storage optimization, advanced Hive functionalities, and best practices for secure and efficient data management. Equip yourself to confront big data challenges with confidence and skill with "Mastering Data Engineering: Advanced Techniques with Apache Hadoop and Hive." Whether you're a novice in the field or seeking to expand your expertise, this book will be your invaluable guide on your data engineering journey.




Stream Processing Unleashed: Real-Time Analytics for the Modern Era


Book Description

Mrs.V.Suganthi, Assistant Professor, Department of Computer Science, C.T.T.E College for Women, Chennai,Tamil Nadu, India. Mr.Z.Harith Ahamed, Assistant Professor, Department of Computer Science, Jamal Mohamed College (Autonomous), Tiruchirappalli, Tamil Nadu, India. Dr.T.Shiek Pareeth, Assistant Professor, Department of Mathematics, Jamal Mohamed College (Autonomous), Tiruchirappalli, Tamil Nadu, India. Mrs.P.Indumathi, Assistant Professor, Department of Computer Science with Data Analytics, Kongunadu Arts and Science College, Coimbatore, Tamil Nadu, India. Mrs.S.Nandhinieswari, Assistant Professor, Department of Computer Science, Sri Ramakrishna Arts and Science College For Women, Coimbatore, Tamil Nadu, India.




Expert Hadoop Administration


Book Description

This is the eBook of the printed book and may not include any media, website access codes, or print supplements that may come packaged with the bound book. The Comprehensive, Up-to-Date Apache Hadoop Administration Handbook and Reference “Sam Alapati has worked with production Hadoop clusters for six years. His unique depth of experience has enabled him to write the go-to resource for all administrators looking to spec, size, expand, and secure production Hadoop clusters of any size.” —Paul Dix, Series Editor In Expert Hadoop® Administration, leading Hadoop administrator Sam R. Alapati brings together authoritative knowledge for creating, configuring, securing, managing, and optimizing production Hadoop clusters in any environment. Drawing on his experience with large-scale Hadoop administration, Alapati integrates action-oriented advice with carefully researched explanations of both problems and solutions. He covers an unmatched range of topics and offers an unparalleled collection of realistic examples. Alapati demystifies complex Hadoop environments, helping you understand exactly what happens behind the scenes when you administer your cluster. You’ll gain unprecedented insight as you walk through building clusters from scratch and configuring high availability, performance, security, encryption, and other key attributes. The high-value administration skills you learn here will be indispensable no matter what Hadoop distribution you use or what Hadoop applications you run. Understand Hadoop’s architecture from an administrator’s standpoint Create simple and fully distributed clusters Run MapReduce and Spark applications in a Hadoop cluster Manage and protect Hadoop data and high availability Work with HDFS commands, file permissions, and storage management Move data, and use YARN to allocate resources and schedule jobs Manage job workflows with Oozie and Hue Secure, monitor, log, and optimize Hadoop Benchmark and troubleshoot Hadoop




Model and Data Engineering


Book Description

This volume LNCS 14396 constitutes the refereed proceedings of the 12th International Conference, MEDI 2023,in November 2023 ,held in Sousse, Tunisia. The 27 full papers were carefully peer reviewed and selected from 99 submissions. The Annual International Conference on Model and Data Engineering focuses on bring together researchers and practitioners and enabling them to showcase the latest advances in modelling and data management.




Towards Smart World


Book Description

Towards Smart World: Homes to Cities Using Internet of Things provides an overview of basic concepts from the rising of machines and communication to IoT for making cities smart, real-time applications domains, related technologies, and their possible solutions for handling relevant challenges. This book highlights the utilization of IoT for making cities smart and its underlying technologies in real-time application areas such as emergency departments, intelligent traffic systems, indoor and outdoor securities, automotive industries, environmental monitoring, business entrepreneurship, facial recognition, and motion-based object detection. Features The book covers the challenging issues related to sensors, detection, and tracking of moving objects, and solutions to handle relevant challenges. It contains the most recent research analysis in the domain of communications, signal processing, and computing sciences for facilitating smart homes, buildings, environmental conditions, and cities. It presents the readers with practical approaches and future direction for using IoT in smart cities and discusses how it deals with human dynamics, the ecosystem, and social objects and their relation. It describes the latest technological advances in IoT and visual surveillance with their implementations. This book is an ideal resource for IT professionals, researchers, undergraduate or postgraduate students, practitioners, and technology developers who are interested in gaining deeper knowledge and implementing IoT for smart cities, real-time applications areas, and technologies, and a possible set of solutions to handle relevant challenges. Dr. Lavanya Sharma is an Assistant Professor in the Amity Institute of Information Technology at Amity University UP, Noida, India. She has been a recipient of several prestigious awards during her academic career. She is an active nationally recognized researcher who has published numerous papers in her field.




Applied Soft Computing and Communication Networks


Book Description

This book constitutes thoroughly refereed post-conference proceedings of the International Applied Soft Computing and Communication Networks (ACN 2020) held in VIT, Chennai, India, during October 14–17, 2020. The research papers presented were carefully reviewed and selected from several initial submissions. The book is directed to the researchers and scientists engaged in various fields of intelligent systems.