HDInsight Essentials - Second Edition


Book Description

If you want to discover one of the latest tools designed to produce stunning Big Data insights, this book features everything you need to get to grips with your data. Whether you are a data architect, developer, or a business strategist, HDInsight adds value in everything from development, administration, and reporting.




Hadoop MapReduce v2 Cookbook - Second Edition


Book Description

If you are a Big Data enthusiast and wish to use Hadoop v2 to solve your problems, then this book is for you. This book is for Java programmers with little to moderate knowledge of Hadoop MapReduce. This is also a one-stop reference for developers and system admins who want to quickly get up to speed with using Hadoop v2. It would be helpful to have a basic knowledge of software development using Java and a basic working knowledge of Linux.




Data Lake Development with Big Data


Book Description

Explore architectural approaches to building Data Lakes that ingest, index, manage, and analyze massive amounts of data using Big Data technologies About This Book Comprehend the intricacies of architecting a Data Lake and build a data strategy around your current data architecture Efficiently manage vast amounts of data and deliver it to multiple applications and systems with a high degree of performance and scalability Packed with industry best practices and use-case scenarios to get you up-and-running Who This Book Is For This book is for architects and senior managers who are responsible for building a strategy around their current data architecture, helping them identify the need for a Data Lake implementation in an enterprise context. The reader will need a good knowledge of master data management and information lifecycle management, and experience of Big Data technologies. What You Will Learn Identify the need for a Data Lake in your enterprise context and learn to architect a Data Lake Learn to build various tiers of a Data Lake, such as data intake, management, consumption, and governance, with a focus on practical implementation scenarios Find out the key considerations to be taken into account while building each tier of the Data Lake Understand Hadoop-oriented data transfer mechanism to ingest data in batch, micro-batch, and real-time modes Explore various data integration needs and learn how to perform data enrichment and data transformations using Big Data technologies Enable data discovery on the Data Lake to allow users to discover the data Discover how data is packaged and provisioned for consumption Comprehend the importance of including data governance disciplines while building a Data Lake In Detail A Data Lake is a highly scalable platform for storing huge volumes of multistructured data from disparate sources with centralized data management services. This book explores the potential of Data Lakes and explores architectural approaches to building data lakes that ingest, index, manage, and analyze massive amounts of data using batch and real-time processing frameworks. It guides you on how to go about building a Data Lake that is managed by Hadoop and accessed as required by other Big Data applications. This book will guide readers (using best practices) in developing Data Lake's capabilities. It will focus on architect data governance, security, data quality, data lineage tracking, metadata management, and semantic data tagging. By the end of this book, you will have a good understanding of building a Data Lake for Big Data. Style and approach Data Lake Development with Big Data provides architectural approaches to building a Data Lake. It follows a use case-based approach where practical implementation scenarios of each key component are explained. It also helps you understand how these use cases are implemented in a Data Lake. The chapters are organized in a way that mimics the sequential data flow evidenced in a Data Lake.




Learning Azure DocumentDB


Book Description

Create outstanding enterprise solutions around DocumentDB using the latest technologies and programming tools with Azure About This Book Get to know the concepts of DocumentDB and learn to work your way around it Manipulate and query your documents using different modern technologies to access DocumentDB Build a real-life scenario using Microsoft Visual Studio and C# with this handy and practical guide Who This Book Is For This book is for novice developers and database architects who need a thorough knowledge of the features of DocumentDB and developing applications with it. Basic knowledge of SQL would be helpful. What You Will Learn Create, manage, and configure your DocumentDB environment Execute SQL queries from simple to complex and nested ones against your database Get to know about advanced DocumentDB techniques such as scopes, portioning, indexing, triggers, UDF's, and security Fine-tune your DocumentDB database to optimize performance and costs Interact with DocumentDB from different technologies and platforms Build a real-life scenario using C# and put DocumentDB at the heart of Azure solutions Understand how to migrate from your current datastore to DocumentDB In Detail Learning DocumentDB adopts a practical, step-by-step approach to help you learn the basics of DocumentDB and use your new-found abilities in real-life scenarios and enterprise solutions. We start with the absolute basics, such as setting up a DocumentDB environment, and guide you through managing your databases, and executing simple and complex queries. Next, we explain how to work with DocumentDB using the open REST protocol, and demonstrate how JavaScript works with DocumentDB. We'll also show you how to authenticate and execute queries. Moving on, you'll find out how to use DocumentDB from within Node.js to kick-start your Node.js projects. Next, you'll discover how to increase the performance of your DocumentDB database and fine-tune it. Finally, you'll get to grips with using DocumentDB in conjunction with other services offered from the Microsoft Azure platform. Style and approach This book can be used as a tutorial where you learn step by step, but also as a knowledge base to quickly look up recipes you can instantly utilize. Starting with the basics and moving on to advanced topics, every concept is explained in theory and demonstrated through easy-to-understand examples.




Smarter Decisions – The Intersection of Internet of Things and Decision Science


Book Description

Enter the world of Internet of Things with the power of data science with this highly practical, engaging book About This Book Explore real-world use cases from the Internet of Things (IoT) domain using decision science with this easy-to-follow, practical book Learn to make smarter decisions on top of your IoT solutions so that your IoT is smart in a real sense This highly practical, example-rich guide fills the gap between your knowledge of data science and IoT Who This Book Is For If you have a basic programming experience with R and want to solve business use cases in IoT using decision science then this book is for you. Even if your're a non-technical manager anchoring IoT projects, you can skip the code and still benefit from the book. What You Will Learn Explore decision science with respect to IoT Get to know the end to end analytics stack – Descriptive + Inquisitive + Predictive + Prescriptive Solve problems in IoT connected assets and connected operations Design and solve real-life IoT business use cases using cutting edge machine learning techniques Synthesize and assimilate results to form the perfect story for a business Master the art of problem solving when IoT meets decision science using a variety of statistical and machine learning techniques along with hands on tasks in R In Detail With an increasing number of devices getting connected to the Internet, massive amounts of data are being generated that can be used for analysis. This book helps you to understand Internet of Things in depth and decision science, and solve business use cases. With IoT, the frequency and impact of the problem is huge. Addressing a problem with such a huge impact requires a very structured approach. The entire journey of addressing the problem by defining it, designing the solution, and executing it using decision science is articulated in this book through engaging and easy-to-understand business use cases. You will get a detailed understanding of IoT, decision science, and the art of solving a business problem in IoT through decision science. By the end of this book, you'll have an understanding of the complex aspects of decision making in IoT and will be able to take that knowledge with you onto whatever project calls for it Style and approach This scenario-based tutorial approaches the topic systematically, allowing you to build upon what you learned in previous chapters.




Microsoft Azure Essentials - Fundamentals of Azure


Book Description

Microsoft Azure Essentials from Microsoft Press is a series of free ebooks designed to help you advance your technical skills with Microsoft Azure. The first ebook in the series, Microsoft Azure Essentials: Fundamentals of Azure, introduces developers and IT professionals to the wide range of capabilities in Azure. The authors - both Microsoft MVPs in Azure - present both conceptual and how-to content for key areas, including: Azure Websites and Azure Cloud Services Azure Virtual Machines Azure Storage Azure Virtual Networks Databases Azure Active Directory Management tools Business scenarios Watch Microsoft Press’s blog and Twitter (@MicrosoftPress) to learn about other free ebooks in the “Microsoft Azure Essentials” series.




Hadoop Essentials


Book Description

If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. This book is also meant for Hadoop professionals who want to find solutions to the different challenges they come across in their Hadoop projects.




Hadoop For Dummies


Book Description

Let Hadoop For Dummies help harness the power of your data and rein in the information overload Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets with becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters. Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster From programmers challenged with building and maintaining affordable, scaleable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.




Instant Mapreduce Patterns - Hadoop Essentials How-To


Book Description

Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. This is a Packt Instant How-to guide, which provides concise and clear recipes for getting started with Hadoop.This book is for big data enthusiasts and would-be Hadoop programmers. It is also meant for Java programmers who either have not worked with Hadoop at all, or who know Hadoop and MapReduce but are not sure how to deepen their understanding.




Data Analysis and Business Modeling with Excel 2013


Book Description

Manage, analyze, and visualize data with Microsoft Excel 2013 to transform raw data into ready to use information About This Book Create formulas to help you analyze and explain findings Develop interactive spreadsheets that will impress your audience and give them the ability to slice and dice data A step-by-step guide to learn various ways to model data for businesses with the help of Excel 2013 Who This Book Is For If you want to start using Excel 2013 for data analysis and business modeling and enhance your skills in the data analysis life cycle then this book is for you, whether you're new to Excel or experienced. What You Will Learn Discover what Excel formulas are all about and how to use them in your spreadsheet development Identify bad data and learn cleaning strategies Create interactive spreadsheets that engage and appeal to your audience Leverage Excel's powerful built-in tools to get the median, maximum, and minimum values of your data Build impressive tables and combine datasets using Excel's built-in functionality Learn the powerful scripting language VBA, allowing you to implement your own custom solutions with ease In Detail Excel 2013 is one of the easiest to use data analysis tools you will ever come across. Its simplicity and powerful features has made it the go to tool for all your data needs. Complex operations with Excel, such as creating charts and graphs, visualization, and analyzing data make it a great tool for managers, data scientists, financial data analysts, and those who work closely with data. Learning data analysis and will help you bring your data skills to the next level. This book starts by walking you through creating your own data and bringing data into Excel from various sources. You'll learn the basics of SQL syntax and how to connect it to a Microsoft SQL Server Database using Excel's data connection tools. You will discover how to spot bad data and strategies to clean that data to make it useful to you. Next, you'll learn to create custom columns, identify key metrics, and make decisions based on business rules. You'll create macros using VBA and use Excel 2013's shiny new macros. Finally, at the end of the book, you'll be provided with useful shortcuts and tips, enabling you to do efficient data analysis and business modeling with Excel 2013. Style and approach This is a step-by-step guide to performing data analysis and business modelling with Excel 2013, complete with examples and tips.