Data Science Manuale Italiano – Advanced Machine Learning e Deployment


Book Description

Questa opera segue il curriculum 2021 della Association for Computing Machinery per specialisti in Scienze dei Dati, con l’obiettivo di costituire un “Bignami” della Scienza ed Ingegneria dei Dati e facilitare il percorso di formazione personale a partire da competenze specialistiche in Informatica o Matematica o Statistica per un lettore di lingua madre italiana. Parte di una serie di testi, riepiloga prima di tutto la metodologia di lavoro standard CRISP DM utilizzata in questa opera e in progetti di Scienza dei Dati. Poichè questo testo utilizza Orange per gli aspetti applicativi, ne descrive l’installazione ed i widget. La fase di modellizzazione dei dati viene considerata nell’ottica dell’apprendimento automatico riepilogando i tipi di apprendimento automatico, i tipi di modelli, i tipi di problemi e i tipi di algoritmi. Sono descritti gli aspetti avanzati associati alla modellizzazione quali le funzioni di perdita e di ottimizzazione come la gradient descent, le tecniche per analizzare le prestazioni dei modelli come il Bootstrapping e la Cross Validation. Vengono analizzati gli scenari di deployment e le più comuni piattaforme, con esempi applicativi. Vengono proposti i meccanismi per automatizzare l’apprendimento automatico e per supportare l’interpretabilità dei modelli e dei risultati come Partial Dependence Plot, Permuted Feature Importance e altre. Gli esercizi sono descritti con Orange e Python con l’uso della libreria Keras/Tensorflow. Il testo è corredato di materiale di supporto ed è possibile scaricare gli esempi in Orange e i dati di prova.




The Data Science Design Manual


Book Description

This engaging and clearly written textbook/reference provides a must-have introduction to the rapidly emerging interdisciplinary field of data science. It focuses on the principles fundamental to becoming a good data scientist and the key skills needed to build systems for collecting, analyzing, and interpreting data. The Data Science Design Manual is a source of practical insights that highlights what really matters in analyzing data, and provides an intuitive understanding of how these core concepts can be used. The book does not emphasize any particular programming language or suite of data-analysis tools, focusing instead on high-level discussion of important design principles. This easy-to-read text ideally serves the needs of undergraduate and early graduate students embarking on an “Introduction to Data Science” course. It reveals how this discipline sits at the intersection of statistics, computer science, and machine learning, with a distinct heft and character of its own. Practitioners in these and related fields will find this book perfect for self-study as well. Additional learning tools: Contains “War Stories,” offering perspectives on how data science applies in the real world Includes “Homework Problems,” providing a wide range of exercises and projects for self-study Provides a complete set of lecture slides and online video lectures at www.data-manual.com Provides “Take-Home Lessons,” emphasizing the big-picture concepts to learn from each chapter Recommends exciting “Kaggle Challenges” from the online platform Kaggle Highlights “False Starts,” revealing the subtle reasons why certain approaches fail Offers examples taken from the data science television show “The Quant Shop” (www.quant-shop.com)




Powering the Digital Economy: Opportunities and Risks of Artificial Intelligence in Finance


Book Description

This paper discusses the impact of the rapid adoption of artificial intelligence (AI) and machine learning (ML) in the financial sector. It highlights the benefits these technologies bring in terms of financial deepening and efficiency, while raising concerns about its potential in widening the digital divide between advanced and developing economies. The paper advances the discussion on the impact of this technology by distilling and categorizing the unique risks that it could pose to the integrity and stability of the financial system, policy challenges, and potential regulatory approaches. The evolving nature of this technology and its application in finance means that the full extent of its strengths and weaknesses is yet to be fully understood. Given the risk of unexpected pitfalls, countries will need to strengthen prudential oversight.




Real World OCaml


Book Description

This fast-moving tutorial introduces you to OCaml, an industrial-strength programming language designed for expressiveness, safety, and speed. Through the book’s many examples, you’ll quickly learn how OCaml stands out as a tool for writing fast, succinct, and readable systems code. Real World OCaml takes you through the concepts of the language at a brisk pace, and then helps you explore the tools and techniques that make OCaml an effective and practical tool. In the book’s third section, you’ll delve deep into the details of the compiler toolchain and OCaml’s simple and efficient runtime system. Learn the foundations of the language, such as higher-order functions, algebraic data types, and modules Explore advanced features such as functors, first-class modules, and objects Leverage Core, a comprehensive general-purpose standard library for OCaml Design effective and reusable libraries, making the most of OCaml’s approach to abstraction and modularity Tackle practical programming problems from command-line parsing to asynchronous network programming Examine profiling and interactive debugging techniques with tools such as GNU gdb




Exploring Splunk


Book Description

Big data has incredible business value, and Splunk is the best tool for unlocking that value. Exploring Splunk shows you how to pinpoint answers and find patterns obscured by the flood of machinegenerated data. This book uses an engaging, visual presentation style that quickly familiarizes you with how to use Splunk. You'll move from mastering Splunk basics to creatively solving real-world problems, finding the gems hidden in big data.




Flask Web Development


Book Description

Take full creative control of your web applications with Flask, the Python-based microframework. With the second edition of this hands-on book, youâ??ll learn Flask from the ground up by developing a complete, real-world application created by author Miguel Grinberg. This refreshed edition accounts for important technology changes that have occurred in the past three years. Explore the frameworkâ??s core functionality, and learn how to extend applications with advanced web techniques such as database migrations and an application programming interface. The first part of each chapter provides you with reference and background for the topic in question, while the second part guides you through a hands-on implementation. If you have Python experience, youâ??re ready to take advantage of the creative freedom Flask provides. Three sections include: A thorough introduction to Flask: explore web application development basics with Flask and an application structure appropriate for medium and large applications Building Flasky: learn how to build an open source blogging application step-by-step by reusing templates, paginating item lists, and working with rich text Going the last mile: dive into unit testing strategies, performance analysis techniques, and deployment options for your Flask application




Machine Learning


Book Description

A comprehensive introduction to machine learning that uses probabilistic models and inference as a unifying approach. Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package—PMTK (probabilistic modeling toolkit)—that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.




The Linux Command Line, 2nd Edition


Book Description

You've experienced the shiny, point-and-click surface of your Linux computer--now dive below and explore its depths with the power of the command line. The Linux Command Line takes you from your very first terminal keystrokes to writing full programs in Bash, the most popular Linux shell (or command line). Along the way you'll learn the timeless skills handed down by generations of experienced, mouse-shunning gurus: file navigation, environment configuration, command chaining, pattern matching with regular expressions, and more. In addition to that practical knowledge, author William Shotts reveals the philosophy behind these tools and the rich heritage that your desktop Linux machine has inherited from Unix supercomputers of yore. As you make your way through the book's short, easily-digestible chapters, you'll learn how to: • Create and delete files, directories, and symlinks • Administer your system, including networking, package installation, and process management • Use standard input and output, redirection, and pipelines • Edit files with Vi, the world's most popular text editor • Write shell scripts to automate common or boring tasks • Slice and dice text files with cut, paste, grep, patch, and sed Once you overcome your initial "shell shock," you'll find that the command line is a natural and expressive way to communicate with your computer. Just don't be surprised if your mouse starts to gather dust.







The Ultimate Sniper


Book Description

Through revised text, new photos, specialised illustrations, updated charts and additional information sidebars, The Ultimate Sniper once again thoroughly details the three great skill areas of sniping; marksmanship, fieldcraft and tactics.