Fairness and Machine Learning


Book Description

An introduction to the intellectual foundations and practical utility of the recent work on fairness and machine learning. Fairness and Machine Learning introduces advanced undergraduate and graduate students to the intellectual foundations of this recently emergent field, drawing on a diverse range of disciplinary perspectives to identify the opportunities and hazards of automated decision-making. It surveys the risks in many applications of machine learning and provides a review of an emerging set of proposed solutions, showing how even well-intentioned applications may give rise to objectionable results. It covers the statistical and causal measures used to evaluate the fairness of machine learning models as well as the procedural and substantive aspects of decision-making that are core to debates about fairness, including a review of legal and philosophical perspectives on discrimination. This incisive textbook prepares students of machine learning to do quantitative work on fairness while reflecting critically on its foundations and its practical utility. • Introduces the technical and normative foundations of fairness in automated decision-making • Covers the formal and computational methods for characterizing and addressing problems • Provides a critical assessment of their intellectual foundations and practical utility • Features rich pedagogy and extensive instructor resources




Limitations of Fairness in Machine Learning


Book Description

The issue of socially responsible machine learning has never been more pressing. An entire field of machine learning is dedicated to investigating the societal aspects of automated decision-making systems and providing technical solutions for algorithmic fairness. However, any attempt to improve the fairness of algorithms must be examined under the lens of potential societal harm. In this thesis, we study existing approaches to fair classification and shed light on their various limitations. First, we show that relaxations of fairness constraints used to simplify the learning process of fair models are too coarse, since the final classifier may be distinctly unfair even though the relaxed constraint is satisfied. In response, we propose a new and provably fair method that incorporates the fairness relaxations in a strongly convex formulation. Second, we observe an increased awareness of protected attributes such as race or gender in the last layer of deep neural networks when we regularize them for fair outcomes. Based on this observation, we construct a neural network that explicitly treats input points differently because of protected personal characteristics. With this explicit formulation, we can replicate the predictions of a fair neural network. We argue that both the fair neural network and the explicit formulation demonstrate disparate treatment-a form of discrimination in anti-discrimination laws. Third, we consider fairness properties of the majority vote-a popular ensemble method for aggregating multiple machine learning models to obtain more accurate and robust decisions. We algorithmically investigate worst-case fairness guarantees of the majority vote when it consists of multiple classifiers that are themselves already fair. Under strong independence assumptions on the classifiers, we can guarantee a fair majority vote. Without any assumptions on the classifiers, a fair majority vote cannot be guaranteed in general, but different fairness regimes are possible: on the one hand, using fair classifiers may improve the worst-case fairness guarantees. On the other hand, the majority vote may not be fair at all.




Big Data and Social Science


Book Description

Both Traditional Students and Working Professionals Acquire the Skills to Analyze Social Problems. Big Data and Social Science: A Practical Guide to Methods and Tools shows how to apply data science to real-world problems in both research and the practice. The book provides practical guidance on combining methods and tools from computer science, statistics, and social science. This concrete approach is illustrated throughout using an important national problem, the quantitative study of innovation. The text draws on the expertise of prominent leaders in statistics, the social sciences, data science, and computer science to teach students how to use modern social science research principles as well as the best analytical and computational tools. It uses a real-world challenge to introduce how these tools are used to identify and capture appropriate data, apply data science models and tools to that data, and recognize and respond to data errors and limitations. For more information, including sample chapters and news, please visit the author's website.




Ethics in Artificial Intelligence: Bias, Fairness and Beyond


Book Description

This book is a collection of chapters in the newly developing area of ethics in artificial intelligence. The book comprises chapters written by leading experts in this area which makes it a one of its kind collections. Some key features of the book are its unique combination of chapters on both theoretical and practical aspects of integrating ethics into artificial intelligence. The book touches upon all the important concepts in this area including bias, discrimination, fairness, and interpretability. Integral components can be broadly divided into two segments – the first segment includes empirical identification of biases, discrimination, and the ethical concerns thereof in impact assessment, advertising and personalization, computational social science, and information retrieval. The second segment includes operationalizing the notions of fairness, identifying the importance of fairness in allocation, clustering and time series problems, and applications of fairness in software testing/debugging and in multi stakeholder platforms. This segment ends with a chapter on interpretability of machine learning models which is another very important and emerging topic in this area.




The Ethical Algorithm


Book Description

Over the course of a generation, algorithms have gone from mathematical abstractions to powerful mediators of daily life. Algorithms have made our lives more efficient, more entertaining, and, sometimes, better informed. At the same time, complex algorithms are increasingly violating the basic rights of individual citizens. Allegedly anonymized datasets routinely leak our most sensitive personal information; statistical models for everything from mortgages to college admissions reflect racial and gender bias. Meanwhile, users manipulate algorithms to "game" search engines, spam filters, online reviewing services, and navigation apps. Understanding and improving the science behind the algorithms that run our lives is rapidly becoming one of the most pressing issues of this century. Traditional fixes, such as laws, regulations and watchdog groups, have proven woefully inadequate. Reporting from the cutting edge of scientific research, The Ethical Algorithm offers a new approach: a set of principled solutions based on the emerging and exciting science of socially aware algorithm design. Michael Kearns and Aaron Roth explain how we can better embed human principles into machine code - without halting the advance of data-driven scientific exploration. Weaving together innovative research with stories of citizens, scientists, and activists on the front lines, The Ethical Algorithm offers a compelling vision for a future, one in which we can better protect humans from the unintended impacts of algorithms while continuing to inspire wondrous advances in technology.




Patterns, Predictions, and Actions: Foundations of Machine Learning


Book Description

An authoritative, up-to-date graduate textbook on machine learning that highlights its historical context and societal impacts Patterns, Predictions, and Actions introduces graduate students to the essentials of machine learning while offering invaluable perspective on its history and social implications. Beginning with the foundations of decision making, Moritz Hardt and Benjamin Recht explain how representation, optimization, and generalization are the constituents of supervised learning. They go on to provide self-contained discussions of causality, the practice of causal inference, sequential decision making, and reinforcement learning, equipping readers with the concepts and tools they need to assess the consequences that may arise from acting on statistical decisions. Provides a modern introduction to machine learning, showing how data patterns support predictions and consequential actions Pays special attention to societal impacts and fairness in decision making Traces the development of machine learning from its origins to today Features a novel chapter on machine learning benchmarks and datasets Invites readers from all backgrounds, requiring some experience with probability, calculus, and linear algebra An essential textbook for students and a guide for researchers




Oxford Handbook of Ethics of AI


Book Description

This volume tackles a quickly-evolving field of inquiry, mapping the existing discourse as part of a general attempt to place current developments in historical context; at the same time, breaking new ground in taking on novel subjects and pursuing fresh approaches. The term "A.I." is used to refer to a broad range of phenomena, from machine learning and data mining to artificial general intelligence. The recent advent of more sophisticated AI systems, which function with partial or full autonomy and are capable of tasks which require learning and 'intelligence', presents difficult ethical questions, and has drawn concerns from many quarters about individual and societal welfare, democratic decision-making, moral agency, and the prevention of harm. This work ranges from explorations of normative constraints on specific applications of machine learning algorithms today-in everyday medical practice, for instance-to reflections on the (potential) status of AI as a form of consciousness with attendant rights and duties and, more generally still, on the conceptual terms and frameworks necessarily to understand tasks requiring intelligence, whether "human" or "A.I."




Evaluating the Fairness in the Performance of Machine Learning Methods


Book Description

Machine learning plays an increasingly important role in our lives, tackling both prevalent and specialized but high-risk problems. Motivated by legislation, responsibility to ensure transparency and accountability of machine learning methods and needs to maintain public's trust on the algorithms used in our lives, researchers have paid much attention to the fairness issue in machine learning. There are many methods developed to measure, reduce and even eliminate the fairness issue for both general and specific settings or algorithms. In this project, we focus on fairness in classification machine learning problems in healthcare which is one critical field of the application of machine learning. We found a general way to detect the fairness issue in the performance of machine learning methods and found the general solutions to address the issue in all the dimensions of data, method and metrics. We also introduced fairness threshold to help reduce the fairness issue without retraining the model and performance boundary to help analyze the effect of the methods we tried.




Practical Fairness


Book Description

Fairness is an increasingly important topic as machine learning and AI more generally take over the world. While this is an active area of research, many realistic best practices are emerging at all steps along the data pipeline, from data selection and preprocessing to blackbox model audits. This book will guide you through the technical, legal, and ethical aspects of making your code fair and secure while highlighting cutting edge academic research and ongoing legal developments related to fairness and algorithms. There is mounting evidence that the widespread deployment of machine learning and artificial intelligence in business and government is reproducing the same biases we are trying to fight in the real world. For this reason, fairness is an increasingly important consideration for the data scientist. Yet discussions of what fairness means in terms of actual code are few and far between. This code will show you how to code fairly as well as cover basic concerns related to data security and privacy from a fairness perspective.




All of Statistics


Book Description

Taken literally, the title "All of Statistics" is an exaggeration. But in spirit, the title is apt, as the book does cover a much broader range of topics than a typical introductory book on mathematical statistics. This book is for people who want to learn probability and statistics quickly. It is suitable for graduate or advanced undergraduate students in computer science, mathematics, statistics, and related disciplines. The book includes modern topics like non-parametric curve estimation, bootstrapping, and classification, topics that are usually relegated to follow-up courses. The reader is presumed to know calculus and a little linear algebra. No previous knowledge of probability and statistics is required. Statistics, data mining, and machine learning are all concerned with collecting and analysing data.