Veridical Data Science


Book Description

Using real-world data case studies, this innovative and accessible textbook introduces an actionable framework for conducting trustworthy data science. Most textbooks present data science as a linear analytic process involving a set of statistical and computational techniques without accounting for the challenges intrinsic to real-world applications. Veridical Data Science, by contrast, embraces the reality that most projects begin with an ambiguous domain question and messy data; it acknowledges that datasets are mere approximations of reality while analyses are mental constructs. Bin Yu and Rebecca Barter employ the innovative Predictability, Computability, and Stability (PCS) framework to assess the trustworthiness and relevance of data-driven results relative to three sources of uncertainty that arise throughout the data science life cycle: the human decisions and judgment calls made during data collection, cleaning, and modeling. By providing real-world data case studies, intuitive explanations of common statistical and machine learning techniques, and supplementary R and Python code, Veridical Data Science offers a clear and actionable guide for conducting responsible data science. Requiring little background knowledge, this lucid, self-contained textbook provides a solid foundation and principled framework for future study of advanced methods in machine learning, statistics, and data science. Presents the Predictability, Computability, and Stability (PCS) methodology for producing trustworthy data-driven results Teaches how a data science project should be conducted from beginning to end, including extensive discussion of the data scientist's decision-making process Cultivates critical thinking throughout the entire data science life cycle Provides practical examples and illuminating case studies of real-world data analysis problems with associated code, exercises, and solutions Suitable for advanced undergraduate and graduate students, domain scientists, and practitioners




Leadership in Statistics and Data Science


Book Description

This edited collection brings together voices of the strongest thought leaders on diversity, equity and inclusion in the field of statistics and data science, with the goal of encouraging and steering the profession into the regular practice of inclusive and humanistic leadership. It provides futuristic ideas for promoting opportunities for equitable leadership, as well as tested approaches that have already been found to make a difference. It speaks to the challenges and opportunities of leading successful research collaborations and making strong connections within research teams. Curated with a vision that leadership takes a myriad of forms, and that diversity has many dimensions, this volume examines the nuances of leadership within a workplace environment and promotes storytelling and other competencies as critical elements of effective leadership. It makes the case for inclusive and humanistic leadership in statistics and data science, where there often remains a dearth of women and members of certain racial communities among the employees. Titled and non-titled leaders will benefit from the planning, evaluation, and structural tools offered within to contribute inclusive excellence in workplace climate, environment, and culture.




Communicating with Data


Book Description

Communication is a critical yet often overlooked part of data science. Communicating with Data aims to help students and researchers write about their insights in a way that is both compelling and faithful to the data. General advice on science writing is also provided, including how to distill findings into a story and organize and revise the story, and how to write clearly, concisely, and precisely. This is an excellent resource for students who want to learn how to write about scientific findings, and for instructors who are teaching a science course in communication or a course with a writing component. Communicating with Data consists of five parts. Part I helps the novice learn to write by reading the work of others. Part II delves into the specifics of how to describe data at a level appropriate for publication, create informative and effective visualizations, and communicate an analysis pipeline through well-written, reproducible code. Part III demonstrates how to reduce a data analysis to a compelling story and organize and write the first draft of a technical paper. Part IV addresses revision; this includes advice on writing about statistical findings in a clear and accurate way, general writing advice, and strategies for proof reading and revising. Part V offers advice about communication strategies beyond the page, which include giving talks, building a professional network, and participating in online communities. This book also provides 22 portfolio prompts that extend the guidance and examples in the earlier parts of the book and help writers build their portfolio of data communication.




CLADAG 2021 BOOK OF ABSTRACTS AND SHORT PAPERS


Book Description

The book collects the short papers presented at the 13th Scientific Meeting of the Classification and Data Analysis Group (CLADAG) of the Italian Statistical Society (SIS). The meeting has been organized by the Department of Statistics, Computer Science and Applications of the University of Florence, under the auspices of the Italian Statistical Society and the International Federation of Classification Societies (IFCS). CLADAG is a member of the IFCS, a federation of national, regional, and linguistically-based classification societies. It is a non-profit, non-political scientific organization, whose aims are to further classification research.







Science, Reason, and Faith


Book Description

Built into our very nature is a desire to know about the world around us. The big questions of human existence are inescapable: Who am I? Why am I here, and where am I going? Why is there evil in the world? What is the meaning of life? This yearning for truth ultimately leads us to our Creator. God knows the longings of the human heart, and he reveals himself to us through creation, through Scripture, and ultimately through the Incarnation. Because God the Son became man, we have a person to look to in our pursuit of truth: Jesus Christ himself, who is Truth. Christ helps us see that truth is not just the object of science and reason, but what animates the mysterious and loving power of faith. In Science, Reason, and Faith, Fr. Robert Spitzer, SJ, explores in depth the Bible and the intersection of three realms that the secular world tells us are separate and incompatible. Fr. Spitzer draws the modern reader's attention to the many seeming conflicts between science, reason, and Catholic teaching. By tackling these difficult questions, he shows that it is precisely through the integration of science, reason, and faith that we can truly discover ourselves, our world, and our God.




Knowledge Graphs


Book Description

A rigorous and comprehensive textbook covering the major approaches to knowledge graphs, an active and interdisciplinary area within artificial intelligence. The field of knowledge graphs, which allows us to model, process, and derive insights from complex real-world data, has emerged as an active and interdisciplinary area of artificial intelligence over the last decade, drawing on such fields as natural language processing, data mining, and the semantic web. Current projects involve predicting cyberattacks, recommending products, and even gleaning insights from thousands of papers on COVID-19. This textbook offers rigorous and comprehensive coverage of the field. It focuses systematically on the major approaches, both those that have stood the test of time and the latest deep learning methods.




The Simple Genetic Algorithm


Book Description

Content Description #"A Bradford book."#Includes bibliographical references (p.) and index.




Veridical Data Science


Book Description

Using real-world data case studies, this innovative and accessible textbook introduces an actionable framework for conducting trustworthy data science. Most textbooks present data science as a linear analytic process involving a set of statistical and computational techniques without accounting for the challenges intrinsic to real-world applications. Veridical Data Science, by contrast, embraces the reality that most projects begin with an ambiguous domain question and messy data; it acknowledges that datasets are mere approximations of reality while analyses are mental constructs. Bin Yu and Rebecca Barter employ the innovative Predictability, Computability, and Stability (PCS) framework to assess the trustworthiness and relevance of data-driven results relative to three sources of uncertainty that arise throughout the data science life cycle: the human decisions and judgment calls made during data collection, cleaning, and modeling. By providing real-world data case studies, intuitive explanations of common statistical and machine learning techniques, and supplementary R and Python code, Veridical Data Science offers a clear and actionable guide for conducting responsible data science. Requiring little background knowledge, this lucid, self-contained textbook provides a solid foundation and principled framework for future study of advanced methods in machine learning, statistics, and data science. Presents the Predictability, Computability, and Stability (PCS) methodology for producing trustworthy data-driven results Teaches how a data science project should be conducted from beginning to end, including extensive discussion of the data scientist's decision-making process Cultivates critical thinking throughout the entire data science life cycle Provides practical examples and illuminating case studies of real-world data analysis problems with associated code, exercises, and solutions Suitable for advanced undergraduate and graduate students, domain scientists, and practitioners




ECKM 2023 24th European Conference on Knowledge Management Vol 2


Book Description

These proceedings represent the work of contributors to the 24th European Conference on Knowledge Management (ECKM 2023), hosted by Iscte – Instituto Universitário de Lisboa, Portugal on 7-8 September 2023. The Conference Chair is Prof Florinda Matos, and the Programme Chair is Prof Álvaro Rosa, both from Iscte Business School, Iscte – Instituto Universitário de Lisboa, Portugal. ECKM is now a well-established event on the academic research calendar and now in its 24th year the key aim remains the opportunity for participants to share ideas and meet the people who hold them. The scope of papers will ensure an interesting two days. The subjects covered illustrate the wide range of topics that fall into this important and ever-growing area of research. The opening keynote presentation is given by Professor Leif Edvinsson, on the topic of Intellectual Capital as a Missed Value. The second day of the conference will open with an address by Professor Noboru Konno from Tama Graduate School and Keio University, Japan who will talk about Society 5.0, Knowledge and Conceptual Capability, and Professor Jay Liebowitz, who will talk about Digital Transformation for the University of the Future. With an initial submission of 350 abstracts, after the double blind, peer review process there are 184 Academic research papers, 11 PhD research papers, 1 Masters Research paper, 4 Non-Academic papers and 11 work-in-progress papers published in these Conference Proceedings. These papers represent research from Australia, Austria, Brazil, Bulgaria, Canada, Chile, China, Colombia, Cyprus, Czech Republic, Denmark, Finland, France, Germany, Greece, Hungary, India, Iran, Iraq, Ireland, Israel, Italy, Japan, Jordan, Kazakhstan, Kuwait, Latvia, Lithuania, Malaysia, México, Morocco, Netherlands, Norway, Palestine, Peru, Philippines, Poland, Portugal, Romania, South Africa, Spain, Sweden, Switzerland, Taiwan, Thailand, Tunisia, UK, United Arab Emirates and the USA.