Humanities Data Analysis


Book Description

A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations




Social Science Research


Book Description

This book is designed to introduce doctoral and graduate students to the process of conducting scientific research in the social sciences, business, education, public health, and related disciplines. It is a one-stop, comprehensive, and compact source for foundational concepts in behavioral research, and can serve as a stand-alone text or as a supplement to research readings in any doctoral seminar or research methods class. This book is currently used as a research text at universities on six continents and will shortly be available in nine different languages.




Humanities Data in R


Book Description




Quantitative Methods in the Humanities


Book Description

This timely and lucid guide is intended for students and scholars working on all historical periods and topics in the humanities and social sciences--especially for those who do not think of themselves as experts in quantification, "big data," or "digital humanities." The authors reveal quantification to be a powerful and versatile tool, applicable to a myriad of materials from the past. Their book, accessible to complete beginners, offers detailed advice and practical tips on how to build a dataset from historical sources and how to categorize it according to specific research questions. Drawing on examples from works in social, political, economic, and cultural history, the book guides readers through a wide range of methods, including sampling, cross-tabulations, statistical tests, regression, factor analysis, network analysis, sequence analysis, event history analysis, geographical information systems, text analysis, and visualization. The requirements, advantages, and pitfalls of these techniques are presented in layperson's terms, avoiding mathematical terminology. Conceived primarily for historians, the book will prove invaluable to other humanists, as well as to social scientists looking for a nontechnical introduction to quantitative methods. Covering the most recent techniques, in addition to others not often enough discussed, the book will also have much to offer to the most seasoned practitioners of quantification.




Scientometrics for the Humanities and Social Sciences


Book Description

Scientometrics for the Humanities and Social Sciences is the first ever book on scientometrics that deals with the historical development of both quantitative and qualitative data analysis in scientometric studies. It focuses on its applicability in new and emerging areas of inquiry. This important book presents the inherent potential for data mining and analysis of qualitative data in scientometrics. The author provides select cases of scientometric studies in the humanities and social sciences, explaining their research objectives, sources of data and methodologies. It illustrates how data can be gathered not only from prominent online databases and repositories, but also from journals that are not stored in these databases. With the support of specific examples, the book shows how data on demographic variables can be collected to supplement scientometric data. The book deals with a research methodology which has an increasing applicability not only to the study of science, but also to the study of the disciplines in the humanities and social sciences.




Text Mining for Qualitative Data Analysis in the Social Sciences


Book Description

Gregor Wiedemann evaluates text mining applications for social science studies with respect to conceptual integration of consciously selected methods, systematic optimization of algorithms and workflows, and methodological reflections relating to empirical research. In an exemplary study, he introduces workflows to analyze a corpus of around 600,000 newspaper articles on the subject of “democratic demarcation” in Germany. He provides a valuable resource for innovative measures to social scientists and computer scientists in the field of applied natural language processing.




Text Analysis with R


Book Description

Now in its second edition, Text Analysis with R provides a practical introduction to computational text analysis using the open source programming language R. R is an extremely popular programming language, used throughout the sciences; due to its accessibility, R is now used increasingly in other research areas. In this volume, readers immediately begin working with text, and each chapter examines a new technique or process, allowing readers to obtain a broad exposure to core R procedures and a fundamental understanding of the possibilities of computational text analysis at both the micro and the macro scale. Each chapter builds on its predecessor as readers move from small scale “microanalysis” of single texts to large scale “macroanalysis” of text corpora, and each concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book’s focus is on making the technical palatable and making the technical useful and immediately gratifying. Text Analysis with R is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological toolkit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that readers simply cannot gather using traditional qualitative methods of close reading and human synthesis. This new edition features two new chapters: one that introduces dplyr and tidyr in the context of parsing and analyzing dramatic texts to extract speaker and receiver data, and one on sentiment analysis using the syuzhet package. It is also filled with updated material in every chapter to integrate new developments in the field, current practices in R style, and the use of more efficient algorithms.




The Behavioral and Social Sciences


Book Description

This volume explores the scientific frontiers and leading edges of research across the fields of anthropology, economics, political science, psychology, sociology, history, business, education, geography, law, and psychiatry, as well as the newer, more specialized areas of artificial intelligence, child development, cognitive science, communications, demography, linguistics, and management and decision science. It includes recommendations concerning new resources, facilities, and programs that may be needed over the next several years to ensure rapid progress and provide a high level of returns to basic research.




Cultural Science


Book Description

Cultural Science introduces a new way of thinking about culture. Adopting an evolutionary and systems approach, the authors argue that culture is the population-wide source of newness and innovation; it faces the future, not the past. Its chief characteristic is the formation of groups or 'demes' (organised and productive subpopulation; 'demos'). Demes are the means for creating, distributing and growing knowledge. However, such groups are competitive and knowledge-systems are adversarial. Starting from a rereading of Darwinian evolutionary theory, the book utilises multidisciplinary resources: Raymond Williams's 'culture is ordinary' approach; evolutionary science (e.g. Mark Pagel and Herbert Gintis); semiotics (Yuri Lotman); and economic theory (from Schumpeter to McCloskey). Successive chapters argue that: -Culture and knowledge need to be understood from an externalist ('linked brains') perspective, rather than through the lens of individual behaviour; -Demes are created by culture, especially storytelling, which in turn constitutes both politics and economics; -The clash of systems - including demes - is productive of newness, meaningfulness and successful reproduction of culture; -Contemporary urban culture and citizenship can best be explained by investigating how culture is used, and how newness and innovation emerge from unstable and contested boundaries between different meaning systems; -The evolution of culture is a process of technologically enabled 'demic concentration' of knowledge, across overlapping meaning-systems or semiospheres; a process where the number of demes accessible to any individual has increased at an accelerating rate, resulting in new problems of scale and coordination for cultural science to address. The book argues for interdisciplinary 'consilience', linking evolutionary and complexity theory in the natural sciences, economics and anthropology in the social sciences, and cultural, communication and media studies in the humanities and creative arts. It describes what is needed for a new 'modern synthesis' for the cultural sciences. It combines analytical and historical methods, to provide a framework for a general reconceptualisation of the theory of culture – one that is focused not on its political or customary aspects but rather its evolutionary significance as a generator of newness and innovation.




Data Analytics in Digital Humanities


Book Description

This book covers computationally innovative methods and technologies including data collection and elicitation, data processing, data analysis, data visualizations, and data presentation. It explores how digital humanists have harnessed the hypersociality and social technologies, benefited from the open-source sharing not only of data but of code, and made technological capabilities a critical part of humanities work. Chapters are written by researchers from around the world, bringing perspectives from diverse fields and subject areas. The respective authors describe their work, their research, and their learning. Topics include semantic web for cultural heritage valorization, machine learning for parody detection by classification, psychological text analysis, crowdsourcing imagery coding in natural disasters, and creating inheritable digital codebooks.Designed for researchers and academics, this book is suitable for those interested in methodologies and analytics that can be applied in literature, history, philosophy, linguistics, and related disciplines. Professionals such as librarians, archivists, and historians will also find the content informative and instructive.