Humanities Data in R


Book Description




Humanities Data Analysis


Book Description

A practical guide to data-intensive humanities research using the Python programming language The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment. The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter. An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions. Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python Applicable to many humanities disciplines, including history, literature, and sociology Offers real-world case studies using publicly available data sets Provides exercises at the end of each chapter for students to test acquired skills Emphasizes visual storytelling via data visualizations




Text Analysis with R


Book Description

Now in its second edition, Text Analysis with R provides a practical introduction to computational text analysis using the open source programming language R. R is an extremely popular programming language, used throughout the sciences; due to its accessibility, R is now used increasingly in other research areas. In this volume, readers immediately begin working with text, and each chapter examines a new technique or process, allowing readers to obtain a broad exposure to core R procedures and a fundamental understanding of the possibilities of computational text analysis at both the micro and the macro scale. Each chapter builds on its predecessor as readers move from small scale “microanalysis” of single texts to large scale “macroanalysis” of text corpora, and each concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book’s focus is on making the technical palatable and making the technical useful and immediately gratifying. Text Analysis with R is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological toolkit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that readers simply cannot gather using traditional qualitative methods of close reading and human synthesis. This new edition features two new chapters: one that introduces dplyr and tidyr in the context of parsing and analyzing dramatic texts to extract speaker and receiver data, and one on sentiment analysis using the syuzhet package. It is also filled with updated material in every chapter to integrate new developments in the field, current practices in R style, and the use of more efficient algorithms.




Big Data in Computational Social Science and Humanities


Book Description

This edited volume focuses on big data implications for computational social science and humanities from management to usage. The first part of the book covers geographic data, text corpus data, and social media data, and exemplifies their concrete applications in a wide range of fields including anthropology, economics, finance, geography, history, linguistics, political science, psychology, public health, and mass communications. The second part of the book provides a panoramic view of the development of big data in the fields of computational social sciences and humanities. The following questions are addressed: why is there a need for novel data governance for this new type of data?, why is big data important for social scientists?, and how will it revolutionize the way social scientists conduct research? With the advent of the information age and technologies such as Web 2.0, ubiquitous computing, wearable devices, and the Internet of Things, digital society has fundamentally changed what we now know as "data", the very use of this data, and what we now call "knowledge". Big data has become the standard in social sciences, and has made these sciences more computational. Big Data in Computational Social Science and Humanities will appeal to graduate students and researchers working in the many subfields of the social sciences and humanities.




Humanities Data in R


Book Description

​This pioneering book teaches readers to use R within four core analytical areas applicable to the Humanities: networks, text, geospatial data, and images. This book is also designed to be a bridge: between quantitative and qualitative methods, individual and collaborative work, and the humanities and social sciences. Humanities Data with R does not presuppose background programming experience. Early chapters take readers from R set-up to exploratory data analysis (continuous and categorical data, multivariate analysis, and advanced graphics with emphasis on aesthetics and facility). Following this, networks, geospatial data, image data, natural language processing and text analysis each have a dedicated chapter. Each chapter is grounded in examples to move readers beyond the intimidation of adding new tools to their research. Everything is hands-on: networks are explained using U.S. Supreme Court opinions, and low-level NLP methods are applied to short stories by Sir Arthur Conan Doyle. After working through these examples with the provided data, code and book website, readers are prepared to apply new methods to their own work. The open source R programming language, with its myriad packages and popularity within the sciences and social sciences, is particularly well-suited to working with humanities data. R packages are also highlighted in an appendix. This book uses an expanded conception of the forms data may take and the information it represents. The methodology will have wide application in classrooms and self-study for the humanities, but also for use in linguistics, anthropology, and political science. Outside the classroom, this intersection of humanities and computing is particularly relevant for research and new modes of dissemination across archives, museums and libraries. ​




Routledge International Handbook of Research Methods in Digital Humanities


Book Description

This book draws on both traditional and emerging fields of study to consider consider what a grounded definition of quantitative and qualitative research in the Digital Humanities (DH) might mean; which areas DH can fruitfully draw on in order to foster and develop that understanding; where we can see those methods applied; and what the future directions of research methods in Digital Humanities might look like. Schuster and Dunn map a wide-ranging DH research methodology by drawing on both ‘traditional’ fields of DH study such as text, historical sources, museums and manuscripts, and innovative areas in research production, such as knowledge and technology, digital culture and society and history of network technologies. Featuring global contributions from scholars in the United Kingdom, the United States, Europe and Australia, this book draws together a range of disciplinary perspectives to explore the exciting developments offered by this fast-evolving field. Routledge International Handbook of Research Methods in Digital Humanities is essential reading for anyone who teaches, researches or studies Digital Humanities or related subjects.




Defining Digital Humanities


Book Description

This reader brings together the essential readings that have emerged in Digital Humanities. It provides a historical overview of how the term ‘Humanities Computing’ developed into the term ‘Digital Humanities’, and highlights core readings which explore the meaning, scope, and implementation of the field. To contextualize and frame each included reading, the editors and authors provide a commentary on the original piece. There is also an annotated bibliography of other material not included in the text to provide an essential list of reading in the discipline.




Computational Humanities


Book Description

The first book to intervene in debates on computation in the digital humanities Bringing together leading experts from across North America and Europe, Computational Humanities redirects debates around computation and humanities digital scholarship from dualistic arguments to nuanced discourse centered around theories of knowledge and power. This volume is organized around four questions: Why or why not pursue computational humanities? How do we engage in computational humanities? What can we study using these methods? Who are the stakeholders? Recent advances in technologies for image and sound processing have expanded computational approaches to cultural forms beyond text, and new forms of data, from listservs and code repositories to tweets and other social media content, have enlivened debates about what counts as digital humanities scholarship. Providing case studies of collaborations between humanities-centered and computation-centered researchers, this volume highlights both opportunities and frictions, showing that data and computation are as much about power, prestige, and precarity as they are about p-values. Contributors: Mark Algee-Hewitt, Stanford U; David Bamman, U of California, Berkeley; Kaspar Beelen, U of London; Peter Bell, Philipps U of Marburg; Tobias Blanke, U of Amsterdam; Julia Damerow, Arizona State U; Quinn Dombrowski, Stanford U; Crystal Nicole Eddins, U of Pittsburgh; Abraham Gibson, U of Texas at San Antonio; Tassie Gniady; Crystal Hall, Bowdoin College; Vanessa M. Holden, U of Kentucky; David Kloster, Indiana U; Manfred D. Laubichler, Arizona State U; Katherine McDonough, Lancaster U; Barbara McGillivray, King’s College London; Megan Meredith-Lobay, Simon Fraser U; Federico Nanni, Alan Turing Institute; Fabian Offert, U of California, Santa Barbara; Hannah Ringler, Illinois Institute of Technology; Roopika Risam, Dartmouth College; Joshua D. Rothman, U of Alabama; Benjamin M. Schmidt; Lisa Tagliaferri, Rutgers U; Jeffrey Tharsen, U of Chicago; Marieke van Erp, Royal Netherlands Academy of Arts and Sciences; Lee Zickel, Case Western Reserve U.




Doing Digital Humanities


Book Description

Digital Humanities is rapidly evolving as a significant approach to/method of teaching, learning and research across the humanities. This is a first-stop book for people interested in getting to grips with digital humanities whether as a student or a professor. The book offers a practical guide to the area as well as offering reflection on the main objectives and processes, including: Accessible introductions of the basics of Digital Humanities through to more complex ideas A wide range of topics from feminist Digital Humanities, digital journal publishing, gaming, text encoding, project management and pedagogy Contextualised case studies Resources for starting Digital Humanities such as links, training materials and exercises Doing Digital Humanities looks at the practicalities of how digital research and creation can enhance both learning and research and offers an approachable way into this complex, yet essential topic.




Multivariate Humanities


Book Description

This case study-based textbook in multivariate analysis for advanced students in the humanities emphasizes descriptive, exploratory analyses of various types of datasets from a wide range of sub-disciplines, promoting the use of multivariate analysis and illustrating its wide applicability. Fields featured include, but are not limited to, historical agriculture, arts (music and painting), theology, and stylometrics (authorship issues). Most analyses are based on existing data, earlier analysed in published peer-reviewed papers. Four preliminary methodological and statistical chapters provide general technical background to the case studies. The multivariate statistical methods presented and illustrated include data inspection, several varieties of principal component analysis, correspondence analysis, multidimensional scaling, cluster analysis, regression analysis, discriminant analysis, and three-mode analysis. The bulk of the text is taken up by 14 case studies that lean heavily on graphical representations of statistical information such as biplots, using descriptive statistical techniques to support substantive conclusions. Each study features a description of the substantive background to the data, followed by discussion of appropriate multivariate techniques, and detailed results interpreted through graphical illustrations. Each study is concluded with a conceptual summary. Datasets in SPSS are included online.