Scalability Issues in Authorship Attribution


Book Description

Provides an in-depth and systematic study of the so-called scalability issues in authorship attribution -- the task that aims to identify the author of a text, given a model of authorial style based on texts of known authorship. Computational authorship attribution does not rely on in-depth reading, but rather automates the process. This book investigates the behavior of a text categorization approach to the task when confronted with scalability issues. By addressing the issues of experimental design, data size, and author set size, the dissertation demonstrates whether the approach taken is valid in experiments with limited or sufficient data, and with small or large sets of authors.




Authorship Attribution


Book Description

Authorship Attribution surveys the history and present state of the discipline, presenting some comparative results where available. It also provides a theoretical and empirically-tested basis for further work. Many modern techniques are described and evaluated, along with some insights for application for novices and experts alike.




Authorship attribution in Turkish Texts


Book Description

The latest developments in the field of computer technology have created new ways to share information without time and space limits. Computer technologies have not only made life easier and more accessible for users, but they have also opened up a new arena for illegal activities. These illegal actions have found an opportunity to spread via e-mails, websites, Internet chat rooms, forum pages, and social networking websites (like Facebook, Twitter, Instagram). Online contributors do not need to provide information such as their real names, the city where they live, age or gender in order to share their opinions, and such feelings of anonymity encourage criminal activities. Thus, disputed authorship cases have become one of the main challenges of the technological era. This research is a corpus-based simulated authorship casework application in Turkish. Texts for the corpora were collected from a collaborative online encyclopaedia – Eksi Sozluk (Sour Times) and Twitter. The corpus consists of 900 texts from 52 authors in total. However, 105 texts belong to seven authors from Twitter. The two methodological approaches that were applied are qualitative and statistical methods, according to Grant’s (2013) approach. Ten different tests were applied, depending on the various parameters that are forensically possible in real-world cases. Accordingly, the role of feature type, size, including the candidate author size, text size and a limited number of texts per author and finally cross-genre application were tested. The analyses revealed that such a combined approach has promising results in some tests in that they attributed authorship in Turkish. The findings of the research indicated that there is the potential to attribute unknown authors in Turkish and it appears that the results have significant conclusions for the broader application of forensic authorship attribution techniques in Turkish texts. Keywords: Authorship Attribution, Turkish, Forensic Linguistics, Authorship Analysis




›Prometheus Bound‹ – A Separate Authorial Trace in the Aeschylean Corpus


Book Description

Classics, Computer Science, and Linguistics are brought together in this book, in an attempt to provide an answer to the authorship question concerning Prometheus Bound, a disputed play in the Aeschylean corpus, by applying some well-established Computer Stylistics methods. One of the main objectives of Stylometry, which, broadly speaking, is the study of quantified style, is Authorship Attribution. In its traditional form it can range from manually calculating descriptive statistics to the use of computer-assisted methodologies. However, non-traditional Authorship Attribution drastically changed the field. It brought together modern Linguistics and Artificial Intelligence applications (machine learning, natural language processing), and its key characteristic is that it aims at developing fully-automated systems for the attribution of texts of unknown authorship. In this book the author employs a series of supervised and unsupervised techniques used in non-traditional Authorship Attribution–applied here for the first time in ancient drama. The outcome of the analysis indicates a significant distance between the disputed text and the secure plays of Aeschylus, but also various interesting (micro-linguistic) ties of affinity with other authors, especially Sophocles and Euripides.




Machine Learning for Authorship Attribution and Cyber Forensics


Book Description

The book first explores the cybersecurity’s landscape and the inherent susceptibility of online communication system such as e-mail, chat conversation and social media in cybercrimes. Common sources and resources of digital crimes, their causes and effects together with the emerging threats for society are illustrated in this book. This book not only explores the growing needs of cybersecurity and digital forensics but also investigates relevant technologies and methods to meet the said needs. Knowledge discovery, machine learning and data analytics are explored for collecting cyber-intelligence and forensics evidence on cybercrimes. Online communication documents, which are the main source of cybercrimes are investigated from two perspectives: the crime and the criminal. AI and machine learning methods are applied to detect illegal and criminal activities such as bot distribution, drug trafficking and child pornography. Authorship analysis is applied to identify the potential suspects and their social linguistics characteristics. Deep learning together with frequent pattern mining and link mining techniques are applied to trace the potential collaborators of the identified criminals. Finally, the aim of the book is not only to investigate the crimes and identify the potential suspects but, as well, to collect solid and precise forensics evidence to prosecute the suspects in the court of law.




The SAGE Encyclopedia of Communication Research Methods


Book Description

Communication research is evolving and changing in a world of online journals, open-access, and new ways of obtaining data and conducting experiments via the Internet. Although there are generic encyclopedias describing basic social science research methodologies in general, until now there has been no comprehensive A-to-Z reference work exploring methods specific to communication and media studies. Our entries, authored by key figures in the field, focus on special considerations when applied specifically to communication research, accompanied by engaging examples from the literature of communication, journalism, and media studies. Entries cover every step of the research process, from the creative development of research topics and questions to literature reviews, selection of best methods (whether quantitative, qualitative, or mixed) for analyzing research results and publishing research findings, whether in traditional media or via new media outlets. In addition to expected entries covering the basics of theories and methods traditionally used in communication research, other entries discuss important trends influencing the future of that research, including contemporary practical issues students will face in communication professions, the influences of globalization on research, use of new recording technologies in fieldwork, and the challenges and opportunities related to studying online multi-media environments. Email, texting, cellphone video, and blogging are shown not only as topics of research but also as means of collecting and analyzing data. Still other entries delve into considerations of accountability, copyright, confidentiality, data ownership and security, privacy, and other aspects of conducting an ethical research program. Features: 652 signed entries are contained in an authoritative work spanning four volumes available in choice of electronic or print formats. Although organized A-to-Z, front matter includes a Reader’s Guide grouping entries thematically to help students interested in a specific aspect of communication research to more easily locate directly related entries. Back matter includes a Chronology of the development of the field of communication research; a Resource Guide to classic books, journals, and associations; a Glossary introducing the terminology of the field; and a detailed Index. Entries conclude with References/Further Readings and Cross-References to related entries to guide students further in their research journeys. The Index, Reader’s Guide themes, and Cross-References combine to provide robust search-and-browse in the e-version.




Confirmatory Factor Analysis for Applied Research, Second Edition


Book Description

This accessible book has established itself as the go-to resource on confirmatory factor analysis (CFA) for its emphasis on practical and conceptual aspects rather than mathematics or formulas. Detailed, worked-through examples drawn from psychology, management, and sociology studies illustrate the procedures, pitfalls, and extensions of CFA methodology. The text shows how to formulate, program, and interpret CFA models using popular latent variable software packages (LISREL, Mplus, EQS, SAS/CALIS); understand the similarities ...




Computational Linguistics and Intelligent Text Processing


Book Description

This two-volume set, consisting of LNCS 7816 and LNCS 7817, constitutes the thoroughly refereed proceedings of the 13th International Conference on Computer Linguistics and Intelligent Processing, CICLING 2013, held on Samos, Greece, in March 2013. The total of 91 contributions presented was carefully reviewed and selected for inclusion in the proceedings. The papers are organized in topical sections named: general techniques; lexical resources; morphology and tokenization; syntax and named entity recognition; word sense disambiguation and coreference resolution; semantics and discourse; sentiment, polarity, subjectivity, and opinion; machine translation and multilingualism; text mining, information extraction, and information retrieval; text summarization; stylometry and text simplification; and applications.




Scaling Impact


Book Description

Scaling Impact introduces a new and practical approach to scaling the positive impacts of research and innovation. Inspired by leading scientific and entrepreneurial innovators from across Africa, Asia, the Caribbean, Latin America, and the Middle East, this book presents a synthesis of unrivalled diversity and grounded ingenuity. The result is a different perspective on how to achieve impact that matters, and an important challenge to the predominant more-is-better paradigm of scaling. For organisations and individuals working to change the world for the better, scaling impact is a common goal and a well-founded aim. The world is changing rapidly, and seemingly intractable problems like environmental degradation or accelerating inequality press us to do better for each other and our environment as a global community. Challenges like these appear to demand a significant scale of action, and here the authors argue that a more creative and critical approach to scaling is both possible and essential. To encourage uptake and co-development, the authors present actionable principles that can help organisations and innovators design, manage, and evaluate scaling strategies. Scaling Impact is essential reading for development and innovation practitioners and professionals, but also for researchers, students, evaluators, and policymakers with a desire to spark meaningful change.




The Arden Research Handbook of Contemporary Shakespeare Criticism


Book Description

The Arden Research Handbook of Contemporary Shakespeare Criticism is a wide-ranging, authoritative guide to research on critical approaches to Shakespeare by an international team of leading scholars. It contains chapters on 20 specific critical practices, each grounded in analysis of a Shakespeare play. These practices range from foundational approaches including character studies, close reading and genre studies, through those that emerged in the 1970s and 1980s that challenged the preconceptions on which traditional liberal humanism is based, including feminism, cultural materialism and new historicism. Perspectives drawn from postcolonial, queer studies and critical race studies, besides more recent critical practices including presentism, ecofeminism and cognitive ethology all receive detailed treatment. In addition to its coverage of distinct critical approaches, the handbook contains various sections that provide non-specialists with practical help: an A–Z glossary of key terms and concepts, a chronology of major publications and events, an introduction to resources for study of the field and a substantial annotated bibliography.