Corpus Linguistics Beyond the Word


Book Description

This volume will be of particular interest to readers interested in expanding the applications of corpus linguistics techniques through new tools and approaches. The text includes selected papers from the Fifth North American Symposium, hosted by the Linguistics Department at Montclair State University in Montclair New Jersey in May 2004. The symposium papers represented several areas of corpus studies including language development, syntactic analysis, pragmatics and discourse, language change, register variation, corpus creation and annotation, and practical applications of corpus work, primarily in language teaching, but also in medical training and machine translation. A common thread through most of the papers was the use of corpora to study domains longer than the word. Not surprisingly, fully half of the papers deal with the computational tools and linguistic strategies needed to search for and analyze these longer spans of language while most of the remaining papers examine particular syntactic and rhetorical properties of one or more corpora.




Beyond Concordance Lines


Book Description

In over 30 years of data-driven learning (DDL) research, there has been a growing sophistication in the ways we collect, analyse, and put corpus data to use. This volume takes a three-fold perspective on DDL. It first looks at DDL and its role in informing language learning theory and how it might shed light on the language development process; secondly it addresses how DDL can help us characterise learner language and inform teaching accordingly, and thirdly it showcases practical applications for the use of DDL in classrooms. The contributors to this volume examine a variety of instructional settings and languages across the world. They reflect on theoretical, methodological and classroom implications using both novel and established language learning theories, natural language processing (NLP), longitudinal research designs, and a variety of language learning targets. The present volume is an invitation from some of the leading researchers in DDL to reflect on the research avenues that will define the field in the coming years.




Corpus Linguistics and Statistics with R


Book Description

This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.




Corpus Linguistics, Context and Culture


Book Description

Corpus Linguistics, Context and Culture demonstrates the potential of corpus linguistic methods for investigating language patterns across a range of contexts. Organised in three sections, the chapters range from detailed case studies on lexico-grammatical patterns to fundamental discussions of meaning as part of the ‘discourse, contexts and cultures’ theme. The final part on ‘learner contexts’ specifically emphasises the need for mixed-method approaches and the consideration of pedagogical implications for real world contexts. Beyond its contribution to current debates in the field, this edited volume indicates new directions in cross-disciplinary work.




Corpora in Applied Linguistics


Book Description




Pragmatics of Discourse


Book Description

Discourse is language as it occurs, in any form or context, beyond the speech act. It may be written or spoken, monological or dialogical, but there is always a communicative aim or purpose. The present volume provides systematic orientation in the vast field of studying discourse from a pragmatic perspective. It first gives an overview of a range of approaches developed for the analysis of discourse, including, among others, conversation analysis, systemic-functional analysis, genre analysis, critical discourse analysis, corpus-driven approaches and multimodal analysis. The focus is furthermore on functional units in discourse, such as discourse markers, moves, speech act sequences, discourse phases and silence. The final section of the volume examines discourse types and domains, providing a taxonomy of discourse types and focusing on a range of discourse domains, e.g. classroom discourse, medical discourse, legal discourse, electronic discourse. Each article surveys the current state of the art of the respective topic area while also presenting new research findings.




Doing Corpus Linguistics


Book Description

Doing Corpus Linguistics offers a practical step-by-step introduction to corpus linguistics, making use of widely available corpora and of a register analysis-based theoretical framework to provide students in Applied Linguistics and TESOL with the understanding and skills necessary to meaningfully analyze corpora and carry out successful corpus-based research. Divided into three parts – Introduction to Doing Corpus Linguistics and Register Analysis; Searches in Available Corpora; and Building Your Own Corpus, Analyzing Your Quantitative Results, and Making Sense of Data – the book emphasizes hands-on experience with performing language analysis research and in interpreting findings in a meaningful and engaging way. Readers are given multiple opportunities to analyze and apply language data by completing smaller tasks and corpus projects using publicly available corpora. The book also takes readers through the process of building a specialized corpus designed to answer a specific research question and provides detailed information on completing a final research project that includes both a written paper and an oral presentation of their specific research projects. Doing Corpus Linguistics provides students in applied linguistics and TESOL with the opportunity to gain proficiency in the technical and interpretive aspects of corpus research and to encourage them to participate in the growing field of corpus linguistics.




Corpus Linguistics and Sociolinguistics


Book Description

In Corpus Linguistics and Sociolinguistics, Beke Hansen analyses variation and change in the modal systems of three second-language varieties of English in Asia by taking a sociolinguistic approach to corpus data. Her study focuses on the modal and semi-modal verbs of strong obligation and necessity in Hong Kong English, Indian English, and Singapore English based on the relevant ICE component corpora. She adopts a typologically-informed perspective on variation in World Englishes by comparing the structures of the speakers’ first languages with the structures of the emergent varieties in the expression of epistemic modality. Beyond this, she analyses language change by constructing apparent-time scenarios to compensate for the lack of diachronic corpora in World Englishes.




Using Corpora in Discourse Analysis


Book Description

How can you carry out discourse analysis using corpus linguistics? What research questions should I ask? Which methods should you use and when? What is a collocational network or a key cluster? Introducing the major techniques, methods and tools for corpus-assisted analysis of discourse, this book answers these questions and more, showing readers how to best use corpora in their analyses of discourse. Using carefully tailored case studies, each chapter is devoted to a central technique, including frequency, concordancing and keywords, going step by step through the process of applying different analytical procedures. Introducing a wide range of different corpora, from holiday brochures to political debates, the book considers the key debates and latest advances in the field. Fully revised and updated, this new edition includes: - A new chapter on how to conduct research projects in corpus-based discourse analysis - Completely rewritten chapters on collocation and advanced techniques, using a corpus of jihadist propaganda texts and covering topics such as social media and visual analysis - Coverage of major tools, including CQPweb, AntConc, Sketch Engine and #LancsBox - Discussion of newer techniques including the derivation of lockwords and the comparison of multiple data sets for diachronic analysis With exercises, discussion questions and suggested further readings in each chapter, this book is an excellent guide to using corpus linguistics techniques to carry out discourse analysis.




Sociolinguistics and Corpus Linguistics


Book Description

This textbook introduces students to the ways in which techniques from corpus linguistics can be used to aid sociolinguistic research. Corpus linguistics shares with variationist sociolinguistics a quantitative approach to the study of variation or differences between populations. It may also complement qualitative traditions of enquiry such as interactional sociolinguistics.This text covers a range of different topics within sociolinguistics:*Analysing demographic variation*Comparing language use across different cultures*Examining language change over time*Studying transcripts of spoken interactions*Identifying attitudes or discourses.Written for undergraduate and postgraduate students of sociolinguistics, or corpus linguists who wish to use corpora to study social phenomena, this textbook examines how corpora can be drawn on to investigate synchronic variation, diachronic change and the construction of discourses. It refers to several classic corpus-based studies as well as the author's own research. Original analyses of a number of corpora including the British National Corpus, the Survey of English Dialects and the Brown family of corpora are complemented by a new corpus of written British English collected around 2006 for the purposes of writing the book.Techniques of analysis like concordancing, keywords and collocations are discussed, along with corpus annotation and statistical procedures such as chi-squared tests and clustering. Paul Baker takes a critical approach to using corpora in sociolinguistics, outlining the limitations of the approach as well as its advantages.