A Corpus-Driven Approach to Language Contact


Book Description

This book proposes a corpus-driven approach to language contact based on the study of endangered languages. Drawing on variationist and language contact frameworks, it presents an analysis of spoken corpora from Europe and Mexico using a combination of criteria. The aim of this approach is to establish patterns of multilingual speech prevailing in different communities and allow for crosslinguistic comparison.







In Search of Basic Units of Spoken Language


Book Description

What is the best way to analyze spontaneous spoken language? In their search for the basic units of spoken language the authors of this volume opt for a corpus-driven approach. They share a strong conviction that prosodic structure is essential for the study of spoken discourse and each bring their own theoretical and practical experience to the table. In the first part of the book they segment spoken material from a range of different languages (Russian, Hebrew, Central Pomo (an indigenous language from California), French, Japanese, Italian, and Brazilian Portuguese). In the second part of the book each author analyzes the same two spoken English samples, but looking at them from different perspectives, using different methods of analysis as reflected in their respective analyses in Part I. This approach allows for common tendencies of segmentation to emerge, both prosodic and segmental.




Developments in English


Book Description

The history of the English language is a vast and diverse area of research. In this volume, a team of leading historians of English come together to analyse 'real' language, drawing on corpus data to shed new light on long-established issues and debates in the field. Combining synchronic and diachronic analysis, the chapters address the major issues in corpus linguistics – methodological, theoretical and applied – and place special focus on the use of electronic resources in the research of English and the wider field of digital humanities. Topics covered include polemical articles on the optimal use of corpus linguistic methods, macro-level patterns of text and discourse organisation, and micro-features such as interjections and hesitators. Covering Englishes from the past and present, this book is designed specifically for graduate students and researchers working in fields of corpus linguistics, the history of the English language, and historical linguistics.




Pattern Grammar


Book Description

This book describes an approach to lexis and grammar based on the concept of phraseology and of language patterning arising from work on large corpora. The notion of 'pattern' as a systematic way of dealing with the interface between lexis and grammar was used in Collins Cobuild English Dictionary (1995) and in the two books in the Collins Cobuild Grammar Patterns series (1996; 1998). This volume describes the research that led to these publications, and explores the theoretical and practical implications of the research. The first chapter sets the work in the context of work on phraseology. The next two chapters give several examples of patterns and how they are identified. Chapters 4 and 5 discuss and exemplify the association of pattern and meaning. Chapters 6, 7 and 8 relate the concept of pattern to traditional approaches to grammar and to discourse. Chapter 9 summarizes the book and adds to the theoretical discussion, as well as indicating the applications of this approach to language teaching. The volume is intended to contribute to the current debate concerning how corpora challenge existing linguistic theories, and as such will be of interest to researchers in the fields of grammar, lexis, discourse and corpus linguistics. It is written in an accessible style, however, and will be equally suitable for students taking courses in those areas.




Progressives, Patterns, Pedagogy


Book Description

This book presents a large-scale corpus-driven study of progressives in 'real' English and 'school' English, combining an analysis of general linguistic interest with a pedagogically motivated one. A systematic comparative analysis of more than 10,000 progressive forms taken from the largest existing corpora of spoken British English and from a small corpus of EFL textbook texts highlights numerous differences between actual language use and textbook language concerning the distribution of progressives, their preferred contexts, favoured functions, and typical lexical-grammatical patterns. On the basis of these differences, a number of pedagogical implications are derived, the integration of which then leads to a first draft of an innovative concept of teaching progressives - a concept which responds to three key criteria in pedagogical description: typicality, authenticity, and communicative utility. The analysis also demonstrates that many existing accounts of the progressive are inappropriate in several respects and that not enough attention is being paid to lexical-grammatical relations.! Winner of the "Wissenschaftspreis Hannover 2006" for outstanding research monographs !




Corpus Linguistics at Work


Book Description

The book offers a combined discussion of the main theoretical, methodological and application issues related to corpus work. Thus, starting from the definition of what is a corpus and why reading a corpus calls for a different methodology from reading a text, the underlying assumptions behind corpus work are discussed. The two main approaches to corpus work are discussed as the “corpus-based” and the “corpus-driven” approach and the theoretical positions underlying them explored in detail. The book adopts and exemplifies the parameters of the corpus-driven approach and posits a new unit of linguistic description defined systematically in the light of corpus evidence. The applications where the corpus-driven approach is exemplified are language teaching and contrastive linguistics. Alternating between practical examples and theoretical evaluation, the reader is led step-by-step to a detailed understanding of the issues involved in corpus work and, at the same time, tempted to explore for himself some of the major applications where a corpus-driven methodology can reveal unprecedented insights into linguistic patterning.




The Routledge Handbook of Corpus Linguistics


Book Description

The Routledge Handbook of Corpus Linguistics provides a timely overview of a dynamic and rapidly growing area with a widely applied methodology. Through the electronic analysis of large bodies of text, corpus linguistics demonstrates and supports linguistic statements and assumptions. In recent years it has seen an ever-widening application in a variety of fields: computational linguistics, discourse analysis, forensic linguistics, pragmatics and translation studies. Bringing together experts in the key areas of development and change, the handbook is structured around six themes which take the reader through building and designing a corpus to using a corpus to study literature and translation. A comprehensive introduction covers the historical development of the field and its growing influence and application in other areas. Structured around five headings for ease of reference, each contribution includes further reading sections with three to five key texts highlighted and annotated to facilitate further exploration of the topics. The Routledge Handbook of Corpus Linguistics is the ideal resource for advanced undergraduates and postgraduates.




Applications of Pattern-driven Methods in Corpus Linguistics


Book Description

The use of corpora has conventionally been envisioned as being either corpus-based or corpus-driven. While the formal definition of the latter term has been widely accepted since it was established by Tognini-Bonelli (2001), it is often applied to studies that do not, in fact, fullfil the fundamental requirement of a theory-neutral starting point. This volume proposes the term pattern-driven as a more precise alternative. The chapters illustrate a variety of methods that fall under this broad methodology, such as the extraction of lexical bundles, POS-grams and semantic frames, and demonstrate how these approaches can uncover new understandings of both synchronic and diachronic linguistic phenomena.




Corpus Linguistics


Book Description

Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.