Corpus Linguistics and Linguistic Theory


Book Description

From being the occupation of a marginal (and frequently marginalised) group of researchers, the linguistic analysis of machine-readable language corpora has moved to the mainstream of research on the English language. In this process an impressive body of results has accumulated which, over and above the intrinsic descriptive interest it holds for students of the English language, forces a major and systematic re-thinking of foundational issues in linguistic theory. Corpus linguistics and linguistic theory was accordingly chosen as the motto for the twentieth annual gathering of ICAME, the International Computer Archive of Modern/ Medieval English, which was hosted by the University of Freiburg (Germany) in 1999. The present volume, which presents selected papers from this conference, thus builds on previous successful work in the computer-aided description of English and at the same time represents an attempt at stock-taking and methodological reflection in a linguistic subdiscipline that has clearly come of age.Contributions cover all levels of linguistic description - from phonology/ prosody, through grammar and semantics to discourse-analytical issues such as genre or gender-specific linguistic usage. They are united by a desire to further the dialogue between the corpus-linguistic community and researchers working in other traditions. Thereby, the atmosphere ranges from undisguised skepticism (as expressed by Noam Chomsky in an interview which is part of the opening contribution by Bas Aarts) to empirically substantiated optimism (as, for example, in Bernadette Vine's significantly titled contribution Getting things done).




Corpus Linguistics and Linguistic Theory


Book Description

From being the occupation of a marginal (and frequently marginalised) group of researchers, the linguistic analysis of machine-readable language corpora has moved to the mainstream of research on the English language. In this process an impressive body of results has accumulated which, over and above the intrinsic descriptive interest it holds for students of the English language, forces a major and systematic re-thinking of foundational issues in linguistic theory. Corpus linguistics and linguistic theory was accordingly chosen as the motto for the twentieth annual gathering of ICAME, the International Computer Archive of Modern/ Medieval English, which was hosted by the University of Freiburg (Germany) in 1999. The present volume, which presents selected papers from this conference, thus builds on previous successful work in the computer-aided description of English and at the same time represents an attempt at stock-taking and methodological reflection in a linguistic subdiscipline that has clearly come of age. Contributions cover all levels of linguistic description - from phonology/ prosody, through grammar and semantics to discourse-analytical issues such as genre or gender-specific linguistic usage. They are united by a desire to further the dialogue between the corpus-linguistic community and researchers working in other traditions. Thereby, the atmosphere ranges from undisguised skepticism (as expressed by Noam Chomsky in an interview which is part of the opening contribution by Bas Aarts) to empirically substantiated optimism (as, for example, in Bernadette Vine's significantly titled contribution Getting things done).




Corpus Linguistics


Book Description

Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.







English Corpus Linguistics


Book Description

English Corpus Linguistics is a step-by-step guide to creating and analyzing linguistic corpora. It begins with a discussion of the role that corpus linguistics plays in linguistic theory, demonstrating that corpora have proven to be very useful resources for linguists who believe that their theories and descriptions of English should be based on real rather than contrived data. Charles F. Meyer goes on to describe how to plan the creation of a corpus, how to collect and computerize data for inclusion in a corpus, how to annotate the data that are collected, and how to conduct a corpus analysis of a completed corpus. The book concludes with an overview of the challenges that corpus linguists face to make both the creation and analysis of corpora much easier undertakings than they currently are. Clearly organized and accessibly written, this book will appeal to students of linguistics and English language.







Web As Corpus


Book Description

Is the internet a suitable linguistic corpus? How can we use it in corpus techniques? What are the special properties that we need to be aware of? This book answers those questions. The Web is an exponentially increasing source of language and corpus linguistics data. From gigantic static information resources to user-generated Web 2.0 content, the breadth and depth of information available is breathtaking – and bewildering. This book explores the theory and practice of the “web as corpus”. It looks at the most common tools and methods used and features a plethora of examples based on the author's own teaching experience. This book also bridges the gap between studies in computational linguistics, which emphasize technical aspects, and studies in corpus linguistics, which focus on the implications for language theory and use.




From Ælfric to the New York Times


Book Description

The twenty papers of this volume - published to honour Gunnel Tottie - are of interest to everyone concerned with the study of the English language. The collection is a convincing argument for an approach to language studies based on the analysis of computerized corpora. Though this is not an introduction to the field but a series of highly specialized studies, readers get a good overview of the work being done at present in English computer corpus studies. English corpus linguistics, though basically concerned with the study of varieties of English, goes far beyond the simple ordering and counting of large numbers of examples but is deeply concerned with linguistic theory - based on real language data. The volume includes sections on corpora of written and spoken present-day English, historical corpora, contrastive corpora, and on the application of corpus studies to teaching purposes.




Linguistic Evidence


Book Description

The renaissance of corpus linguistics and promising developments in experimental linguistic techniques in recent years have led to a remarkable revival of interest in issues of the empirical base of linguistic theory in general, and the status of different kinds of linguistic evidence in particular. Consensus is growing (a) that even so-called primary data (from introspection as well as authentic language production) are inherently complex performance data only indirectly reflecting the subject of linguistic theory, (b) that for an appropriate foundation of linguistic theories evidence from different sources such as introspective data, corpus data, data from (psycho-)linguistic experiments, historical and diachronic data, typological data, neurolinguistic data and language learning data are not only welcome but also often necessary. It is in particular by contrasting evidence from different sources with respect to particular research questions that we may gain a deeper understanding of the status and quality of the individual types of linguistic evidence on the one hand, and of their mutual relationship and respective weight on the other. The present volume is a collection of (selected) papers presented at the conference on 'Linguistic Evidence' in Tübingen 2004, which was explicitly devoted to the above issues. All of them address these issues in relation to specific linguistic research problems, thereby helping to establish a better understanding of the nature of linguistic evidence in particularly insightful ways.




Corpus Linguistics


Book Description

An investigation into the way people use language in speech and writing, this volume introduces the corpus-based approach, which is based on analysis of large databases of real language examples stored on computer.