Fundamental Principles of Corpus Linguistics


Book Description

How might evidence of language use – writing and speech – be used as a way of studying language? Corpus linguistics is the study of linguistic data from a particular language or set of languages. It is a fast-moving approach to studying language, and there is still a degree of divergence in how research questions are approached using corpus data. This book uses a framework, based on the work of Karl Popper, to explore a number of fundamental issues in corpus linguistics. It critically evaluates how these issues are tackled, and proposes a set of best practices for future research. It spells out why using corpus data is valuable, what we can learn from using it, and how we may most effectively progress our understanding of language by using such data. It is essential reading for researchers and students of language in general, and of applied linguistics and English language in particular.




Perspectives on Corpus Linguistics


Book Description

Perspectives on Corpus Linguistics is a collection of interviews with fourteen well-known researchers in the field of linguistics. Each interview consists of a set of ten questions: the first seven are common to all contributors while the last three are connected to the research experience of each guest. In the general questions, the invited scholars explore (sometimes controversial) topics such as the concept of representativeness, the role of intuition and the status of Corpus Linguistics. In the specific questions, they provide a thorough discussion of materials and methods in corpus research as well as theoretical and applied perspectives on the use of corpora in language studies. The volume should be of interest to all those, whether experts or novices, who want to learn about corpus linguistics and carry out research in this fascinating and growing area.




A Practical Handbook of Corpus Linguistics


Book Description

This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV and V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.




Statistics in Corpus Linguistics


Book Description

A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.




Corpus linguistics


Book Description

Corpora are used widely in linguistics, but not always wisely. This book attempts to frame corpus linguistics systematically as a variant of the observational method. The first part introduces the reader to the general methodological discussions surrounding corpus data as well as the practice of doing corpus linguistics, including issues such as the scientific research cycle, research design, extraction of corpus data and statistical evaluation. The second part consists of a number of case studies from the main areas of corpus linguistics (lexical associations, morphology, grammar, text and metaphor), surveying the range of issues studied in corpus linguistics while at the same time showing how they fit into the methodology outlined in the first part.




Corpus Linguistics: An Introduction


Book Description

Corpus Linguistics: An Introduction will appeal to a wide spectrum of scholars, researchers, and particularly to students of linguistics. It offers guidelines for the creation and usage of corpora in the form of empirical language databases with direct functional and theoretical interpretation of a natural language. Drawn from original research and written in an accessible language and style, this book will create avenues for further advancements in mainstream and applied linguistics and language technology.




Developing Linguistic Corpora


Book Description

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.




Corpus Linguistics


Book Description

Corpus Linguistics has quickly established itself as the leading undergraduate course book in the subject. This second edition takes full account of the latest developments in the rapidly changing field, making this the most up-to-date and comprehensive textbook available. It gives a step-by-step introduction to what a corpus is, how corpora are constructed, and what can be done with them. Each chapter ends with a section of study questions that contain practical corpus-based exercises.

* Designed for student use, with all technical terms explained in the text and referenced further in a Glossary
* Examples are taken from existing corpora; detailed case study chapter included
* Contains end-of-chapter summaries, study questions and suggestions for further reading
* Updated reviews of new studies, areas that have recently come to prominence and new directions in corpus encoding and annotation standards
* Detailed coverage of multilingual corpus construction and use
* An in-depth historical review of computer-based corpora from the 1940s to the present day
* Helpful appendices include answers to the study questions, up-to-date information on where corpora can be found, and the latest software for corpus research.

"[An] important addition to the fast growing literature in corpus linguistics... should be read by anyone interested in utilization of large-scale corpora in linguistic research." Studies in the Linguistic Sciences, on the first edition




The Cambridge Handbook of Learner Corpus Research


Book Description

The origins of learner corpus research go back to the late 1980s when large electronic collections of written or spoken data started to be collected from foreign/second language learners, with a view to advancing our understanding of the mechanisms of second language acquisition and developing tailor-made pedagogical tools. Engaging with the interdisciplinary nature of this fast-growing field, The Cambridge Handbook of Learner Corpus Research explores the diverse and extensive applications of learner corpora, with 27 chapters written by internationally renowned experts. This comprehensive work is a vital resource for students, teachers and researchers, offering fresh perspectives and a unique overview of the field. With representative studies in each chapter which provide an essential guide on how to conduct learner corpus research in a wide range of areas, this work is a cutting-edge account of learner corpus collection, annotation, methodology, theory, analysis and applications.




History, Features, and Typology of Language Corpora


Book Description

This book discusses key issues in corpus linguistics, such as the definition of a corpus, the primary features of a corpus, and the utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. Any discussion of corpus generation must refer to the parallel translation corpus, which the authors thoroughly address here, with a focus on Indian language corpora and English. The web-text corpus, a new development in corpus linguistics, is also discussed with detailed reference to Indian web-text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora. This book has several important features: it discusses many technical issues of the field in a lucid manner; contains extensive new diagrams and charts for easy comprehension; and presents discussions in simplified English to cater to the needs of non-native English readers. This is an important resource authored by academics who have many years of experience teaching and researching corpus linguistics. Its focus on Indian languages and on English corpora makes it applicable to students of graduate and postgraduate courses in applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.