Text Variability Measures in Corpus Design for Setswana Lexicography


Book Description

This book is about the design of a Setswana corpus for lexicography. While various corpora have been compiled and a variety of corpus-based research has been attempted in African languages, no effort has been made towards corpus design. Additionally, although missionaries, grammarians and linguists have analysed the Setswana language extensively since the 1800s, none of this research addresses corpus design; most of it is grammatical study of the language. Recent corpus research in African languages has generally concerned the use of corpora for compiling dictionaries, and little of it deals with corpus design. Pioneers of this kind of corpus research in African languages are Prinsloo and De Schryver (1999), De Schryver and Prinsloo (2000 and 2001) and Gouws and Prinsloo (2005). Because research in corpus design is lacking, particularly in African languages, this book attempts to fill that gap, especially for Setswana. It is hoped that the findings of this study will inspire similar designs in other languages comparable to Setswana. We explore corpus design by measuring a variety of text types for lexical richness at comparable token points. The study asks whether a corpus compiled for lexicography must comprise a variety of texts drawn from different text types, or whether information of the same quality for lexicographic purposes could equally be extracted from a corpus with a single text type. This study therefore determines whether linguistic variability is crucial in corpus design for lexicography.







Native and Non-Native Teacher Talk in the EFL Classroom


Book Description

Native and Non-Native Teacher Talk in the EFL Classroom explores and compares the linguistic features of native and non-native English teacher talk with the aid of corpus linguistics. Setting aside the wide range of audio and video materials available, the EFL teacher is in many instances the main model of English to which students are exposed in secondary-level education. The aim of this book is to work towards a framework for the language that teachers of English need to be proficient in, based on an empirical study of language used in the ELT classroom by both native and expert non-native users. Presenting a corpus-informed treatment of the precise linguistic features used by EFL teachers within the framework of their most common teaching functions, this book:

• Relates directly to the teacher talk of secondary-level EFL teachers;
• Combines quantitative and qualitative approaches to data analysis;
• Looks into pedagogical implications for ELT and proposes a flexible language development model based on evidence from the teacher training classroom;
• Provides a corpus-based repertoire of language for the classroom which is of relevance to native and non-native student-teachers and practising teachers.

Highlighting the need for much greater awareness of the impact of language use in both learning and teaching, this book is a major resource for advanced students and researchers of TESOL, classroom discourse, corpus linguistics, ELT, English for professional purposes, and teaching placement preparation.




Corpus Linguistics Around the World


Book Description

This volume contains a selection of the papers delivered at the Corpus Linguistics 2003 conference, held at Lancaster University in March 2003. The papers selected address a wide range of world languages: Basque, Chinese, Danish, Dutch, English, French, German, Maltese, Russian, Spanish, and Slovene. Both synchronic and diachronic studies are included, as well as studies of learner language. In addition to mainstream linguistic analyses of phonetics, vocabulary, syntax, semantics, and rhetoric, application areas covered in the volume include financial forecasting, cross-cultural research, corpus processing, and language teaching.




Principles and Practice of South African Lexicography


Book Description

Principles and Practice of South African Lexicography is directed at experts in the field of practical and theoretical lexicography in South Africa, applying the general theory of lexicography to the South African lexicographic environment. The authors of this book are leaders in the field of South African lexicography and active participants in the international lexicographic arena, publishing regularly in national and international journals and giving papers at international conferences and workshops.




The International Corpus of Learner English


Book Description

This manual accompanies the CD-ROM or online resource containing Version 2 of the ICLE. The ICLE is the computerized databank of the Centre for English Corpus Linguistics at the Université catholique de Louvain. The Centre focuses on the development and use of learner corpora (electronic collections of authentic foreign language data). The ICLE is the result of over ten years of collaborative activity between a number of universities internationally. It contains over 3 million words of writing by learners of English from 21 different mother tongue backgrounds. The writing in the corpus has been contributed by advanced learners of English as a foreign language and is made up of 21 distinct sub-corpora, each containing one language variety (E2French, E2German, E2Swedish, etc.).




The Dynamics of Language


Book Description

Linguistics – the close study of language and languages – is an indispensable foundation for all forms of knowledge. The African continent is blessed with hundreds of languages which act as local repositories of culture and interaction. South Africa alone has eleven official languages, plus Sign Language, many heritage languages, and new languages of global movements and migration. Part of the linguist’s business is to document, record and affirm languages and diversity. Applied linguists use their training to understand and enhance the role of language in education and upliftment, and the opportunities and challenges of new technologies of communication. The International Congress of Linguists meets every five years to reflect the development of the field and 2018 is the first time that the congress is being held in Africa. This book is a collection of the plenary and focus papers presented at the conference and thus represents current thinking in the major branches of language study as represented by leading local and international scholars. The papers discuss the history of languages, their structure, acquisition, diversity and use. At the same time due regard is paid to the African continent in connection with its linguistic diversity, multilingualism and educational and societal concerns. The Congress is meant to affirm the value of the languages of Africa, of languages and Linguistics in general, as well as to inspire and equip younger scholars to undertake advanced research into language in its many facets.




Encyclopedia of Language and Education


Book Description

This volume covers basic fields of Sociolinguistics and the Sociology of Language; both macro- and micro-domains are presented in the fields of language teaching, minority languages, and problems of language acquisition as well as practical issues of curricula planning and textbook writing. This book addresses students and scholars in the social sciences as well as public officials in education, language teachers and textbook writers.




The Routledge Handbook of Language Revitalization


Book Description

The Routledge Handbook of Language Revitalization is the first comprehensive overview of the language revitalization movement, from the Arctic to the Amazon and across continents. Featuring 47 contributions from a global range of top scholars in the field, the handbook is divided into two parts, the first of which expands on language revitalization issues of theory and practice while the second covers regional perspectives in an effort to globalize and decolonize the field. The collection examines critical issues in language revitalization, including: language rights, language and well-being, and language policy; language in educational institutions and in the home; new methodologies and venues for language learning; and the roles of documentation, literacies, and the internet. The volume also contains chapters on the kinds of language that are less often researched such as the revitalization of music, of whistled languages and sign languages, and how languages change when they are being revitalized. The Routledge Handbook of Language Revitalization is the ideal resource for graduate students and researchers working in linguistic anthropology and language revitalization and endangerment.




Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig


Book Description

This bilingual book provides a detailed overview of the project to construct a National Corpus of Contemporary Welsh (CorCenCC), addressing the conceptual and methodological challenges faced when developing language corpora for minoritised languages. A conceptual framework is presented for the user-driven design that underpinned the CorCenCC project, along with a detailed blueprint that can function as a scaffold for other researchers embarking on projects of this nature. This book will be of value to those working in language teaching, learning and assessment, language policy and planning, translation, corpus linguistics and language technology, and to anyone with an interest in Welsh and other minoritised languages. Mae'r llyfr dwyieithog hwn yn rhoi trosolwg manwl o'r prosiect i greu Corpws Cenedlaethol Cymraeg Cyfoes (CorCenCC), ac yn mynd i'r afael â'r heriau cysyniadol a methodolegol a wynebir wrth ddatblygu corpora iaith ar gyfer ieithoedd lleiafrifoledig. Cyflwynir fframwaith cysyniadol ar gyfer y cynllun wedi'i yrru gan ddefnyddwyr sy'n greiddiol i brosiect CorCenCC, ynghyd â glasbrint manwl a all weithredu fel sgaffald i ymchwilwyr eraill sy'n dechrau ar brosiectau o'r fath. Bydd y llyfr hwn o werth i'r rhai sy'n gweithio ym meysydd addysgu, dysgu ac asesu ieithoedd, polisi iaith a chynllunio ieithyddol, cyfieithu, ieithyddiaeth gorpws a thechnoleg iaith, ac unrhyw un â diddordeb yn y Gymraeg ac ieithoedd lleiafrifoledig eraill.