Spoken Corpora and Linguistic Studies


Book Description

The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysis of speech. Four distinct sections (spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure) give the book the structure in which the authors present innovative methodologies that focus on the compilation of third generation spoken corpora; multilevel spoken corpora annotation and its functions; and additionally a debate is initiated about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files. The web site can be found at the following address: DOI: 10.1075/scl.61.media




Spoken Corpora in Applied Linguistics


Book Description

This volume explores the opportunities that spoken corpora offer and the challenges of research with such corpora. The use and applications of spoken corpora are discussed from the perspective of both language analysis and language pedagogy. Twelve chapters written by corpus linguists analyse an extensive number of spoken corpora based on the oral production of speakers as varied as language learners, users of English as Lingua Franca, native speakers, or speakers of English in academic contexts. This book also highlights the growing emphasis on the use of corpus-based research by examining the implications of corpus findings in educational settings.




Spoken Language Corpus and Linguistic Informatics


Book Description

Printbegrænsninger: Der kan printes 10 sider ad gangen og max. 40 sider pr. session




Discourse Patterns in Spoken and Written Corpora


Book Description

This book brings together a number of empirical studies that use corpora to study discourse patterns in speech and writing. It explores new trends in the area of text and discourse characterized by the alliance between text linguistics and areas such as corpus linguistics, genre analysis, literary stylistics and cross-linguistic studies. The contributions to the volume show how established corpora can be used to ask a number of new questions about the interface between speech and writing, the relation between grammar and discourse, academic discourse, cohesive markers, stylistic devices such as metaphor, deixis and non-verbal communication. The corpora used for text-analysis can also be tailor-made for the study of particular genres such as journal article abstracts, lectures, e-mailing list messages, headlines and titles. A recent development is to bring in contrastive data from bilingual corpora to show what is language-specific in the organization of the text.




Exploring Spoken English Learner Language Using Corpora


Book Description

This book presents a corpus-based study of spoken learner language produced by university-level ESL students in the classroom. Using contemporary theories as a guide and employing cutting-edge corpus analysis tools and methods, the authors analyse a variety of learner speech to offer many new insights into the nature and characteristics of the spoken language of college ESL learners. Focusing on types of speech that are rarely examined, this original work makes a significant contribution to the study and understanding of ESL spoken language at university level. It will appeal to students and scholars of applied linguistics, corpus linguistics, second language acquisition and discourse analysis.




Spoken Corpus Linguistics


Book Description

In this book, Adolphs and Carter explore key approaches to work in spoken corpus linguistics. The book discusses some of the pioneering challenges faced in designing, building and utilising insights from the analysis of spoken corpora, arguing that, even though writing is heavily privileged in corpus research, the spoken language can reveal patterns of language use that are both different and distinctive and that this has important implications for the way in which language is described, for the study of human communication and for the field of applied linguistics as a whole. Spoken Corpus Linguistics is divided into two main parts. The first part sets the scene by discussing traditional and new approaches to monomodal spoken corpus analysis, with a focus on discourse organisation and conversational interaction and with particular attention to forms of language such as discourse markers and multi-word units, areas of language not conventionally described but which are argued to be of importance to spoken language description and to spoken language learning and teaching research within the field of applied linguistics. The second part of the book moves into the multimodal domain and focuses on alignments between language and gesture in a spoken corpus, with particular reference to gestural movements of the head and the hand and to the different ways in which prosody might be used to enhance communication. A brief final chapter discusses new developments in the area of spoken corpus research, including the relationship between language and context, emerging research methods as well as discussing possible shifts in scope and emphasis in spoken corpus research in the future.




Corpus-based Perspectives in Linguistics


Book Description

UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the UBLI second workshop on Corpus Linguistics – Research Domain, which was held on September 14, 2006. The first part consisting of eleven presentations to this workshop shows a wide range of subjects within the area of corpus-based research, such as dictionary, linguistic atlas, dialect, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part of this volume comprises ten additional contributions to both written and spoken corpora by the members and research assistants of UBLI.




C-ORAL-ROM


Book Description

The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages, French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each corpus is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc. and the transcriptions show changes of speaker. Speech acts are tagged according to the evidence of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structure and text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.




Developing Linguistic Corpora


Book Description

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.




In Search of Basic Units of Spoken Language


Book Description

What is the best way to analyze spontaneous spoken language? In their search for the basic units of spoken language the authors of this volume opt for a corpus-driven approach. They share a strong conviction that prosodic structure is essential for the study of spoken discourse and each bring their own theoretical and practical experience to the table. In the first part of the book they segment spoken material from a range of different languages (Russian, Hebrew, Central Pomo (an indigenous language from California), French, Japanese, Italian, and Brazilian Portuguese). In the second part of the book each author analyzes the same two spoken English samples, but looking at them from different perspectives, using different methods of analysis as reflected in their respective analyses in Part I. This approach allows for common tendencies of segmentation to emerge, both prosodic and segmental.