French through Corpora


Book Description

This book offers a series of studies that come together in their concern for furthering an understanding of the French language, of its uses, its forms, its variation, and its acquisition. Other than contributing to a general understanding of French, this book also addresses the use of corpora for the study of language and the links between tools, methods, analyses and applications. What data are used, and how? What are the underlying theoretical and/or methodological considerations? How have these changed our way of formulating linguistic descriptions? What are the implications for descriptive accounts of French today? What are the applications of corpus studies? These questions (and many more) are addressed here in a series of scholarly contributions grouped into four broad areas: diachrony, syntax, sociolinguistics, and the learning and teaching of French. The book provides an up-to-date and challenging account of French for researchers in French linguistics. It will also be suitable for use on post-graduate and some undergraduate courses, providing useful information not only for students and teachers of French, but also for language and linguistics students in general. Many of the issues addressed are not specific to the French language and can be accounted for within a more general area of reflection within linguistics on changing relations between data, theory and methods.




French Through Corpora


Book Description

This book offers a series of studies that come together in their concern for furthering an understanding of the French language, of its uses, its forms, its variation, and its acquisition. Other than contributing to a general understanding of French, this book also addresses the use of corpora for the study of language and the links between tools, methods, analyses and applications. What data are used, and how? What are the underlying theoretical and/or methodological considerations? How have these changed our way of formulating linguistic descriptions? What are the implications for descriptive accounts of French today? What are the applications of corpus studies? These questions (and many more) are addressed here in a series of scholarly contributions grouped into four broad areas: diachrony, syntax, sociolinguistics, and the learning and teaching of French. The book provides an up-to-date and challenging account of French for researchers in French linguistics. It will also be suitable for use on post-graduate and some undergraduate courses, providing useful information not only for students and teachers of French, but also for language and linguistics students in general. Many of the issues addressed are not specific to the French language and can be accounted for within a more general area of reflection within linguistics on changing relations between data, theory and methods.




Corpus-based Perspectives in Linguistics


Book Description

UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the UBLI second workshop on Corpus Linguistics – Research Domain, which was held on September 14, 2006. The first part consisting of eleven presentations to this workshop shows a wide range of subjects within the area of corpus-based research, such as dictionary, linguistic atlas, dialect, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part of this volume comprises ten additional contributions to both written and spoken corpora by the members and research assistants of UBLI.




Parallel Corpora for Contrastive and Translation Studies


Book Description

This volume assesses the state of the art of parallel corpus research as a whole, reporting on advances in both recent developments of parallel corpora – with some particular references to comparable corpora as well– and in ways of exploiting them for a variety of purposes. The first part of the book is devoted to new roles that parallel corpora can and should assume in translation studies and in contrastive linguistics, to the usefulness and usability of parallel corpora, and to advances in parallel corpus alignment, annotation and retrieval. There follows an up-to-date presentation of a number of parallel corpus projects currently being carried out in Europe, some of them multimodal, with certain chapters illustrating case studies developed on the basis of the corpora at hand. In most of these chapters, attention is paid to specific technical issues of corpus building. The third part of the book reflects on specific applications and on the creation of bilingual resources from parallel corpora. This volume will be welcomed by scholars, postgraduate and PhD students in the fields of contrastive linguistics, translation studies, lexicography, language teaching and learning, machine translation, and natural language processing.




Using Corpora to Explore Linguistic Variation


Book Description

Using Corpora to Explore Linguistic Variation illustrates the ways in which linguistic variation can be explored through corpus-based investigation. Two major kinds of research questions are considered: variation in the use of a particular linguistic feature, and variation across dialects or registers. Part 1: “Exploring variation in the use of linguistic features” focuses on the study of specific words, expressions, or grammatical constructions, to study variation in the use of a particular linguistic feature. Part 2: “Exploring dialect and register variation” describes salient characteristics of dialects or registers and the patterns of variation across varieties. Part 3: “Exploring Historical Variation” applies these same two major perspectives to historical variation. One recurring theme is the extent to which linguistic variation depends on register differences, reflecting the importance of register as a key methodological and thematic concern in current corpus linguistic research.




French Applied Linguistics


Book Description

Focuses on French applied linguistics




C-ORAL-ROM


Book Description

The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages, French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each corpus is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc. and the transcriptions show changes of speaker. Speech acts are tagged according to the evidence of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structure and text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.




Spoken Corpora and Linguistic Studies


Book Description

The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysis of speech. Four distinct sections (spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure) give the book the structure in which the authors present innovative methodologies that focus on the compilation of third generation spoken corpora; multilevel spoken corpora annotation and its functions; and additionally a debate is initiated about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files. The web site can be found at the following address: DOI: 10.1075/scl.61.media




The Routledge Handbook of Second Language Acquisition and Corpora


Book Description

The Routledge Handbook of Second Language Acquisition and Corpora is a state-of-the-art collection of cutting-edge scholarship at the intersection of second language acquisition and learner corpus research. It draws on data-driven, statistical analysis to outline the background, methods, and outcomes of language learning, with a range of global experts providing detailed guidelines and findings. The volume is organized into five sections: Methodological and theoretical contributions to the study of learner language using corpora – setting the scene Key aspects in corpus design, annotation, and analysis for SLA Corpora in SLA theory and practice SLA constructs and corpora Future directions This is a ground-breaking collection of essays offering incisive and essential reading for anyone with an interest in second language acquisition, learner corpus research, and applied linguistics.




Seeing through Multilingual Corpora


Book Description

Through electronic corpora we can observe patterns which we were unaware of before or only vaguely glimpsed. The availability of multilingual corpora has led to a renewal of contrastive studies. We gain new insight into similarities and differences between languages, at the same time as the characteristics of each language are brought into relief. The present book focuses on the work in building and using the English-Norwegian Parallel Corpus and the Oslo Multilingual Corpus. Case studies are reported on lexis, grammar, and discourse. A concluding chapter sums up problems and prospects of corpus-based contrastive studies, including applications in lexicography, translator training, and foreign-language teaching. Though the main focus is on English and Norwegian, the approach should be of interest more generally for corpus-based contrastive research and for language studies in general. Seeing through corpora we can see through language.