Nordisk Sprogteknologi






Nordisk Sprogteknologi 2004






Systems and Frameworks for Computational Morphology


Book Description

This book constitutes the refereed proceedings of the Third International Workshop on Systems and Frameworks for Computational Morphology, SFCM 2013, held in Berlin, Germany, in September 2013. The 7 full papers were carefully reviewed and selected from 15 submissions and are complemented by an invited talk. The papers discuss recent advances in the field of computational morphology.




Shall We Play the Festschrift Game?


Book Description

Few people can be said to have influenced and impressed researchers in as many disparate areas and language-geographic fields as Lauri Carlson, as the present Festschrift shows. His insight, acute linguistic sensitivity and linguistic rationality have spawned findings and research work in many areas, from non-standard etymology to hardcore formal linguistics, not forgetting computational areas such as parsing, terminological databases, and, last but not least, machine translation. In addition to his renowned and widely acknowledged insights into tense and aspect and their relationship with nominal quantification, and his ground-breaking work on dialog using game-theoretic machinery, Lauri has in the last fifteen years, as Professor of Language Theory and Translation Technology, contributed immensely to areas such as translation, terminology and general applications of computational linguistics. The three editors of the present volume completed their doctoral studies under Lauri’s supervision, and wish with this volume to pay tribute to his supervision and to his influence in matters associated with research and scientific, linguistic and philosophical inquiry, as well as to his humanity and friendship.




Exploring Newspaper Language


Book Description

This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus, a large monitor corpus of contemporary Norwegian compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification, and the extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system flexible enough to handle very large corpora efficiently. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.




Annotation, exploitation and evaluation of parallel corpora: TC3 I


Book Description

Exchange between the translation studies and computational linguistics communities has traditionally not been very intense. Among other things, this is reflected in their different views on parallel corpora. While computational linguistics does not always pay strict attention to the translation direction (e.g. when translation rules are extracted from (sub)corpora which actually consist only of translations), translation studies is, among other things, concerned with exactly comparing source and target texts (e.g. to draw conclusions on interference and standardization effects). However, there has recently been more exchange between the two fields, especially when it comes to the annotation of parallel corpora. This special issue brings together the different research perspectives. Its contributions show, from both perspectives, how the communities have come to interact in recent years.




Language Engineering for Lesser-studied Languages


Book Description

"Technologies enabling computers to process specific languages facilitate economic and political progress of societies where these languages are spoken. Development of methods and systems for language processing is therefore a worthy goal for national governments as well as for business entities and scientific and educational institutions in every country in the world. As work on systems and resources for the 'lower-density' languages becomes more widespread, an important question is how to leverage the results and experience accumulated by the field of computational linguistics for the major languages in the development of resources and systems for lower-density languages. This issue has been at the core of the NATO Advanced Studies Institute on language technologies for middle- and low-density languages held in Georgia in October 2007. This publication is a collection - of publication-oriented versions - of the lectures presented there and is a useful source of knowledge about many core facets of modern computational-linguistic work. By the same token, it can serve as a reference source for people interested in learning about strategies that are best suited for developing computational-linguistic capabilities for lesser-studied languages - either 'from scratch' or using components developed for other languages. The book should also be quite useful in teaching practical system- and resource-building topics in computational linguistics."--Publisher's website.




Language Technology for Cultural Heritage


Book Description

The digital age has had a profound effect on our cultural heritage and the academic research that studies it. Staggering amounts of objects, many of them of a textual nature, are being digitised to make them more readily accessible to both experts and laypersons. Besides a vast potential for more effective and efficient preservation, management, and presentation, digitisation offers opportunities to work with cultural heritage data in ways that were never feasible or even imagined. To explore and exploit these possibilities, an interdisciplinary approach is needed, bringing together experts from cultural heritage, the social sciences and humanities on the one hand, and information technology on the other. Due to a prevalence of textual data in these domains, language technology has a crucial role to play in this endeavour. Language technology can break through the "Google barrier" by offering the potential to analyse texts at advanced levels, extracting information and knowledge at the level of the humanities or social sciences researcher, who wants to know about the who, what, where, and when, but also the how and the why. At the same time cultural heritage data poses considerable challenges for existing language technology: technology aimed at "generic" language has to face such disparate problems as historical language variation, OCR digitisation errors, and near-extinct academic expertise. This book is primarily intended for researchers in information technology and language processing who would like to receive a state-of-the-art overview of the whole breadth of the new and vibrant field of language technology for cultural heritage and its associated academic research in the humanities and social sciences. Researchers working in the target domains of cultural heritage, the social sciences and humanities will also find this book useful, as it provides an overview of how language technology can help them with their information needs. 
The book covers applications ranging from pre-processing and data cleaning, to the adaptation and compilation of linguistic resources, to personalisation, narrative analysis, visualisation and retrieval.




Advances in Natural Language Processing


Book Description

This book constitutes the refereed proceedings of the 5th International Conference on Natural Language Processing, FinTAL 2006, held in Turku, Finland in August 2006. The book presents 72 revised full papers together with 1 invited talk and the extended abstracts of 2 invited keynote addresses. The papers address all current issues in computational linguistics and monolingual and multilingual intelligent language processing - theory, methods and applications.