State of the Art in Computational Morphology


Book Description

From the point of view of computational linguistics, morphological resources are the basis for all higher-level applications. This is especially true for languages with a rich morphology, such as German or Finnish. A morphology component should thus be capable of analyzing single word forms as well as whole corpora. For many practical applications, not only morphological analysis, but also generation is required, i.e., the production of surfaces corresponding to speci?c categories. Apart from uses in computational linguistics, there are also numerous practical - plications that either require morphological analysis and generation or that can greatly bene?t from it, for example, in text processing, user interfaces, or information - trieval. These applications have speci?c requirements for morphological components, including requirements from software engineering, such as programming interfaces or robustness. In 1994, the First Morpholympics took place at the University of Erlangen- Nuremberg, a competition between several systems for the analysis and generation of German word forms. Eight systems participated in the First Morpholympics; the conference proceedings [1] thus give a very good overview of the state of the art in computational morphologyfor German as of 1994.




Computational Approaches to Morphology and Syntax


Book Description

The book will appeal to scholars and advanced students of morphology, syntax, computational linguistics and natural language processing (NLP). It provides a critical and practical guide to computational techniques for handling morphological and syntactic phenomena, showing how these techniques have been used and modified in practice. The authors discuss the nature and uses of syntactic parsers and examine the problems and opportunities of parsing algorithms for finite-state, context-free and various context-sensitive grammars. They relate approaches for describing syntax and morphology to formal mechanisms and algorithms, and present well-motivated approaches for augmenting grammars with weights or probabilities.




Computational Morphology


Book Description

Previous work on morphology has largely tended either to avoid precise computational details or to ignore linguistic generality. Computational Morphologyis the first book to present an integrated set of techniques for the rigorous description of morphological phenomena in English and similar languages. By taking account of all facets of morphological analysis, it provides a linguistically general and computationally practical dictionary system for use within an English parsing program. The authors covermorphographemics (variations in spelling as words are built from their component morphemes),morphotactics (the ways that different classes of morphemes can combine, and the types of words that result), andlexical redundancy (patterns of similarity and regularity among the lexical entries for words). They propose a precise rule-notation for each of these areas of linguistic description and present the algorithms for using these rules computationally to manipulate dictionary information. These mechanisms have been implemented in practical and publicly available software, which is described in detail, and appendixes contain a large number of computer-tested sets of rules and lexical entries for English. Graeme D. Ritchie is a Senior Lecturer in the Department of Artificial Intelligence at the University of Edinburgh, where Alan W. Black is currently a research student. Graham J. Russell is a Research Fellow at ISSCO (Institut Dalle Molle pour les etudes semantiques et cognitives) in Geneva, and Stephen G. Pulman is a Lecturer in the University of Cambridge Computer Laboratory and Director of SRI International's Cambridge Computer Science Research Centre.




Information and Communication Technology for Development for Africa


Book Description

This book constitutes the proceedings of the First International Conference on Information and Communication Technology for Development for Africa, ICT4DA 2017, held in Bahir Dar, Ethiopia, in September 2017. The 31 revised full papers presented were carefully reviewed and selected from 72 submissions. The papers address the impact of ICT in fostering economic development in Africa. In detail they cover the following topics: e-services, natural language processing, intelligent systems, mobile and wireless communication, privacy and security.




The Oxford Handbook of Computational Linguistics


Book Description

This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.




Finite-State Computational Morphology


Book Description

This handbook provides a comprehensive account of current research on the finite-state morphology of Georgian and enables the reader to enter quickly into Georgian morphosyntax and its computational processing. It combines linguistic analysis with application of finite-state technology to processing of the language. The book opens with the author’s synoptic overview of the main lines of research, covers the properties of the word and its components, then moves up to the description of Georgian morphosyntax and the morphological analyzer and generator of Georgian.The book comprises three chapters and accompanying appendices. The aim of the first chapter is to describe the morphosyntactic structure of Georgian, focusing on differences between Old and Modern Georgian. The second chapter focuses on the application of finite-state technology to the processing of Georgian and on the compilation of a tokenizer, a morphological analyzer and a generator for Georgian. The third chapter discusses the testing and evaluation of the analyzer’s output and the compilation of the Georgian Language Corpus (GLC), which is now accessible online and freely available to the research community.Since the development of the analyzer, the field of computational linguistics has advanced in several ways, but the majority of new approaches to language processing has not been tested on Georgian. So, the organization of the book makes it easier to handle new developments from both a theoretical and practical viewpoint.The book includes a detailed index and references as well as the full list of morphosyntactic tags. It will be of interest and practical use to a wide range of linguists and advanced students interested in Georgian morphosyntax generally as well as to researchers working in the field of computational linguistics and focusing on how languages with complicated morphosyntax can be handled through finite-state approaches.




Computational Nonlinear Morphology


Book Description

By the late 1970s phonologists, and later morphologists, had departed from a linear approach for describing morphophonological operations to a nonlinear one. Computational models, however, remain faithful to the linear model, making it very difficult, if not impossible, to implement the morphology of languages whose morphology is nonconcatanative. Computational Nonlinear Morphology aims at presenting a computational system that counters the development in linguistics. It provides a detailed computational analysis of the complex morphophonological phenomena found in Semitic languages based on linguistically motivated models.




Systems and Frameworks for Computational Morphology


Book Description

This book constitutes the refereed proceedings of the Second International Workshop on Systems and Frameworks for Computational Morphology, SFCM 2011, held in Zurich, Switzerland in August 2011. The eight revised full papers presented together with one invited paper were carefully reviewed and selected from 13 submissions. The papers address various topics in computational morphology and the relevance of morphology to computational linguistics more broadly.




Arabic Computational Morphology


Book Description

This is the first comprehensive overview of computational approaches to Arabic morphology. The subtitle aims to reflect that widely different computational approaches to the Arabic morphological system have been proposed. The book provides a showcase of the most advanced language technologies applied to one of the most vexing problems in linguistics. It covers knowledge-based and empirical-based approaches.




Language Engineering for Lesser-studied Languages


Book Description

"Technologies enabling computers to process specific languages facilitate economic and political progress of societies where these languages are spoken. Development of methods and systems for language processing is therefore a worthy goal for national governments as well as for business entities and scientific and educational institutions in every country in the world. As work on systems and resources for the 'lower-density' languages becomes more widespread, an important question is how to leverage the results and experience accumulated by the field of computational linguistics for the major languages in the development of resources and systems for lower-density languages. This issue has been at the core of the NATO Advanced Studies Institute on language technologies for middle- and low-density languages held in Georgia in October 2007. This publication is a collection - of publication-oriented versions - of the lectures presented there and is a useful source of knowledge about many core facets of modern computational-linguistic work. By the same token, it can serve as a reference source for people interested in learning about strategies that are best suited for developing computational-linguistic capabilities for lesser-studied languages - either 'from scratch' or using components developed for other languages. The book should also be quite useful in teaching practical system- and resource-building topics in computational linguistics."--Site Web de l'éditeur.