Principle-Based Parsing


Book Description




Principle-Based Parsing


Book Description




Natural Language Parsing and Linguistic Theories


Book Description

presupposition fails, we now give a short introduction into Unification Grammar. Since all implementations discussed in this volume use PROLOG (with the exception of BlockjHaugeneder), we felt that it would also be useful to explain the difference between unification in PROLOG and in UG. After the introduction to UG we briefly summarize the main arguments for using linguistic theories in natural language processing. We conclude with a short summary of the contributions to this volume. UNIFICATION GRAMMAR 3 Feature Structures or Complex Categories. Unification Grammar was developed by Martin Kay (Kay 1979). Martin Kay wanted to give a precise defmition (and implementation) of the notion of 'feature'. Linguists use features at nearly all levels of linguistic description. In phonetics, for instance, the phoneme b is usually described with the features 'bilabial', 'voiced' and 'nasal'. In the case of b the first two features get the value +, the third (nasal) gets the value -. Feature value pairs in phonology are normally represented as a matrix. bilabial: + voiced: + I nasal: - [Feature matrix for b.] In syntax features are used, for example, to distinguish different noun classes. The Latin noun 'murus' would be characterized by the following feature-value pairs: gender: masculin, number: singular, case: nominative, pred: murus. Besides a matrix representation one frequently fmds a graph representation for feature value pairs. The edges of the graph are labelled by features. The leaves denote the value of a feature.




Parsing with Principles and Classes of Information


Book Description

Parsing with Principles and Classes of Information presents a parser based on current principle-based linguistic theories for English. It argues that differences in the kind of information being computed, whether lexical, structural or syntactic, play a crucial role in the mapping from grammatical theory to parsing algorithms. The direct encoding of homogeneous classes of information has computational and cognitive advantages, which are discussed in detail. Phrase structure is built by using a fast algorithm and compact reference tables. A quantified comparison of different compilation methods shows that lexical and structural information are most compactly represented by separate tables. This finding is reconciled to evidence on the resolution of lexical ambiguity, as an approach to the modularization of information. The same design is applied to the efficient computation of long- distance dependencies. Incremental parsing using bottom-up tabular algorithms is discussed in detail. Finally, locality restrictions are calculated by a parametric algorithm. Students of linguistics, parsing and psycholinguistics will find this book a useful resource on issues related to the implementation of current linguistic theories, using computational and cognitive plausible algorithms.




The Theory and Practice of Discourse Parsing and Summarization


Book Description

Most discourse researchers assume that full semantic understanding is necessary to derive the discourse structure of texts. This book documents an attempt to construct and use automatic and non-semantic computational structures for text summarization.




Parsing Techniques


Book Description

This second edition of Grune and Jacobs’ brilliant work presents new developments and discoveries that have been made in the field. Parsing, also referred to as syntax analysis, has been and continues to be an essential part of computer science and linguistics. Parsing techniques have grown considerably in importance, both in computer science, ie. advanced compilers often use general CF parsers, and computational linguistics where such parsers are the only option. They are used in a variety of software products including Web browsers, interpreters in computer devices, and data compression programs; and they are used extensively in linguistics.




The Mathematics of Syntactic Structure


Book Description

The architecture of the human language faculty has been one of the main foci of the linguistic research of the last half century. This branch of linguistics, broadly known as Generative Grammar, is concerned with the formulation of explanatory formal accounts of linguistic phenomena with the ulterior goal of gaining insight into the properties of the 'language organ'. The series comprises high quality monographs and collected volumes that address such issues. The topics in this series range from phonology to semantics, from syntax to information structure, from mathematical linguistics to studies of the lexicon.




Grammatical Competence and Parsing Performance


Book Description

How does a parser, a device that imposes an analysis on a string of symbols so that they can be interpreted, work? More specifically, how does the parser in the human cognitive mechanism operate? Using a wide range of empirical data concerning human natural language processing, Bradley Pritchett demonstrates that parsing performance depends on grammatical competence, not, as many have thought, on perception, computation, or semantics. Pritchett critiques the major performance-based parsing models to argue that the principles of grammar drive the parser; the parser, furthermore, is the apparatus that tries to enforce the conditions of the grammar at every point in the processing of a sentence. In comparing garden path phenomena, those instances when the parser fails on the first reading of a sentence and must reanalyze it, with occasions when the parser successfully functions the first time around, Pritchett makes a convincing case for a grammar-derived parsing theory.




Industrial Parsing of Software Manuals


Book Description

The task of language engineering is to develop the technology for building computer systems which can perform useful linguistic tasks such as machine assisted translation, text retrieval, message classification and document summarisation. Such systems often require the use of a parser which can extract specific types of grammatical data from pre-defined classes of input text. There are many parsers already available for use in language engineering systems. However, many different linguistic formalisms and parsing algorithms are employed. Grammatical coverage varies, as does the nature of the syntactic information extracted. Direct comparison between systems is difficult because each is likely to have been evaluated using different test criteria. In this volume, eight different parsers are applied to the same task, that of analysing a set of sentences derived from software instruction manuals. Each parser is presented in a separate chapter. Evaluation of performance is carried out using a standard set of criteria with the results being presented in a set of tables which have the same format for each system. Three additional chapters provide further analysis of the results as well as discussing possible approaches to the standardisation of parse tree data. Five parse trees are provided for each system in an appendix, allowing further direct comparison between systems by the reader. The book will be of interest to students, researchers and practitioners in the areas of computational linguistics, computer science, information retrieval, language engineering, linguistics and machine assisted translation.




Current Issues in Parsing Technology


Book Description