Automatic Syntactic Analysis Based on Selectional Preferences


Book Description

This book describes effective methods for automatically analyzing a sentence, based on the syntactic and semantic characteristics of the elements that form it. To tackle ambiguities, the authors use selectional preferences (SP), which measure how well two words fit together semantically in a sentence. Today, many disciplines require automatic text analysis based on the syntactic and semantic characteristics of language and as such several techniques for parsing sentences have been proposed. Which is better? In this book the authors begin with simple heuristics before moving on to more complex methods that identify nouns and verbs and then aggregate modifiers, and lastly discuss methods that can handle complex subordinate and relative clauses. During this process, several ambiguities arise. SP are commonly determined on the basis of the association between a pair of words. However, in many cases, SP depend on more words. For example, something (such as grass) may be edible, depending on who is eating it (a cow?). Moreover, things such as popcorn are usually eaten at the movies, and not in a restaurant. The authors deal with these phenomena from different points of view.




Linguistic Preferences


Book Description

Preferences form a central concept of human categorization. They play an important role in disciplines ranging from psychology to economics and philosophy, from evolutionary biology to artificial intelligence, and, notably for this volume, in linguistics. This volume provides both theoretical and empirical contributions from linguistics to this interdisciplinary field of research.




Evaluation of Text Summaries Based on Linear Optimization of Content Metrics


Book Description

This book provides a comprehensive discussion and new insights about linear optimization of content metrics to improve the automatic Evaluation of Text Summaries (ETS). The reader is first introduced to the background and fundamentals of the ETS. Afterward, state-of-the-art evaluation methods that require or do not require human references are described. Based on how linear optimization has improved other natural language processing tasks, we developed a new methodology based on genetic algorithms that optimize content metrics linearly. Under this optimization, we propose SECO-SEVA as an automatic evaluation metric available for research purposes. Finally, the text finishes with a consideration of directions in which automatic evaluation could be improved in the future. The information provided in this book is self-contained. Therefore, the reader does not require an exhaustive background in this area. Moreover, we consider this book the first one that deals with the ETS in depth.




Systemic Functional Linguistics


Book Description

This user-friendly student guide is the essential resource for all those engaged in studying systemic functional linguistics (SFL). Assuming no prior knowledge, this guide is divided into nine chapters which can be read independently of one another and used for purposes of reference. The reading section maps out and mediates the key SFL literature. The application guides show how SFL has been and can be applied to various domains, from translation to healthcare communication. The term guides demystify the core terminology and the vocabulary guides aid readers in dealing with the most commonly used terms in text analysis. Systemic Functional Linguistics is an invaluable guidebook for all those studying functional grammar and SFL within linguistics, applied linguistics and related courses.




Computer Science, Technology And Application - Proceedings Of The 2016 International Conference (Csta 2016)


Book Description

The 2016 International Conference on Computer Science, Technology and Application (CSTA2016) were held in Changsha, China on March 18-20, 2016. The main objective of the joint conference is to provide a platform for researchers, academics and industrial professionals to present their research findings in the fields of computer science and technology.The CSTA2016 received more than 150 submissions, but only 67 articles were selected to be included in this proceedings, which are organized into 6 chapters; covering Image and Signal Processing, Computer Network, Algorithm and Simulation, Data Mining and Cloud Computing, Computer Systems and Application, Mathematics and Management.




Cluster Analysis for Corpus Linguistics


Book Description

The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.




Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications


Book Description

The 14th Iberoamerican Congress on Pattern Recognition (CIARP 2009, C- gresoIberoAmericanodeReconocimientodePatrones)formedthelatestofanow longseriesofsuccessfulmeetingsarrangedbytherapidlygrowingIberoamerican pattern recognition community. The conference was held in Guadalajara, Jalisco, Mexico and organized by the Mexican Association for Computer Vision, Neural Computing and Robotics (MACVNR). It was sponsodred by MACVNR and ?ve other Iberoamerican PR societies. CIARP 2009 was like the previous conferences in the series supported by the International Association for Pattern Recognition (IAPR). CIARP 2009 attracted participants from all over the world presenting sta- of-the-artresearchon mathematical methods and computing techniques for p- tern recognition, computer vision, image and signal analysis, robot vision, and speech recognition, as well as on a wide range of their applications. This time the conference attracted participants from 23 countries,9 in Ibe- america, and 14 from other parts of the world. The total number of submitted papers was 187, and after a serious review process 108 papers were accepted, all of them with a scienti?c quality above overall mean rating. Sixty-four were selected as oral presentations and 44 as posters. Since 2008 the conference is almost single track, and therefore there was no real grading in quality between oral and poster papers. As an acknowledgment that CIARP has established itself as a high-quality conference, its proceedings appear in the Lecture Notes in Computer Science series. Moreover, its visibility is further enhanced by a selection of a set of papers that will be published in a special issue of the journal Pattern Recognition Letters.




Corpus Linguistics. Volume 2


Book Description

In vielen Bereichen der Linguistik werden Textkorpora, Sprachkorpora oder multimodale Korpora heute als empirische Basis verwendet. Aufbauend auf Methoden des 19. Jahrhunderts haben sich dabei mit dem Aufkommen von elektronischen Korpora seit den 1940ern neue Standards für linguistische Annotation und Vorverarbeitung sowie für qualitative und quantitative Untersuchungen entwickelt. Das Handbuch bietet einen umfassenden Überblick über Geschichte, Methoden und Anwendungen der Korpuslinguistik. Die einzelnen Überblicks- und Spezialartikel sind von Experten und Expertinnen der jeweiligen Gebiete geschrieben. Dabei wird auf klare und umfassende Darstellung, eine gute Vernetzung zwischen den Artikel und weiterführende Hinweise Wert gelegt.




Comprehension Processes in Reading


Book Description

Comprehension Processes in Reading addresses the interrelationship among several areas relevant to understanding how people comprehend text. The contributors focus on the on-line processes associated with text understanding rather than simply with the product of that comprehension -- what people remember from reading. Presenting the latest theories and research findings from a distinguished group of contributors, Comprehension Processes in Reading is divided into four major sections. Each section, concluding with a commentary chapter, discusses a different aspect of reader understanding or dysfunction such as individual word comprehension, sentence parsing, text comprehension, and comprehension failures and dyslexia .




Text, Speech and Dialogue


Book Description

This book constitutes the refereed proceedings of the 6th International Conference on Text, Speech and Dialogue, TSD 2003, held in Ceské Budejovice, Czech Republic in September 2003.The 60 revised full papers presented together with 2 invited contributions were carefully reviewed and selected from 121 submissions. The papers present a wealth of state-of-the-art research and development results in the field of natural language processing with an emphasis on text, speech, and spoken language ranging from theoretical and methodological issues to applications in various fields, such as web information retrieval, the semantic web, algorithmic learning, and dialogue systems.