Named Entities for Computational Linguistics


Book Description

One of the challenges brought on by the digital revolution of the recent decades is the mechanism by which information carried by texts can be extracted in order to access its contents. The processing of named entities remains a very active area of research, which plays a central role in natural language processing technologies and their applications. Named entity recognition, a tool used in information extraction tasks, focuses on recognizing small pieces of information in order to extract information on a larger scale. The authors use written text and examples in French and English to present the necessary elements for the readers to familiarize themselves with the main concepts related to named entities and to discover the problems associated with them, as well as the methods available in practice for solving these issues.




Named Entities


Book Description

Named Entities provides critical information for many NLP applications. Named Entity recognition and classification (NERC) in text is recognized as one of the important sub-tasks of Information Extraction (IE). The seven papers in this volume cover various interesting and informative aspects of NERC research. Nadeau & Sekine provide an extensive survey of past NERC technologies, which should be a very useful resource for new researchers in this field. Smith & Osborne describe a machine learning model which tries to solve the over-fitting problem. Mazur & Dale tackle a common problem of NE and conjunction; as conjunctions are often a part of NEs or appear close to NEs, this is an important practical problem. A further three papers describe analyses and implementations of NERC for different languages: Spanish (Galicia-Haro & Gelbukh), Bengali (Ekbal, Naskar & Bandyopadhyay), and Serbian (Vitas, Krstev & Maurel). Finally, Steinberger & Pouliquen report on a real WEB application where multilingual NERC technology is used to identify occurrences of people, locations and organizations in newspapers in different languages. The contributions to this volume were previously published in Lingvisticae Investigationes 30:1 (2007).




Named Entities for Computational Linguistics


Book Description

One of the challenges brought on by the digital revolution of the recent decades is the mechanism by which information carried by texts can be extracted in order to access its contents. The processing of named entities remains a very active area of research, which plays a central role in natural language processing technologies and their applications. Named entity recognition, a tool used in information extraction tasks, focuses on recognizing small pieces of information in order to extract information on a larger scale. The authors use written text and examples in French and English to present the necessary elements for the readers to familiarize themselves with the main concepts related to named entities and to discover the problems associated with them, as well as the methods available in practice for solving these issues.




Clinical Chinese Named Entity Recognition in Natural Language Processing


Book Description

This book introduces how to enhance the context capture ability of the model, improve the position information perception ability of the pretrained models, and identify and denoise the unlabeled entities. The Chinese medical named entity recognition is an important branch of the intelligent medicine, which is beneficial to mine the information hidden in medical texts and provide the medical entity information for clinical medical decision-making and medical classification. Researchers, engineers and post-graduate students in the fields of medicine management and software engineering.




Time Expression and Named Entity Recognition


Book Description

This book presents a synthetic analysis about the characteristics of time expressions and named entities, and some proposed methods for leveraging these characteristics to recognize time expressions and named entities from unstructured text. For modeling these two kinds of entities, the authors propose a rule-based method that introduces an abstracted layer between the specific words and the rules, and two learning-based methods that define a new type of tagging scheme based on the constituents of the entities, different from conventional position-based tagging schemes that cause the problem of inconsistent tag assignment. The authors also find that the length-frequency of entities follows a family of power-law distributions. This finding opens a door, complementary to the rank-frequency of words, to understand our communicative system in terms of language use.




Named Entity Recognition


Book Description

What Is Named Entity Recognition Named-entity recognition, or NER, is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, and so on. Other names for this subtask include (named) entity identification, entity chunking, and entity extraction. Named-entity recognition is also known as named-entity identification. How You Will Benefit (I) Insights, and validations about the following topics: Chapter 1: Named-entity recognition Chapter 2: Natural language processing Chapter 3: Information extraction Chapter 4: Named entity Chapter 5: Relationship extraction Chapter 6: Outline of natural language processing Chapter 7: Entity linking Chapter 8: Apache cTAKES Chapter 9: SpaCy Chapter 10: Zero-shot learning (II) Answering the public top questions about named entity recognition. (III) Real world examples for the usage of named entity recognition in many fields. (IV) 17 appendices to explain, briefly, 266 emerging technologies in each industry to have 360-degree full understanding of named entity recognition' technologies. Who This Book Is For Professionals, undergraduate and graduate students, enthusiasts, hobbyists, and those who want to go beyond basic knowledge or information for any kind of named entity recognition.




Language Technologies for the Challenges of the Digital Age


Book Description

This open access volume constitutes the refereed proceedings of the 27th biennial conference of the German Society for Computational Linguistics and Language Technology, GSCL 2017, held in Berlin, Germany, in September 2017, which focused on language technologies for the digital age. The 16 full papers and 10 short papers included in the proceedings were carefully selected from 36 submissions. Topics covered include text processing of the German language, online media and online content, semantics and reasoning, sentiment analysis, and semantic web description languages.




Semantic Processing of Legal Texts


Book Description

Recent years have seen much new research on the interface between artificial intelligence and law, looking at issues such as automated legal reasoning. This collection of papers represents the state of the art in this fascinating and highly topical field.







Language Engineering for Lesser-studied Languages


Book Description

"Technologies enabling computers to process specific languages facilitate economic and political progress of societies where these languages are spoken. Development of methods and systems for language processing is therefore a worthy goal for national governments as well as for business entities and scientific and educational institutions in every country in the world. As work on systems and resources for the 'lower-density' languages becomes more widespread, an important question is how to leverage the results and experience accumulated by the field of computational linguistics for the major languages in the development of resources and systems for lower-density languages. This issue has been at the core of the NATO Advanced Studies Institute on language technologies for middle- and low-density languages held in Georgia in October 2007. This publication is a collection - of publication-oriented versions - of the lectures presented there and is a useful source of knowledge about many core facets of modern computational-linguistic work. By the same token, it can serve as a reference source for people interested in learning about strategies that are best suited for developing computational-linguistic capabilities for lesser-studied languages - either 'from scratch' or using components developed for other languages. The book should also be quite useful in teaching practical system- and resource-building topics in computational linguistics."--Site Web de l'éditeur.