Language Technology for Cultural Heritage


Book Description

The digital age has had a profound effect on our cultural heritage and the academic research that studies it. Staggering numbers of objects, many of them of a textual nature, are being digitised to make them more readily accessible to both experts and laypersons. Besides a vast potential for more effective and efficient preservation, management, and presentation, digitisation offers opportunities to work with cultural heritage data in ways that were never feasible or even imagined. To explore and exploit these possibilities, an interdisciplinary approach is needed, bringing together experts from cultural heritage, the social sciences and humanities on the one hand, and information technology on the other. Due to a prevalence of textual data in these domains, language technology has a crucial role to play in this endeavour. Language technology can break through the "Google barrier" by offering the potential to analyse texts at advanced levels, extracting information and knowledge at the level of the humanities or social sciences researcher, who wants to know about the who, what, where, and when, but also the how and the why. At the same time, cultural heritage data poses considerable challenges for existing language technology: technology aimed at "generic" language has to face such disparate problems as historical language variation, OCR digitisation errors, and near-extinct academic expertise.
This book is primarily intended for researchers in information technology and language processing who would like to receive a state-of-the-art overview of the whole breadth of the new and vibrant field of language technology for cultural heritage and its associated academic research in the humanities and social sciences. Researchers working in the target domains of cultural heritage, the social sciences and humanities will also find this book useful, as it provides an overview of how language technology can help them with their information needs.
The book covers applications ranging from pre-processing and data cleaning, to the adaptation and compilation of linguistic resources, to personalisation, narrative analysis, visualisation and retrieval.




Handbook of Research on Technologies and Cultural Heritage: Applications and Environments


Book Description

Handbook of Research on Technologies and Cultural Heritage: Applications and Environments covers the many important uses of information communication technology in enhancing the experience at cultural environments. From museums to archaeological sites, festivals, and artistic events, and even government institutions and public buildings, information communication technology is revolutionizing the way the public participates at and with these cultural sites. This reference source provides both a thorough exploration of this revolution and a springboard for future discoveries.




Data Analytics for Cultural Heritage


Book Description

This book considers the challenges related to the effective implementation of artificial intelligence (AI) and machine learning (ML) technologies in the cultural heritage digitization process. Particular focus is placed on improvements to the data acquisition stage, as well as the data enrichment and curation stages, using advanced artificial intelligence techniques and tools. An emphasis is placed on recent applications related to deep learning for visual recognition, generative models, natural language processing, and super resolution. The book is a valuable reference for researchers working in the multidisciplinary field of cultural heritage and AI, as well as professional experts in the art and culture domains, such as museums, libraries, and historic sites and buildings. It reports on techniques and methods that leverage AI and machine learning and their impact on the digitization of cultural heritage; addresses challenges of improving data acquisition, enrichment, and management processes; and highlights contributions from international researchers from diverse fields and subject areas.




Science and Technology for the Conservation of Cultural Heritage


Book Description

From 2nd to 5th October 2012, an International Congress on Science and Technology for the Conservation of Cultural Heritage was held in Santiago de Compostela, Spain, organized by the Universidade of Santiago de Compostela on behalf of the TechnoHeritage Network. The congress was attended by some 160 participants from 10 countries, who presented a tot




Artificial Intelligence for Cultural Heritage


Book Description

Artificial Intelligence and Cultural Heritage represent a combination that for several years has interested both scientific and cultural institutions regarding the potential of possible interactions and aggregations among the various players in these areas. This volume defines roles and provides connections where research and new technologies can suggest routes and competitive solutions that integrate tourism and culture with business and the market. The volume is multidisciplinary, presenting and discussing a variety of new ideas, resulting from the integration of different scientific approaches. The papers brought together here deal with topics including the representation of cultural history, semantic digital archives, the use of analytic tools to support visitor interpretation, augmented reality, and robotics. As such, this book represents the detailed investigation of methodological and applicative aspects that the continued proliferation of computer applications in the cultural heritage field demands.




VR Technologies in Cultural Heritage


Book Description

This open access book constitutes the refereed proceedings of the First International Conference on VR Technologies in Cultural Heritage, VRTCH 2018, held in Brasov, Romania in May 2018. The 13 revised full papers along with the 5 short papers presented were carefully reviewed and selected from 21 submissions. The papers of this volume are organized in topical sections on data acquisition and modelling, visualization methods / audio, sensors and actuators, data management, restoration and digitization, cultural tourism.




Language Technologies for the Challenges of the Digital Age


Book Description

This open access volume constitutes the refereed proceedings of the 27th biennial conference of the German Society for Computational Linguistics and Language Technology, GSCL 2017, held in Berlin, Germany, in September 2017, which focused on language technologies for the digital age. The 16 full papers and 10 short papers included in the proceedings were carefully selected from 36 submissions. Topics covered include text processing of the German language, online media and online content, semantics and reasoning, sentiment analysis, and semantic web description languages.




Natural Language Processing for Historical Texts


Book Description

More and more historical texts are becoming available in digital form. Digitization of paper documents is motivated by the aim of preserving cultural heritage and making it more accessible, both to laypeople and scholars. As digital images cannot be searched for text, digitization projects increasingly strive to create digital text, which can be searched and otherwise automatically processed, in addition to facsimiles. Indeed, the emerging field of digital humanities heavily relies on the availability of digital text for its studies. Together with the increasing availability of historical texts in digital form, there is a growing interest in applying natural language processing (NLP) methods and tools to historical texts. However, the specific linguistic properties of historical texts -- the lack of standardized orthography, in particular -- pose special challenges for NLP. This book aims to give an introduction to NLP for historical texts and an overview of the state of the art in this field. The book starts with an overview of methods for the acquisition of historical texts (scanning and OCR), discusses text encoding and annotation schemes, and presents examples of corpora of historical texts in a variety of languages. The book then discusses specific methods, such as creating part-of-speech taggers for historical languages or handling spelling variation. A final chapter analyzes the relationship between NLP and the digital humanities. Certain recently emerging textual genres, such as SMS, social media, and chat messages, or newsgroup and forum postings share a number of properties with historical texts, for example, nonstandard orthography and grammar, and profuse use of abbreviations. The methods and techniques required for the effective processing of historical texts are thus also of interest for research in other domains. 
Table of Contents: Introduction / NLP and Digital Humanities / Spelling in Historical Texts / Acquiring Historical Texts / Text Encoding and Annotation Schemes / Handling Spelling Variation / NLP Tools for Historical Languages / Historical Corpora / Conclusion / Bibliography




Natural Language Processing and Information Systems


Book Description

The 15th International Conference on Applications of Natural Language to Information Systems (NLDB 2010) took place during June 23–25 in Cardiff (UK). Since the first edition in 1995, the NLDB conference has been aiming at bringing together researchers, people working in industry and potential users interested in various applications of natural language in the database and information system area. However, in order to reflect the growing importance of accessing information from a diverse collection of sources (Web, Databases, Sensors, Cloud) in an equally wide range of contexts (including mobile and tethered), the theme of the 15th International Conference on Applications of Natural Language to Information Systems 2010 was "Communicating with Anything, Anywhere in Natural Language." Natural languages and databases are core components in the development of information systems. Natural language processing (NLP) techniques may substantially enhance most phases of the information system lifecycle, starting with requirement analysis, specification and validation, and going up to conflict resolution, result processing and presentation. Furthermore, natural language-based query languages and user interfaces facilitate the access to information for all and allow for new paradigms in the usage of computerized services. Hot topics such as information retrieval (IR), software engineering applications, hidden Markov models, natural language interfaces and semantic networks and graphs imply a complete fusion of databases, IR and NLP techniques.




Natural Language Processing of Semitic Languages


Book Description

Research in Natural Language Processing (NLP) has rapidly advanced in recent years, resulting in exciting algorithms for sophisticated processing of text and speech in various languages. Much of this work focuses on English; in this book we address another group of interesting and challenging languages for NLP research: the Semitic languages. The Semitic group of languages includes Arabic (206 million native speakers), Amharic (27 million), Hebrew (7 million), Tigrinya (6.7 million), Syriac (1 million) and Maltese (419 thousand). Semitic languages exhibit unique morphological processes, challenging syntactic constructions and various other phenomena that are less prevalent in other natural languages. These challenges call for unique solutions, many of which are described in this book. The 13 chapters presented in this book bring together leading scientists from several universities and research institutes worldwide. While this book devotes some attention to cutting-edge algorithms and techniques, its primary purpose is a thorough explication of best practices in the field. Furthermore, every chapter describes how the techniques discussed apply to Semitic languages. The book covers both statistical approaches to NLP, which are dominant across various applications nowadays, and the more traditional rule-based approaches, which have proven useful for several other application domains. We hope that this book will provide a "one-stop shop" for all the requisite background and practical advice when building NLP applications for Semitic languages.