Human Language Technology. Challenges for Computer Science and Linguistics


Book Description

This book constitutes the refereed proceedings of the 8th Language and Technology Conference: Challenges for Computer Science and Linguistics, LTC 2017, held in Poznan, Poland, in November 2017. The 26 revised papers presented in this volume were carefully reviewed and selected from 97 submissions. The papers selected for this volume cover the following fields: Language Resources, Tools and Evaluation, Less-Resourced Languages, Speech Processing, Morphology, Computational Semantics, Machine Translation, and Information Retrieval and Information Extraction.




Coreference


Book Description

‘Coreference’ presents the specificities of reference, anaphora and coreference in Polish, establishes an identity-of-reference annotation model and presents the methodology used to create the corpus of Polish general nominal coreference. Various resolution approaches are presented, followed by their evaluation. By discussing the subsequent steps of building a coreference-related component of the natural language processing toolset and offering a deeper explanation of the decisions taken, this volume might also serve as a reference book on state-of-the-art methods of carrying out coreference projects for new languages and as a tutorial for NLP practitioners. Apart from serving as a description of the first complete approach to annotation and resolution of direct nominal coreference for Polish, this book is a useful starting point for further work on other types of anaphora/coreference, semantic annotation, cognitive linguistics (related to the topic of near-identity, discussed in the book), etc. With extended tutorial-like sections on important subtopics, such as evaluation metrics for coreference resolution, it can prove useful to both researchers and practitioners interested in semantic description of Balto-Slavic languages and their processing, engineers developing language resources, tools and linguistic processing chains, as well as computational linguists in general.




Human Language Technology. Challenges of the Information Society


Book Description

Half a century ago not many people had realized that a new epoch in the history of homo sapiens had just started. The term “Information Society Age” seems an appropriate name for this epoch. Communication was without a doubt a lever of the conquest of the human race over the rest of the animate world. There is little doubt that the human race began when our predecessors started to communicate with each other using language. This highly abstract means of communication was probably one of the major factors contributing to the evolutionary success of the human race within the animal world. Physically weak and imperfect, humans started to dominate the rest of the world through the creation of communication-based societies where individuals communicated initially to satisfy immediate needs, and then to create, accumulate and process knowledge for future use. The crucial step in the history of humanity was the invention of writing. It is worth noting that writing is a human invention, not a phenomenon resulting from natural evolution. Humans invented writing as a technique for recording speech as well as for storing and facilitating the dissemination of knowledge across the world. Humans continue to be born illiterate, and therefore teaching and conscious supervised learning is necessary to maintain this basic social skill.




Human Language Technologies – The Baltic Perspective


Book Description

Throughout the last decade, the Baltic states have played an active role in regional and international language technology activities, supporting less-resourced languages in the digital age. This book presents the proceedings of the 7th International Conference: Human Language Technologies – The Baltic Perspective (Baltic HLT 2016), held in Riga, Latvia, in October 2016. Baltic HLT 2016 provided a forum for sharing ideas and recent advances in human language processing with a special focus on less-resourced languages. Papers selected for the conference cover a wide range of topics, including a general overview of language technology progress in the Baltic states, actual research topics in written and spoken language processing, the creation of language resources and their applications, and proposals for a European language platform. The book is divided into five sections: overview; speech technologies and corpora; machine translation; written language resources; and methods and tools for language processing. The book will be a useful resource, not only for Baltic language researchers, but also for those working with other less-resourced languages in Europe and beyond.




Language technologies for a multilingual Europe


Book Description

This volume of the series “Translation and Multilingual Natural Language Processing” includes most of the papers presented at the Workshop “Language Technology for a Multilingual Europe”, held at the University of Hamburg on September 27, 2011 in the framework of the conference GSCL 2011 with the topic “Multilingual Resources and Multilingual Applications”, along with several additional contributions. In addition to an overview article on Machine Translation and two contributions on the European initiatives META-NET and Multilingual Web, the volume includes six full research articles. Our intention with this workshop was to bring together various groups concerned with the umbrella topics of multilingualism and language technology, especially multilingual technologies. This encompassed, on the one hand, representatives from research and development in the field of language technologies, and, on the other hand, users from diverse areas such as, among others, industry, administration and funding agencies. The Workshop “Language Technology for a Multilingual Europe” was co-organised by the two GSCL working groups “Text Technology” and “Machine Translation” (http://gscl.info) as well as by META-NET (http://www.meta-net.eu).




European Language Equality


Book Description

This open access book presents a comprehensive collection of the European Language Equality (ELE) project’s results, its strategic agenda and roadmap with key recommendations to the European Union on how to achieve digital language equality in Europe by 2030. The fabric of the EU linguistic landscape comprises 24 official languages and over 60 regional and minority languages. However, language barriers still hamper communication and the free flow of information. Multilingualism is a key cultural cornerstone of Europe, signifying what it means to be and to feel European. Various studies and resolutions have found a striking imbalance in the support of Europe’s languages through technologies, issuing a call to action. Following an introduction, the book is divided into two parts. The first part describes the state of the art of language technology and language-centric AI and the definition and metrics developed to measure digital language equality. It also presents the status quo in 2022/2023, i.e., the current level of technology support for over 30 European languages. The second part describes plans and recommendations on how to bring about digital language equality in Europe by 2030. It includes chapters on the setup and results of the community consultation process, four technical deep dives, an overview of existing strategic documents and an abridged version of the strategic agenda and roadmap. The recommendations have been prepared jointly with the European community in the fields of language technology, natural language processing, and language-centric AI, as well as with representatives of relevant initiatives and associations, language communities and regional and minority language groups. Ensuring appropriate technology support for all European languages will not only create jobs, growth and opportunities in the digital single market. Overcoming language barriers in the digital environment is also essential for an inclusive society and for providing unity in diversity for many years to come.




Artificial Intelligence and Soft Computing


Book Description

The two-volume set LNAI 10245 and LNAI 10246 constitutes the refereed proceedings of the 16th International Conference on Artificial Intelligence and Soft Computing, ICAISC 2017, held in Zakopane, Poland in June 2017. The 133 revised full papers presented were carefully reviewed and selected from 274 submissions. The papers included in the second volume are organized in the following five parts: data mining; artificial intelligence in modeling, simulation and control; various problems of artificial intelligence; special session: advances in single-objective continuous parameter optimization with nature-inspired algorithms; special session: stream data mining.




Corpus Linguistics, Computer Tools, and Applications - State of the Art


Book Description

Contents: Barbara Lewandowska-Tomaszczyk: PALC 2007: Where are we now? - Paul Rayson/Dawn Archer/Alistair Baron/Nicholas Smith: Travelling through time with corpus annotation software - Eugene H. Casad: Parsing texts and compiling a dictionary with shoebox - Belinda Maia/Rui Silva/Anabela Barreiro/Cecília Fróis: 'N-grams in search of theories' - Piotr Pęzik/Jung-jae Kim/Dietrich Rebholz-Schuhmann: MedEvi - A permuted concordancer for the biomedical domain - Patrick Hanks: Why the «word sense disambiguation problem» can't be solved, and what should be done instead - Rafał




The Polish Language in the Digital Age


Book Description

This white paper is part of a series that promotes knowledge about language technology and its potential. It addresses educators, journalists, politicians, language communities and others. The availability and use of language technology in Europe varies between languages. Consequently, the actions that are required to further support research and development of language technologies also differ for each language. The required actions depend on many factors, such as the complexity of a given language and the size of its community. META-NET, a Network of Excellence funded by the European Commission, has conducted an analysis of current language resources and technologies. This analysis focused on the 23 official European languages as well as other important national and regional languages in Europe. The results of this analysis suggest that there are many significant research gaps for each language. A more detailed expert analysis and assessment of the current situation will help maximise the impact of additional research and minimise any risks. META-NET consists of 54 research centres from 33 countries that are working with stakeholders from commercial businesses, government agencies, industry, research organisations, software companies, technology providers and European universities. Together, they are creating a common technology vision while developing a strategic research agenda that shows how language technology applications can address any research gaps by 2020.




Advanced Approaches to Intelligent Information and Database Systems


Book Description

This book consists of 35 chapters presenting different theoretical and practical aspects of Intelligent Information and Database Systems. Nowadays both Intelligent and Database Systems are applied in most areas of human activity, which necessitates further research in these areas. In this book, various interesting issues related to intelligent information models and methods as well as their advanced applications, database systems applications, data models and their analysis, and digital multimedia methods and applications are presented and discussed from both the practical and theoretical points of view. The book is organized in four parts devoted to intelligent systems models and methods, intelligent systems advanced applications, database systems methods and applications, and multimedia systems methods and applications. The book will be of interest to practitioners and researchers, especially graduate and PhD students of information technology and computer science, as well as more experienced academics and specialists interested in the development and verification of intelligent information, database and multimedia systems models, methods and applications. Readers of this volume will find many inspiring ideas and motivating practical examples that will help them in their current and future work.