Machine Translation


Book Description

The use of the computer in translating natural languages ranges from that of a translator's aid for word processing and dictionary lookup to that of a full-fledged translator on its own. However the obstacles to translating by means of the computer are primarily linguistic. To overcome them it is necessary to resolve the ambiguities that pervade a natural language when words and sentences are viewed in isolation. The problem then is to formalize, in the computer, these aspects of natural language understanding. The authors show how, from a linguistic point of view, one may form some idea of what goes on inside a system's black box, given only the input (original text) and the raw output (translated text before post-editing). Many examples of English/French translation are used to illustrate the principles involved.




Translation Quality Assessment


Book Description

This is the first volume that brings together research and practice from academic and industry settings and a combination of human and machine translation evaluation. Its comprehensive collection of papers by leading experts in human and machine translation quality and evaluation who situate current developments and chart future trends fills a clear gap in the literature. This is critical to the successful integration of translation technologies in the industry today, where the lines between human and machine are becoming increasingly blurred by technology: this affects the whole translation landscape, from students and trainers to project managers and professionals, including in-house and freelance translators, as well as, of course, translation scholars and researchers. The editors have broad experience in translation quality evaluation research, including investigations into professional practice with qualitative and quantitative studies, and the contributors are leading experts in their respective fields, providing a unique set of complementary perspectives on human and machine translation quality and evaluation, combining theoretical and applied approaches.




Neural Machine Translation


Book Description

Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.




Quality Estimation for Machine Translation


Book Description

Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics are faced with the challenge that multiple good (and bad) quality outputs can be produced by text-to-text approaches for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), when systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate on how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as application, since this represents most of the QE work done to date. It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.




Evaluation of Translation Technology


Book Description

This collection of scholarly articles asks the question How useful is translation technology? Pointing to the need for a widely used and reliable way to test the efficiency of language translation programs, the presenters show that commercial tools such as translation memories and translation workbenches are popular, and their developers find them useful in terms of productivity, consistency, or quality. However, these claims are rarely proven using objective comparative studies, and this group describes several new statistical approaches to more rigorous evaluation methods. -- Product Description.







Analysing English-Arabic Machine Translation


Book Description

Machine Translation (MT) has become widely used throughout the world as a medium of communication between those who live in different countries and speak different languages. However, translation between distant languages constitutes a challenge for machines. Therefore, translation evaluation is poised to play a significant role in the process of designing and developing effective MT systems. This book evaluates three prominent MT systems, including Google Translate, Microsoft Translator, and Sakhr, each of which provides translation between English and Arabic. In the book Almahasees scrutinizes the capacity of the three systems in dealing with translation between English and Arabic in a large corpus taken from various domains, including the United Nation (UN), the World Health Organization (WHO), the Arab League, Petra News Agency reports, and two literary texts: The Old Man and the Sea and The Prophet. The evaluation covers holistic analysis to assess the output of the three systems in terms of Translation Automation User Society (TAUS) adequacy and fluency scales. The text also looks at error analysis to evaluate the systems’ output in terms of orthography, lexis, grammar, and semantics at the entire-text level and in terms of lexis, grammar, and semantics at the collocation level. The research findings contained within this volume provide important feedback about the capabilities of the three MT systems with respect to EnglishArabic translation and paves the way for further research on such an important topic. This book will be of interest to scholars and students of translation studies and translation technology.




Handbook of Natural Language Processing and Machine Translation


Book Description

This comprehensive handbook, written by leading experts in the field, details the groundbreaking research conducted under the breakthrough GALE program--The Global Autonomous Language Exploitation within the Defense Advanced Research Projects Agency (DARPA), while placing it in the context of previous research in the fields of natural language and signal processing, artificial intelligence and machine translation. The most fundamental contrast between GALE and its predecessor programs was its holistic integration of previously separate or sequential processes. In earlier language research programs, each of the individual processes was performed separately and sequentially: speech recognition, language recognition, transcription, translation, and content summarization. The GALE program employed a distinctly new approach by executing these processes simultaneously. Speech and language recognition algorithms now aid translation and transcription processes and vice versa. This combination of previously distinct processes has produced significant research and performance breakthroughs and has fundamentally changed the natural language processing and machine translation fields. This comprehensive handbook provides an exhaustive exploration into these latest technologies in natural language, speech and signal processing, and machine translation, providing researchers, practitioners and students with an authoritative reference on the topic.




Recent Trends in Computational Intelligence


Book Description

Traditional models struggle to cope with complexity, noise, and the existence of a changing environment, while Computational Intelligence (CI) offers solutions to complicated problems as well as reverse problems. The main feature of CI is adaptability, spanning the fields of machine learning and computational neuroscience. CI also comprises biologically-inspired technologies such as the intellect of swarm as part of evolutionary computation and encompassing wider areas such as image processing, data collection, and natural language processing. This book aims to discuss the usage of CI for optimal solving of various applications proving its wide reach and relevance. Bounding of optimization methods and data mining strategies make a strong and reliable prediction tool for handling real-life applications.




Machine Translation


Book Description

A concise, nontechnical overview of the development of machine translation, including the different approaches, evaluation issues, and major players in the industry. The dream of a universal translation device goes back many decades, long before Douglas Adams's fictional Babel fish provided this service in The Hitchhiker's Guide to the Galaxy. Since the advent of computers, research has focused on the design of digital machine translation tools—computer programs capable of automatically translating a text from a source language to a target language. This has become one of the most fundamental tasks of artificial intelligence. This volume in the MIT Press Essential Knowledge series offers a concise, nontechnical overview of the development of machine translation, including the different approaches, evaluation issues, and market potential. The main approaches are presented from a largely historical perspective and in an intuitive manner, allowing the reader to understand the main principles without knowing the mathematical details. The book begins by discussing problems that must be solved during the development of a machine translation system and offering a brief overview of the evolution of the field. It then takes up the history of machine translation in more detail, describing its pre-digital beginnings, rule-based approaches, the 1966 ALPAC (Automatic Language Processing Advisory Committee) report and its consequences, the advent of parallel corpora, the example-based paradigm, the statistical paradigm, the segment-based approach, the introduction of more linguistic knowledge into the systems, and the latest approaches based on deep learning. Finally, it considers evaluation challenges and the commercial status of the field, including activities by such major players as Google and Systran.