Multimodal Interactive Handwritten Text Transcription


Book Description

This book presents an interactive multimodal approach for efficient transcription of handwritten text images. This approach, rather than full automation, assists the expert in the recognition and transcription process. Until now, handwritten text recognition (HTR) systems are far from being perfect and heavy human intervention is often required to check and correct the results of such systems. The interactive scenario studied in this book combines the efficiency of automatic handwriting recognition systems with the accuracy of the experts, leading to a cost-effective perfect transcription of the handwritten text images. The interactive system here allows the user to repeatedly interact with the system. Hence, the quality and ergonomy of the interactive process is crucial for the success of the system. Moreover, more ergonomic multimodal interfaces are used to obtain an easier and more comfortable human-machine interaction.




Multimodal Interactive Pattern Recognition and Applications


Book Description

This book presents a different approach to pattern recognition (PR) systems, in which users of a system are involved during the recognition process. This can help to avoid later errors and reduce the costs associated with post-processing. The book also examines a range of advanced multimodal interactions between the machine and the users, including handwriting, speech and gestures. Features: presents an introduction to the fundamental concepts and general PR approaches for multimodal interaction modeling and search (or inference); provides numerous examples and a helpful Glossary; discusses approaches for computer-assisted transcription of handwritten and spoken documents; examines systems for computer-assisted language translation, interactive text generation and parsing, relevance-based image retrieval, and interactive document layout analysis; reviews several full working prototypes of multimodal interactive PR applications, including live demonstrations that can be publicly accessed on the Internet.




Machine Learning for Multimodal Interaction


Book Description

This book constitutes the refereed proceedings of the 5th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2008, held in Utrecht, The Netherlands, in September 2008. The 12 revised full papers and 15 revised poster papers presented together with 5 papers of a special session on user requirements and evaluation of multimodal meeting browsers/assistants were carefully reviewed and selected from 47 submissions. The papers cover a wide range of topics related to human-human communication modeling and processing, as well as to human-computer interaction, using several communication modalities. Special focus is given to the analysis of non-verbal communication cues and social signal processing, the analysis of communicative content, audio-visual scene analysis, speech processing, interactive systems and applications.




Digital Libraries and Multimedia Archives


Book Description

This book constitutes the thoroughly refereed proceedings of the 14th Italian Research Conference on Digital Libraries, IRCDL 2018, held in Udine, Italy, in January 2018. The 14 full papers and 11 short papers presented were carefully selected from 30 submissions. The papers are organized in topical sections on digital library architecture; multimedia content analysis; models and applications.




Digital Libraries: Supporting Open Science


Book Description

This book constitutes the thoroughly refereed proceedings of the 15th Italian Research Conference on Digital Libraries, IRCDL 2019, held in Pisa, Italy, in January/February 2019. The 22 full papers and 5 short papers presented were carefully selected from 42 submissions. The papers are organized in topical sections on information retrieval, digital libraries and archives, information integration, open science, and data mining.




Pattern Recognition and Image Analysis


Book Description

This book constitutes the proceedings of the 7th Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA 2015, held in Santiage de Compostela, Spain, in June 2015. The 83 papers presented in this volume were carefully reviewed and selected from 141 submissions. They were organized in topical sections named: Pattern Recognition and Machine Learning; Computer Vision; Image and Signal Processing; Applications; Medical Image; Pattern Recognition and Machine Learning; Computer Vision; Image and Signal Processing; and Applications




Multimedia for Cultural Heritage


Book Description

This book constitutes the revised selected papers from the First International Workshop on Multimedia for Cultural Heritage, MM4CH 2011, held in Modena, Italy, on May 3, 2011. The 8 full papers and 9 poster papers included in this volume were carefully reviewed and selected from 25 submissions. In addition, the book contains a paper resuming the outcome of the discussion session. The workshop aimed on creating a profitable informal working day to discuss hot topics in multimedia, with special application to cultural heritage. The papers of the oral session are divided in topical sections named interaction and analysis and management.




Probabilistic Indexing for Information Search and Retrieval in Large Collections of Handwritten Text Images


Book Description

This book provides a comprehensive presentation of a recently introduced framework, named "probabilistic indexing" (PrIx), for searching text in large collections of document images and other related applications. It fosters the development of new search engines for effective information retrieval from manuscripts which, however, lack the electronic text (transcripts) that would typically be required for such search and retrieval tasks. The book is structured into 11 chapters and three appendices. The first two chapters briefly outline the necessary fundamentals and state of the art in pattern recognition, statistical decision theory, and handwritten text recognition. Chapter 3 presents approaches for indexing (as opposed to spotting) each region of a handwritten text image which is likely to contain a word. Next, Chapter 4 describes models adopted for handwritten text in images, namely hidden Markov models, convolutional and recurrent neural networks and language models, and provides full details of weighted finite-state transducer (WFST) concepts and methods, needed in further chapters of the book. Chapter 5 explains the set of techniques and algorithms developed to generate image probabilistic indexes which allow for fast search and retrieval of textual information in the indexed images. Chapter 6 then presents experimental evaluations of the proposed framework and algorithms on different traditional benchmark datasets and compares them with other approaches, while Chapter 7 reviews the most popular keyword-spotting approaches. Chapter 8 explains how PrIx can support classical free-text search tools, while Chapter 9 presents new methods that use PrIx not only for searching, but also to deal with text analytics and other related natural language processing and information extraction tasks. Chapter 10 shows how the proposed solutions can be used to effectively index very large collections of handwritten document images, before Chapter 11 eventually summarizes the book and suggests promising lines of future research. The appendices detail the necessary mathematical foundations for the work and presents details of the text image collections and datasets used in the experiments throughout the book. This book is written for researchers and (post-)graduate students in pattern recognition and information retrieval. It will also be of interest to people in areas like history, criminology, or psychology who need technical support to evaluate, understand or decode historical or contemporary handwritten text.




New Trends in Image Analysis and Processing – ICIAP 2019


Book Description

This book constitutes the refereed proceedings of five workshops and an industrial session held at the 20th International Conference on Image Analysis and Processing, ICIAP 2019, in Trento, Italy, in September 2019: Second International Workshop on Recent Advances in Digital Security: Biometrics and Forensics (BioFor 2019); First International Workshop on Pattern Recognition for Cultural Heritage (PatReCH 2019); First International Workshop eHealth in the Big Data and Deep Learning Era (e-BADLE 2019); International Workshop on Deep Understanding Shopper Behaviors and Interactions in Intelligent Retail Environments (DEEPRETAIL 2019); Industrial Session.




Document Analysis Systems


Book Description

This book constitutes the refereed proceedings of the 15th IAPR International Workshop on Document Analysis Systems, DAS 2022, held in La Rochelle, France, in May 2022. The full papers presented were carefully reviewed and selected from numerous submissions addressing key techniques of document analysis.