Document Analysis Systems V


Book Description

This book constitutes the refereed proceedings of the 5th International Workshop on Document Analysis Systems, DAS 2002, held in Princeton, NJ, USA in August 2002 with sponsorship from IAPR.The 44 revised full papers presented together with 14 short papers were carefuly reviwed and selected for inclusion in the book. All current issues in document analysis systems are adressed. The papers are organized in topical sections on OCR features and systems, handwriting recognition, layout analysis, classifiers and learning, tables and forms, text extraction, indexing and retrieval, document engineering, and new applications.




Content Computing


Book Description

Welcome to the Advanced Workshop on Content Computing 2004. The focus of this workshop was "Content Computing". It emphasized research areas that facilitate efficient, appropriate dissemination of content to users with the necessary access rights. We use the word "content" instead of "information" or "data" because we want to cover not only raw data but also presentation quality. The fast growth of the Internet has already made it the key infrastructure for information dissemination,education,business and entertainment. While the client-server model has been the most widely adopted paradigm for the WWW, the desire to provide more value-added services in the delivery layer has led to the concept of an active network, where content-driven, intelligent computation will be performed to provide quality-of-service for content presentation and best-?t client demand. These value-added services typically aim to enhance information security, provide pervasive Internet access, and improve application robustness, system/network performance, knowledge extraction,etc. They are realized by incorporating sophisticated mechanisms at the delivery layer,which is transparent to the content providers and Web surfers. Consequently, the notion of "Content Computing" has emerged. Content computing is a new paradigm for coordinating distributed systems and intelligent networks, based on a peer-to-peer model and with value-added processing of the application-specific contents at the - livery layer. This paradigm is especially useful to pervasive lightweight client devices such as mobile and portable end-user terminals with a wide variation of hardware/software configurations. This year, the workshop was held in Zhenjiang, Jiangsu, China. We received 194 high-quality papers from 11 regions, namely PR China, Korea, Singapore, Japan, United States, Canada, Australia, Germany, Taiwan, Italy, and Hong Kong. Totally, 62 papers were accepted and presented in the workshop.




Developments in Handwriting and Signature Identification in the Digital Age


Book Description

The examination of handwriting and signatures has a long and established history as a forensic discipline. With the advancement of technology in the use of digital tablets for signature capture, changes in handwriting examination are necessary. Other changes in handwriting, such as in increase in printed writing styles and the decrease in handwriting training in schools necessitates a re-examination of forensic handwriting identification problems. This text takes a fresh and modern look at handwriting examination as it pertains to forensic, legal, and criminal justice applications.




Technology Diffusion and Adoption: Global Complexity, Global Innovation


Book Description

Technology Diffusion and Adoption: Global Complexity, Global Innovation discusses the emerging topics of information technology and the IT based solutions in global and multi-cultural environments. This comprehensive collection addresses the aspects of innovation diffusion in the field of business computing technologies and is essential for researchers, practitioners, academicians and educators all over the world.







Document Image Processing


Book Description

This book is a printed edition of the Special Issue "Document Image Processing" that was published in J. Imaging




A Large Vocabulary Online Handwriting Recognition System for Turkish


Book Description

Handwriting recognition in general and online handwriting recognition in particular has been an active research area for several decades. Most of the research have been focused on English and recently on other scripts like Arabic and Chinese. There is a lack of research on recognition in Turkish text and this work primarily fills that gap with a state-of-the-art recognizer for the first time. It contains design and implementation details of a complete recognition system for recognition of Turkish isolated words. It considers the recognition of unconstrained handwriting with a limited vocabulary size first and then evolves to a large vocabulary system. Turkish script has many similarities with other Latin scripts, like English, which makes it possible to adapt strategies that work for them. However, there are some other issues which are particular to Turkish that should be taken into consideration separately. Two of the challenging issues in recognition of Turkish text are determined as delayed strokes and high Out-of-Vocabulary (OOV). This work examines these problems and alternative solutions at depth and proposes suitable solutions for Turkish script particularly.




Web Document Analysis


Book Description

This book provides the first comprehensive look at the emerging field of web document analysis. It sets the scene in this new field by combining state-of-the-art reviews of challenges and opportunities with research papers by leading researchers. Readers will find in-depth discussions on the many diverse and interdisciplinary areas within the field, including web image processing, applications of machine learning and graph theories for content extraction and web mining, adaptive web content delivery, multimedia document modeling and human interactive proofs for web security. Contents: Content Extraction and Web Mining; Document Analysis for Adaptive Content Delivery; Table Understanding on the Web; Web Image Analysis and Retrieval; New Opportunities. Readership: Graduate students and researchers in document-analysis and web communities.