A Computational Perspective on Visual Attention


Book Description

The derivation, exposition, and justification of the Selective Tuning model of vision and attention. Although William James declared in 1890, "Everyone knows what attention is," today there are many different and sometimes opposing views on the subject. This fragmented theoretical landscape may be because most of the theories and models of attention offer explanations in natural language or in a pictorial manner rather than providing a quantitative and unambiguous statement of the theory. They focus on the manifestations of attention instead of its rationale. In this book, John Tsotsos develops a formal model of visual attention with the goal of providing a theoretical explanation for why humans (and animals) must have the capacity to attend. He takes a unique approach to the theory, using the full breadth of the language of computation—rather than simply the language of mathematics—as the formal means of description. The result, the Selective Tuning model of vision and attention, explains attentive behavior in humans and provides a foundation for building computer systems that see with human-like characteristics. The overarching conclusion is that human vision is based on a general purpose processor that can be dynamically tuned to the task and the scene viewed on a moment-by-moment basis. Tsotsos offers a comprehensive, up-to-date overview of attention theories and models and a full description of the Selective Tuning model, confining the formal elements to two chapters and two appendixes. The text is accompanied by more than 100 illustrations in black and white and color; additional color illustrations and movies are available on the book's Web site.




Selective Visual Attention


Book Description

Visual attention is a relatively new area of study combining a number of disciplines: artificial neural networks, artificial intelligence, vision science and psychology. The aim is to build computational models similar to human vision in order to solve tough problems for many potential applications including object recognition, unmanned vehicle navigation, and image and video coding and processing. In this book, the authors provide an up to date and highly applied introduction to the topic of visual attention, aiding researchers in creating powerful computer vision systems. Areas covered include the significance of vision research, psychology and computer vision, existing computational visual attention models, and the authors' contributions on visual attention models, and applications in various image and video processing tasks. This book is geared for graduates students and researchers in neural networks, image processing, machine learning, computer vision, and other areas of biologically inspired model building and applications. The book can also be used by practicing engineers looking for techniques involving the application of image coding, video processing, machine vision and brain-like robots to real-world systems. Other students and researchers with interdisciplinary interests will also find this book appealing. Provides a key knowledge boost to developers of image processing applications Is unique in emphasizing the practical utility of attention mechanisms Includes a number of real-world examples that readers can implement in their own work: robot navigation and object selection image and video quality assessment image and video coding Provides codes for users to apply in practical attentional models and mechanisms




Computational Visual Attention Models


Book Description

The human visual system has evolved to have the ability to selectively focus on the most relevant parts of a visual scene. This mechanism, referred to as visual attention, has been the focus of several neurological and psychological studies in the past few decades. These studies have inspired several computational visual attention models which have been successfully applied to problems in computer vision and robotics. Computational Visual Attention Models provides a comprehensive survey of the state-of-the-art in computational visual attention modeling with a special focus on the latest trends. By reviewing several models published since 2012, the theoretical advantages and disadvantages of each approach are discussed. In addition, existing methodologies to evaluate computational models through the use of eye-tracking data along with the visual attention performance metrics used are described. The shortcomings in existing approaches and approaches to overcome them are also covered. Finally, a subjective evaluation for benchmarking existing visual attention metrics is presented and open problems in visual attention are highlighted. This monograph provides the reader with an in-depth survey of the research conducted to date in computational visual attention models and provides the basis for further research in this exciting area.




Vision


Book Description

Available again, an influential book that offers a framework for understanding visual perception and considers fundamental questions about the brain and its functions. David Marr's posthumously published Vision (1982) influenced a generation of brain and cognitive scientists, inspiring many to enter the field. In Vision, Marr describes a general framework for understanding visual perception and touches on broader questions about how the brain and its functions can be studied and understood. Researchers from a range of brain and cognitive sciences have long valued Marr's creativity, intellectual power, and ability to integrate insights and data from neuroscience, psychology, and computation. This MIT Press edition makes Marr's influential work available to a new generation of students and scientists. In Marr's framework, the process of vision constructs a set of representations, starting from a description of the input image and culminating with a description of three-dimensional objects in the surrounding environment. A central theme, and one that has had far-reaching influence in both neuroscience and cognitive science, is the notion of different levels of analysis—in Marr's framework, the computational level, the algorithmic level, and the hardware implementation level. Now, thirty years later, the main problems that occupied Marr remain fundamental open problems in the study of perception. Vision provides inspiration for the continuing efforts to integrate knowledge from cognition and computation to understand vision and the brain.







Visual Attention and Cortical Circuits


Book Description

An attempt to derive a comprehensive theory of attention from both neurobiological and psychological data.




The Cambridge Handbook of Computational Psychology


Book Description

A cutting-edge reference source for the interdisciplinary field of computational cognitive modeling.




VISIT


Book Description

One of the challenges for models of cognitive phenomena is the development of efficient and flexible interfaces between low level sensory information and high level processes. For visual processing, researchers have long argued that an attentional mechanism is required to perform many of the tasks required by high level vision. This thesis presents VISIT, a connectionist model of covert visual attention that has been used as a vehicle for studying this interface. The model is efficient, flexible, and is biologically plausible. The complexity of the network is linear in the number of pixels. Effective parallel strategies are used to minimize the number of iterations required. The resulting system is able to efficiently solve two tasks that are particularly difficult for standard bottom-up models of vision: computing spatial relations and visual search. Simulations show that the network's behavior matches much of the known psychophysical data on human visual attention. The general architecture of the model also closely matches the known physiological data on the human attention system. Various extensions to VISIT are discussed, including methods for learning the component modules.




Computational Models of Visual Processing


Book Description

The more than twenty contributions in this book, all new and previously unpublished, provide an up-to-date survey of contemporary research on computational modeling of the visual system. The approaches represented range from neurophysiology to psychophysics, and from retinal function to the analysis of visual cues to motion, color, texture, and depth. The contributions are linked thematically by a consistent consideration of the links between empirical data and computational models in the study of visual function. An introductory chapter by Edward Adelson and James Bergen gives a new and elegant formalization of the elements of early vision. Subsequent sections treat receptors and sampling, models of neural function, detection and discrimination, color and shading, motion and texture, and 3D shape. Each section is introduced by a brief topical review and summary. ContributorsEdward H. Adelson, Albert J. Ahumada, Jr., James R. Bergen, David G. Birch, David H. Brainard, Heinrich H. Bülthoff, Charles Chubb, Nancy J. Coletta, Michael D'Zmura, John P. Frisby, Norma Graham, Norberto M. Grzywacz, P. William Haake, Michael J. Hawken, David J. Heeger, Donald C. Hood, Elizabeth B. Johnston, Daniel Kersten, Michael S. Landy, Peter Lennie, J. Stephen Mansfield, J. Anthony Movshon, Jacob Nachmias, Andrew J. Parker, Denis G. Pelli, Stephen B. Pollard, R. Clay Reid, Robert Shapley, Carlo L. M. Tiana, Brian A. Wandell, Andrew B. Watson, David R. Williams, Hugh R. Wilson, Yuede. Yang, Alan L. Yuille




The Oxford Handbook of Attention


Book Description

During the last three decades, there have been enormous advances in our understanding of the neural mechanisms of selective attention at the network as well as the cellular level. The Oxford Handbook of Attention brings together the different research areas that constitute contemporary attention research into one comprehensive and authoritative volume. In 40 chapters, it covers the most important aspects of attention research from the areas of cognitive psychology, neuropsychology, human and animal neuroscience, computational modelling, and philosophy. The book is divided into 4 main sections. Following an introduction from Michael Posner, the books starts by looking at theoretical models of attention. The next two sections are dedicated to spatial attention and non-spatial attention respectively. Within section 4, the authors consider the interactions between attention and other psychological domains. The last two sections focus on attention-related disorders, and finally, on computational models of attention. Aimed at both scholars and students, the Oxford Handbook of Attention provides a concise and state-of-the-art review of the current literature in this field.