3D Reconstruction and Recognition of Objects Using a Kinect Camera


Book Description

This project builds on recent advances in 3D depth cameras, such as the Microsoft Kinect sensor. This technology makes it possible to create highly accurate 3D figures and scenes, simplifying difficult tasks in robotics and embedded systems. The purpose of the project is to use both color data and depth sensing to reconstruct and recognize objects in a scene. The data from the Kinect sensor is first processed to remove the planes of the scene, giving a clearer view of the elements it contains, and relevant features are then extracted from every object to build a model for each class. The results show the performance of the system when a database of different elements is tested against models trained on three previously selected object classes: bowls, pillows, and monitors. We conclude that classification using color information is more accurate than classification using 3D data, although the results are quite satisfactory in both cases.




Consumer Depth Cameras for Computer Vision


Book Description

The potential of consumer depth cameras extends well beyond entertainment and gaming, to real-world commercial applications. This authoritative text reviews the scope and impact of this rapidly growing field, describing the most promising Kinect-based research activities, discussing significant current challenges, and showcasing exciting applications. Features: presents contributions from an international selection of preeminent authorities in their fields, from both academic and corporate research; addresses the classic problem of multi-view geometry of how to correlate images from different viewpoints to simultaneously estimate camera poses and world points; examines human pose estimation using video-rate depth images for gaming, motion capture, 3D human body scans, and hand pose recognition for sign language parsing; provides a review of approaches to various recognition problems, including category and instance learning of objects, and human activity recognition; with a Foreword by Dr. Jamie Shotton.




Computer Vision and Machine Learning with RGB-D Sensors


Book Description

This book presents an interdisciplinary selection of cutting-edge research on RGB-D based computer vision. Features: discusses the calibration of color and depth cameras, the reduction of noise on depth maps and methods for capturing human performance in 3D; reviews a selection of applications which use RGB-D information to reconstruct human figures, evaluate energy consumption and obtain accurate action classification; presents an approach for 3D object retrieval and for the reconstruction of gas flow from multiple Kinect cameras; describes an RGB-D computer vision system designed to assist the visually impaired and another for smart-environment sensing to assist elderly and disabled people; examines the effective features that characterize static hand poses and introduces a unified framework to enforce both temporal and spatial constraints for hand parsing; proposes a new classifier architecture for real-time hand pose recognition and a novel hand segmentation and gesture recognition system.




RGB-D Image Analysis and Processing


Book Description

This book focuses on the fundamentals and recent advances in RGB-D imaging as well as covering a range of RGB-D applications. The topics covered include: data acquisition, data quality assessment, filling holes, 3D reconstruction, SLAM, multiple depth camera systems, segmentation, object detection, salience detection, pose estimation, geometric modelling, fall detection, autonomous driving, motor rehabilitation therapy, people counting and cognitive service robots. The availability of cheap RGB-D sensors has led to an explosion over the last five years in the capture and application of colour plus depth data. The addition of depth data to regular RGB images vastly increases the range of applications, and has resulted in a demand for robust and real-time processing of RGB-D data. There remain many technical challenges, and RGB-D image processing is an ongoing research area. This book covers the full state of the art, and consists of a series of chapters by internationally renowned experts in the field. Each chapter is written so as to provide a detailed overview of that topic. RGB-D Image Analysis and Processing will enable students and professional developers alike to quickly get up to speed with contemporary techniques, and apply RGB-D imaging in their own projects.




Image-Based 3D Reconstruction of Dynamic Objects Using Instance-Aware Multibody Structure from Motion


Book Description

"This work proposes a Multibody Structure from Motion (MSfM) algorithm for moving object reconstruction that incorporates instance-aware semantic segmentation and multiple view geometry methods. The MSfM pipeline tracks two-dimensional object shapes on pixel level to determine object specific feature correspondences, in order to reconstruct 3D object shapes as well as 3D object motion trajectories" -- Publicaciones de Arquitectura y Arte.




Pattern Recognition and Image Analysis


Book Description

This book constitutes the proceedings of the 7th Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA 2015, held in Santiago de Compostela, Spain, in June 2015. The 83 papers presented in this volume were carefully reviewed and selected from 141 submissions. They were organized in topical sections named: Pattern Recognition and Machine Learning; Computer Vision; Image and Signal Processing; Applications; and Medical Image.




Pattern Recognition. ICPR International Workshops and Challenges


Book Description

This 8-volume set constitutes the refereed proceedings of the 25th International Conference on Pattern Recognition Workshops, ICPR 2020, held virtually in Milan, Italy, and rescheduled to January 10-11, 2021 due to the COVID-19 pandemic. The 416 full papers presented in these 8 volumes were carefully reviewed and selected from about 700 submissions. The 46 workshops cover a wide range of areas including machine learning, pattern analysis, healthcare, human behavior, environment, surveillance, forensics and biometrics, robotics and egovision, cultural heritage and document analysis, retrieval, and women at ICPR2020.




Kinect-based Object Reconstruction


Book Description

The Microsoft Kinect has recently grown to prominence as a widely used 3D sensor for academics and hobbyists alike. This thesis presents a Kinect-oriented framework for reproducing physical objects using open or closed source software, GPU-accelerated hardware, and a 3D printer. Specifics of data capture, surface reconstruction, and manufacture are discussed, along with examples of reproducing mechanical and biological models. Finally, future systematic improvements and additional application areas are discussed.




Image Analysis and Recognition


Book Description

The two volumes LNCS 8814 and 8815 constitute the thoroughly refereed proceedings of the 11th International Conference on Image Analysis and Recognition, ICIAR 2014, held in Vilamoura, Portugal, in October 2014. The 107 revised full papers presented were carefully reviewed and selected from 177 submissions. The papers are organized in the following topical sections: image representation and models; sparse representation; image restoration and enhancement; feature detection and image segmentation; classification and learning methods; document image analysis; image and video retrieval; remote sensing; applications; action, gestures and audio-visual recognition; biometrics; medical image processing and analysis; medical image segmentation; computer-aided diagnosis; retinal image analysis; 3D imaging; motion analysis and tracking; and robot vision.




Computer Vision -- ECCV 2014


Book Description

The seven-volume set comprising LNCS volumes 8689-8695 constitutes the refereed proceedings of the 13th European Conference on Computer Vision, ECCV 2014, held in Zurich, Switzerland, in September 2014. The 363 revised papers presented were carefully reviewed and selected from 1444 submissions. The papers are organized in topical sections on tracking and activity recognition; recognition; learning and inference; structure from motion and feature matching; computational photography and low-level vision; vision; segmentation and saliency; context and 3D scenes; motion and 3D scene analysis; and poster sessions.