Multi-Modal Geometric Learning for Grasping and Manipulation

Watkins-Valls, David; Varley, Jacob; Allen, Peter

doi:10.1109/ICRA.2019.8794233

Citation Details

Multi-Modal Geometric Learning for Grasping and Manipulation

This work provides an architecture that incorporates depth and tactile information to create rich and accurate 3D models useful for robotic manipulation tasks. This is accomplished through the use of a 3D convolutional neural network (CNN). Offline, the network is provided with both depth and tactile information and trained to predict the object’s geometry, thus filling in regions of occlusion. At runtime, the network is provided a partial view of an object. Tactile information is acquired to augment the captured depth information. The network can then reason about the object’s geometry by utilizing both the collected tactile and depth information. We demonstrate that even small amounts of additional tactile information can be incredibly helpful in reasoning about object geometry. This is particularly true when information from depth alone fails to produce an accurate geometric prediction. Our method is benchmarked against and outperforms other visual-tactile approaches to general geometric reasoning. We also provide experimental results comparing grasping success with our method. more »

Award ID(s):: 1734557

PAR ID:: 10111003

Author(s) / Creator(s):: Watkins-Valls, David; Varley, Jacob; Allen, Peter

Date Published:: 2019-05-01

Journal Name:: 2019 International Conference on Robotics and Automation (ICRA)

Page Range / eLocation ID:: 7339 to 7345

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICRA.2019.8794233

More Like this