Abstract

Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints and angles, even in the presence of optical distortions. In robotics, such skills are typically referred to as visual servoing: moving a tool or end-point to a desired location using primarily visual feedback. In this paper, we study how viewpoint-independent visual servoing skills can be learned automatically in a robotic manipulation scenario. To this end, we train a deep, recurrent controller that can automatically determine which actions move the end-point of a robotic arm to a desired object. The problem that must be solved by this controller is fundamentally ambiguous: under severe variation in viewpoint, it may be impossible to determine the actions in a single feedforward operation. Instead, our visual servoing system must use its memory of past movements to understand how the actions affect the robot motion from the current viewpoint, correcting mistakes and gradually moving closer to the target. This ability is in stark contrast to most visual servoing methods, which either assume known dynamics or require a calibration phase. We show how we can learn this recurrent controller using simulated data, and then describe how the resulting model can be transferred to a real-world robot by disentangling perception from control and only adapting the visual layers. The adapted model can servo to previously unseen objects from novel viewpoints on a real-world Kuka IIWA robotic arm.

Bibtex

@inproceedings{sadeghi2018sim2realservo,
  title={Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control},
  author={Sadeghi, Fereshteh and Toshev, Alexander and Jang, Eric and Levine, Sergey},
  booktitle={CVPR},
  year={2018}
}

Sim2Real View Invariant Visual Servoing by Recurrent Control

Fereshteh Sadeghi Alexander Toshev Eric Jang Sergey Levine

Abstract

Bibtex