(CAD)2RL

Real Single-Image Flight Without a Single Real Image

Fereshteh Sadeghi                 Sergey Levine

Generic placeholder image

Abstract

We propose (CAD)2RL, a flight controller for Collision Avoidance via Deep Reinforcement Learning that can be used to perform collision-free flight in the real world although it is trained entirely in a 3D CAD model simulator. Our method uses only single RGB images from a monocular camera mounted on the robot as the input and is specialized for indoor hallway following and obstacle avoidance. In contrast to most indoor navigation techniques that aim to directly reconstruct the 3D geometry of the environment, our approach directly predicts the probability of collision given the current monocular image and a candidate action. To obtain accurate predictions, we develop a deep reinforcement learning algorithm for learning indoor navigation, which uses the actual performance of the current policy to construct accurate supervision. The collision prediction model is represented by a deep convolutional neural network that directly processes raw image inputs. Our collision avoidance system is entirely trained in simulation and thus addresses the high sample complexity of deep reinforcement learning and avoids the dangers of trial-and-error learning in the real world. By highly randomizing the rendering settings for our simulated training set, we show that we can train a collision predictor that generalizes to new environments with substantially different appearance from the training scenarios. Finally, we evaluate our method in the real world by controlling a real quadrotor flying through real hallways. We demonstrate that our method can perform real-world collision avoidance and hallway following after being trained exclusively on synthetic images, without ever having seen a single real image at the training time. For supplementary video see:



Bibtex

	@article{sadeghi2016cadrl,
	  title={{(CAD)$^2$RL}: Real Singel-Image Flight without a Singel Real Image},
	  author={Sadeghi, Fereshteh and Levine, Sergey},
	  journal={arXiv preprint arXiv:1611.04201},
	  year={2016}
	}
			

For questions/comments please contact fsadeghi ~AT~ cs.washington.edu