Abstract
This article presents an online local path planning approach for autonomous drone navigating a 2D plane in an unknown, indoor corridor-like environment. The proposed method utilizes a reinforcement learning approach for training a local path planner for navigation in the said environment. With a continuous actor-critic learning automaton (CACLA) applied for continuous action spaces, the proposed algorithm uses a reward structure that formulates a balancing function that gives reward based on balancing the vehicle between artificial potential hills. The drone thereby learns steering control and obstacle avoidance while maintaining a central aligned position with respect to the unknown hallways or corridors. A novel CACLA algorithm and incorporation of a special experience replay memory for the better converging tendency of drone toward the balancing point have been introduced in this article. The proposed reinforcement learning-based online local path planner has been tested on a simulated drone in Gazebo environment.
Original language | English |
---|---|
Pages (from-to) | 32-41 |
Number of pages | 10 |
Journal | IEEE Intelligent Systems |
Volume | 37 |
Issue number | 5 |
DOIs | |
Publication status | Published - 2022 |
Externally published | Yes |
Keywords
- Artificial Potential Field
- CACLA
- Online local path planning
- Reinforcement Learning