Sanchez, Francisco Roldan (2025) Sample and time efficient strategies for off-policy reinforcement learning in robotic manipulation tasks. PhD thesis, Dublin City University.
In recent years, reinforcement learning algorithms have become very popular in solving robotic manipulation tasks. However, these algorithms usually suffer from sample inefficiency, meaning that they require extensive exploration to find an appropriate control policy capable of solving the robotic manipulation tasks targeted, and, consequently, this sample inefficiency leads to training time inefficiency. The work presented in this thesis aims to solve this problem through three main methods: (i) reward engineering; (ii) a sequential execution of primitive skills; and (iii) an automatic execution of primitive policies. In particular, a novel reward that encourages lifting behaviour is presented and evaluated in the context of the Real Robot Challenge competition problem. A novel learning framework for in-hand object 3D rotations based on sequentially executed primitives is proposed and evaluated in the Shadow Dexterous Hand simulated environment of the OpenAI Gym. Finally, a new algorithm is proposed that is capable of automatically deciding, during exploration, whether or not to use a primitive behaviour, with this approach evaluated
in different tasks for different manipulators available in the OpenAI Gym.
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | 7 January 2025 |
Refereed: | No |
Supervisor(s): | O'Connor, Noel E., McGuinness, Kevin and Redmond, Stephen |
Subjects: | Computer Science > Machine learning Engineering > Robotics |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Institutes and Centres > INSIGHT Centre for Data Analytics |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License |
Funders: | Science Foundation Ireland |
ID Code: | 30641 |
Deposited On: | 10 Mar 2025 15:10 by Noel Edward O'connor . Last Modified 10 Mar 2025 15:10 |
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0 8MB |
Downloads
Downloads per month over past year
Archive Staff Only: edit this record