Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Sample and time efficient strategies for off-policy reinforcement learning in robotic manipulation tasks

Sanchez, Francisco Roldan (2025) Sample and time efficient strategies for off-policy reinforcement learning in robotic manipulation tasks. PhD thesis, Dublin City University.

In recent years, reinforcement learning algorithms have become very popular in solving robotic manipulation tasks. However, these algorithms usually suffer from sample inefficiency, meaning that they require extensive exploration to find an appropriate control policy capable of solving the robotic manipulation tasks targeted, and, consequently, this sample inefficiency leads to training time inefficiency. The work presented in this thesis aims to solve this problem through three main methods: (i) reward engineering; (ii) a sequential execution of primitive skills; and (iii) an automatic execution of primitive policies. In particular, a novel reward that encourages lifting behaviour is presented and evaluated in the context of the Real Robot Challenge competition problem. A novel learning framework for in-hand object 3D rotations based on sequentially executed primitives is proposed and evaluated in the Shadow Dexterous Hand simulated environment of the OpenAI Gym. Finally, a new algorithm is proposed that is capable of automatically deciding, during exploration, whether or not to use a primitive behaviour, with this approach evaluated in different tasks for different manipulators available in the OpenAI Gym.
Item Type:Thesis (PhD)
Date of Award:7 January 2025
Refereed:No
Supervisor(s):O'Connor, Noel E., McGuinness, Kevin and Redmond, Stephen
Subjects:Computer Science > Machine learning
Engineering > Robotics
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering
Research Institutes and Centres > INSIGHT Centre for Data Analytics
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License
Funders:Science Foundation Ireland
ID Code:30641
Deposited On:10 Mar 2025 15:10 by Noel Edward O'connor . Last Modified 10 Mar 2025 15:10

Full text available as:

[thumbnail of PhD_Final_Thesis-1.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0
8MB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record