Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Saliency modelling for 2D and 360° visual data: deep learning models and performance evaluation

Djilali, Yasser Abdelaziz Dahou (2024) Saliency modelling for 2D and 360° visual data: deep learning models and performance evaluation. PhD thesis, Dublin City University.

Abstract
This thesis explores the field of visual attention modeling, focusing on saliency prediction for both 2D and 360° visual data. The research is grounded in integrating deep learning techniques with human fixation data, aiming to refine current methods and address existing approaches’ limitations. Key contributions include the analysis of self-attention maps in Vision Transformers for improved saliency prediction and developing a transformer-based model for image saliency. Then, we target the field of 360° data and introduce ATSal, which integrates attention mechanisms with expert instances for omnidirectional scene analysis. Additionally, the thesis explores the application of contrastive learning to 360° scenes, presenting an unsupervised framework that effectively learns general representations. The final chapter introduces a new probabilistic metric suited for 360° saliency. This comprehensive study advances the understanding and application of deep learning in visual attention modeling.
Metadata
Item Type:Thesis (PhD)
Date of Award:August 2024
Refereed:No
Supervisor(s):O'Connor, Noel
Subjects:Computer Science > Artificial intelligence
Computer Science > Image processing
Computer Science > Machine learning
Computer Science > Digital video
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License
Funders:Science Foundation Ireland
ID Code:30202
Deposited On:18 Nov 2024 14:36 by Noel Edward O'connor . Last Modified 18 Nov 2024 14:36
Documents

Full text available as:

[thumbnail of Thesis_Yasser_corrected_v4.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0
77MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record