Nguyen, Manh Duy (2024) A search engine using graph-based structures for lifelog retrieval. PhD thesis, Dublin City University.
Abstract
Lifelog, a personal digital record of daily activities, is becoming popular and can operate as a form of a digital diary. Although having a wide range of applications, the basic underlying challenge of lifelogs is how to build a lifelog retrieval system that can retrieve a specific activity efficiently from large multimodal lifelog data. Recently, there have been many workshops organized to encourage researchers to solve that problem leading to the introduction of many lifelog retrieval systems with different approaches. In addition to the user interface, the backend search engine plays a critical role in the lifelog retrieval system. The fundamental approach for the lifelog search engine is the concept-based model which matches the objects in lifelog images with keywords in a semantic query. However, many state-of-the-art embedding multimodal retrieval models, such as CLIP, have been introduced recently where they encode both images and texts into the same vector space to perform the retrieval. This has opened the opportunity to apply this new generation search engine to the lifelog retrieval task. In this thesis, I propose applying embedding- based models to the lifelog retrieval system. Furthermore, I enhance these models by applying a graph-based structure to them. I also build an interactive lifelog retrieval system that follows the concept-based approach to serve as the baseline in my experiment. Through my interactive user studies, I observe that the graph- based enhanced retrieval model surpasses the conventional concept-based by a large margin in various evaluation metrics. In summary, this thesis primarily contributes to the field of creating an accurate lifelog search engine using graph neural network structures that can be employed to build an effective and efficient lifelog retrieval system to address lifelog retrieval tasks.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | August 2024 |
Refereed: | No |
Supervisor(s): | Gurrin, Cathal |
Subjects: | Computer Science > Artificial intelligence Computer Science > Information retrieval Computer Science > Lifelog |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License |
ID Code: | 30148 |
Deposited On: | 18 Nov 2024 12:01 by Cathal Gurrin . Last Modified 18 Nov 2024 12:01 |
Documents
Full text available as:
Preview |
PDF (Dissertation)
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0 81MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record