Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

An exploration into the benefits of the CLIP model for Lifelog Retrieval

Tran, Ly-Duyen orcid logoORCID: 0000-0002-9597-1832, Alam, Naushad orcid logoORCID: 0000-0002-3144-5622, Vo, Linh Khanh, Diep, Nghiem Tuong, Nguyen, Binh orcid logoORCID: 0000-0001-5249-9702, Graham, Yvette orcid logoORCID: 0000-0001-6741-4855, Zhou, Liting orcid logoORCID: 0000-0002-7778-8743 and Gurrin, Cathal orcid logoORCID: 0000-0003-2903-3968 (2022) An exploration into the benefits of the CLIP model for Lifelog Retrieval. In: International Conference on Content-Based Multimedia Indexing, 14–16 Sept 2022, Graz, Austria. ISBN 978-1-4503-9720-9

Abstract
In this paper, we attempt to fine-tune the CLIP (Contrastive Language-Image Pre-Training) model on the Lifelog Question Answering dataset (LLQA) to investigate retrieval performance of the fine-tuned model over the zero-shot baseline model. We train the model adopting a weight space ensembling approach using a modified loss function to take into account the differences in our dataset (LLQA) when compared with the dataset the CLIP model was originally pretrained on. We further evaluate our fine-tuned model using visual as well as multimodal queries on multiple retrieval tasks, demonstrating improved performance over the zero-shot baseline model.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:lifelogging; image retrieval; pretrained models
Subjects:Computer Science > Algorithms
Computer Science > Information retrieval
Computer Science > Lifelog
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: International Conference on Content-Based Multimedia Indexing (CBMI 2022). . Association for Computing Machinery. ISBN 978-1-4503-9720-9
Publisher:Association for Computing Machinery
Official URL:https://doi.org/10.1145/3549555.3549593
Copyright Information:© 2022 The Authors
Funders:Science Foundation Ireland grant numbers SFI/12/RC/2289, SFI/12/RC/2289-P2, SFI/13/RC/2106, 18/CRT/6223 and 18/CRT/6224
ID Code:27842
Deposited On:10 Oct 2022 09:09 by Ly Duyen Tran . Last Modified 04 Mar 2024 12:37
Documents

Full text available as:

[thumbnail of 3549555.3549593.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0
2MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record