Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Domain generalisation with bidirectional encoder representations from vision transformers

Riaz, Hamza and Smeaton, Alan F. orcid logoORCID: 0000-0003-1028-8389 (2023) Domain generalisation with bidirectional encoder representations from vision transformers. In: 25th Irish Machine Vision and Image Processing Conference, 30 Aug - 1 Sept 2023, Galway, Ireland.

Abstract
Domain generalisation involves pooling knowledge from source domain(s) into a single model that can generalise to unseen target domain(s). Recent research in domain generalisation has faced challenges when using deep learning models as they interact with data distributions which differ from those they are trained on. % Here we perform domain generalisation on out-of-distribution (OOD) vision benchmarks using vision transformers. Initially we examine four vision transformer architectures namely ViT, LeViT, DeiT, and BEIT on out-of-distribution data. As the bidirectional encoder representation from image transformers (BEIT) architecture performs best, we use it in further experiments on three benchmarks PACS, Home-Office and DomainNet. Our results show significant improvements in validation and test accuracy and our implementation significantly overcomes gaps between within-distribution and OOD data.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Domain generalisation, vision transformers, benchmarking
Subjects:Computer Science > Artificial intelligence
Computer Science > Machine learning
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > INSIGHT Centre for Data Analytics
Copyright Information:© 2023 The Authors.
Funders:Science Foundation Ireland
ID Code:28772
Deposited On:18 Aug 2023 16:51 by Alan Smeaton . Last Modified 26 Jan 2024 15:34
Documents

Full text available as:

[thumbnail of zsbyfjddvyfqyvyzgyhmqxjpwtfrxcsh.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial 4.0
83kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record