Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Generative adversarial networks in computer vision: a survey and taxonomy

Wang, Zhengwei orcid logoORCID: 0000-0001-7706-553X, She, Qi and Ward, Tomás E. orcid logoORCID: 0000-0002-6173-6607 (2021) Generative adversarial networks in computer vision: a survey and taxonomy. ACM Computing Surveys, 54 (2). ISSN 0360-0300

Abstract
Generative adversarial networks (GANs) have been extensively studied in the past few years. Arguably their most significant impact has been in the area of computer vision where great advances have been made in challenges such as plausible image generation, image-to-image translation, facial attribute manipulation, and similar domains. Despite the significant successes achieved to date, applying GANs to real-world problems still poses significant challenges, three of which we focus on here. These are as follows: (1) the generation of high quality images, (2) diversity of image generation, and (3) stabilizing training. Focusing on the degree to which popular GAN technologies have made progress against these challenges, we provide a detailed review of the state-of-the-art in GAN-related research in the published scientific literature. We further structure this review through a convenient taxonomy we have adopted based on variations in GAN architectures and loss functions. While several reviews for GANs have been presented to date, none have considered the status of this field based on their progress toward addressing practical challenges relevant to computer vision. Accordingly, we review and critically discuss the most popular architecture-variant, and loss-variant GANs, for tackling these challenges. Our objective is to provide an overview as well as a critical analysis of the status of GAN research in terms of relevant progress toward critical computer vision application requirements. As we do this we also discuss the most compelling applications in computer vision in which GANs have demonstrated considerable success along with some suggestions for future research directions. Codes related to the GAN-variants studied in this work is summarized on https://github.com/sheqi/GAN_Review.
Metadata
Item Type:Article (Published)
Refereed:Yes
Additional Information:Article number: 37
Uncontrolled Keywords:Reconstruction; Unsupervised learning; Neural networks; Generative adversarial networks; computer vision; architecture-variants; loss-variants; stabilizing training
Subjects:UNSPECIFIED
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > INSIGHT Centre for Data Analytics
Publisher:Association for Computing Machinery (ACM)
Official URL:https://dx.doi.org/10.1145/3439723
Copyright Information:© 2021 Association for Computing Machiner.
Funders:Insight Centre for Data Analytics which is supported by Science Foundation Ireland under Grant Number SFI/12/RC/2289_P2.
ID Code:27537
Deposited On:11 Aug 2022 16:20 by Thomas Murtagh . Last Modified 03 Feb 2023 16:26
Documents

Full text available as:

[thumbnail of 3439723.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial 3.0
5MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record