PathGAN: visual scanpath prediction with generative adversarial networks

Assens, Marc; Giró-i-Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel E.

Assens, Marc, Giró-i-Nieto, Xavier, McGuinness, Kevin ORCID: 0000-0003-1336-6477 and O'Connor, Noel E. ORCID: 0000-0002-4033-9135 (2019) PathGAN: visual scanpath prediction with generative adversarial networks. In: ECCV Workshop on Egocentric Perception, Interaction and Computing (EPIC), 9 Sept 2018, Munich, Germany. ISBN 978-3-030-11020-8

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its gaze. PathGAN is composed of two parts, the generator and the discriminator. Both parts extract features from images using off-the-shelf networks, and train recurrent layers to generate or discriminate scanpaths accordingly. In scanpath prediction, the stochastic nature of the data makes it very difficult to generate realistic predictions using supervised learning strategies, but we adopt adversarial training as a suitable alternative. Our experiments prove how PathGAN improves the state of the art of visual scanpath prediction on the iSUN and Salient360! datasets.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Workshop
Refereed:	Yes
Subjects:	Computer Science > Artificial intelligence Computer Science > Image processing Computer Science > Machine learning
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Institutes and Centres > INSIGHT Centre for Data Analytics
Published in:	Leal-Taixé, Laura and Roth, Stefan, (eds.) Computer Vision – ECCV 2018 Workshops Proceedings Part 5. Lecture Notes in Computer Science 11133. Springer. ISBN 978-3-030-11020-8
Publisher:	Springer
Official URL:	http://dx.doi.org/10.1007/978-3-030-11021-5_25
Copyright Information:	© 2019 Springer
Funders:	Science Foundation Ireland 15/SIRG/3283, Insight
ID Code:	22730
Deposited On:	18 Oct 2018 09:35 by Kevin Mcguinness . Last Modified 11 Feb 2019 15:45

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3MB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

PathGAN: visual scanpath prediction with generative adversarial networks

Downloads