Dublin City University participated, in a consortium with colleagues from NUI Galway and Universitat Politècnica de Catalunya, in two tasks at TRECVid 2016: Instance Search (INS) and Video to Text (VTT). For the INS task we developed a framework combining face detection and representation with place detection and representation, followed by user annotation of the top-ranked videos. For the VTT task we ran 1,000 concept detectors from the
VGG-16 deep CNN on 10 keyframes per video and submitted four runs for caption re-ranking, based on BM25, fusion, word2vec, and a fusion of the baseline BM25 and word2vec runs. With the same pre-processing, for caption generation we used NeuralTalk2, an open-source image-to-caption CNN-RNN toolkit, to generate a caption for each keyframe, and then combined them.
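The INS framework combines face and place evidence before presenting top-ranked videos for user annotation. The sketch below shows one plausible form of such late fusion: a weighted sum of per-video face-match and place-match scores followed by a top-k cut for manual review. The weighted-sum scheme, the weights `w_face`/`w_place`, and the toy scores are illustrative assumptions, not the actual DCU configuration.

```python
# Minimal late-fusion sketch for instance search: combine per-video
# face-match and place-match scores, then keep the top-ranked videos
# for user annotation.  Weights and scores are illustrative only.

def fuse_scores(face_scores, place_scores, w_face=0.6, w_place=0.4):
    """Weighted sum of normalized face and place scores per video id."""
    videos = set(face_scores) | set(place_scores)
    return {v: w_face * face_scores.get(v, 0.0)
               + w_place * place_scores.get(v, 0.0)
            for v in videos}

def top_k(fused, k=3):
    """Return the k highest-scoring video ids for manual annotation."""
    return sorted(fused, key=fused.get, reverse=True)[:k]

face = {"vid1": 0.9, "vid2": 0.2, "vid3": 0.5}
place = {"vid1": 0.4, "vid2": 0.8, "vid4": 0.7}
fused = fuse_scores(face, place)
print(top_k(fused, k=2))  # → ['vid1', 'vid2']
```

A video missing from one modality simply contributes zero for that modality, so the fusion degrades gracefully when only a face or only a place match is available.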
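For the BM25-based re-ranking runs, one natural formulation is to treat the concept labels detected in a video's keyframes as the query and each candidate caption as a document to be scored. The following self-contained sketch implements standard Okapi BM25 under that assumption; the parameters `k1` and `b` and the toy concepts/captions are illustrative, not the submitted configuration.

```python
import math

# BM25 sketch for caption re-ranking: the query is the set of concept
# labels detected in a video's keyframes; each candidate caption is a
# "document" scored against it.  Parameters and data are illustrative.

def bm25_scores(query_terms, captions, k1=1.5, b=0.75):
    docs = [c.lower().split() for c in captions]
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # document frequency of each query term across the captions
    df = {t: sum(1 for d in docs if t in d) for t in query_terms}
    scores = []
    for d in docs:
        s = 0.0
        for t in query_terms:
            tf = d.count(t)
            if tf == 0:
                continue
            idf = math.log(1 + (n - df[t] + 0.5) / (df[t] + 0.5))
            s += idf * tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores

concepts = ["dog", "grass", "outdoor"]
captions = [
    "a dog runs on the grass",
    "a man is cooking in a kitchen",
    "children play outdoor on grass",
]
scores = bm25_scores(concepts, captions)
ranked = [c for _, c in sorted(zip(scores, captions), reverse=True)]
print(ranked[-1])  # the unrelated kitchen caption ranks last
```

A fusion run in the same spirit could then combine these BM25 scores with word2vec-style embedding similarities between concepts and caption words, e.g. by a weighted sum after score normalization.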