Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Dublin City University and partners’ participation in the INS and VTT tracks at TRECVid 2016

Marsden, Mark, Mohedano, Eva, McGuinness, Kevin orcid logoORCID: 0000-0003-1336-6477, Calafell, Andrea, Giró-i-Nieto, Xavier orcid logoORCID: 0000-0002-9935-5332, O'Connor, Noel E. orcid logoORCID: 0000-0002-4033-9135, Zhou, Jiang orcid logoORCID: 0000-0002-3067-8512, Azevedo, Lucas, Daudert, Tobias, Davis, Brian orcid logoORCID: 0000-0002-5759-2655, Hurlimann, Manuela, Afli, Haithem orcid logoORCID: 0000-0002-7449-4707, Du, Jinhua, Ganguly, Debasis orcid logoORCID: 0000-0003-0050-7138, Li, Wei B. orcid logoORCID: 0000-0001-7347-3501, Way, Andy orcid logoORCID: 0000-0001-5736-5930 and Smeaton, Alan F. orcid logoORCID: 0000-0003-1028-8389 (2016) Dublin City University and partners’ participation in the INS and VTT tracks at TRECVid 2016. In: TRECVid Conference, 14-16 Nov 2016, Gaithersburg, Md., USA.

Abstract
Dublin City University participated with a consortium of colleagues from NUI Galway and Universitat Politecnica de Catalunya in two tasks in TRECVid 2016, Instance Search (INS) and Video to Text (VTT). For the INS task we developed a framework consisting of face detection and representation and place detection and representation, with a user annotation of top-ranked videos. For the VTT task we ran 1,000 concept detectors from the VGG-16 deep CNN on 10 keyframes per video and submitted 4 runs for caption re-ranking, based on BM25, Fusion, word2vec and a fusion of baseline BM25 and word2vec. With the same pre-processing for caption generation we used an open source image-to-caption CNN-RNN toolkit NeuralTalk2 to generate a caption for each keyframe and combine them.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:No
Uncontrolled Keywords:Semantic Concept; Video Captions
Subjects:Engineering > Imaging systems
Computer Science > Computational linguistics
Computer Science > Machine learning
Computer Science > Artificial intelligence
Computer Science > Multimedia systems
Computer Science > Image processing
Computer Science > Digital video
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering
Research Institutes and Centres > INSIGHT Centre for Data Analytics
Research Institutes and Centres > ADAPT
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License
Funders:Science Foundation Ireland SFI/12/RC/2289 (Insight Centre), Science Foundation Ireland SFI/13/RC/2106 (ADAPT Centre)
ID Code:21484
Deposited On:01 Dec 2016 15:24 by Alan Smeaton . Last Modified 06 Jul 2023 15:11
Documents

Full text available as:

[thumbnail of TRECVid2016(7).pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record