Search, as a well-known information retrieval strategy, is widely researched and developed for academic and commercial use. However, with ever-increasing amounts of multimedia data, search alone cannot satisfy user requirements for exploring multimedia resources. Preprocessing of multimedia resources is therefore necessary to identify potentially related documents, reducing retrieval time and improving browsing efficiency. Connecting relevant resources with hyperlinks is a widely used approach for multimedia collections. However, hyperlinks are usually defined on the basis of textual information; for example, hyperlinks in Wikipedia connect a term to relevant webpages. By contrast, content-based multimedia retrieval makes it possible to analyse multimedia materials based on their actual content. The availability of these technologies for multimedia search motivates further investigation of content-based hyperlinking for multimedia collections.
This thesis addresses the novel task of automatically creating hyperlinks within TV data collections for content-based browsing and navigation. Hyperlinks are created between video segments determined to be related based on their multimodal features.
First, we detail methodologies for identifying potentially relevant segments across the TV collection based on automatically detected spoken information, and we examine which of these approaches segment video streams most effectively.
Next, we incorporate both low-level and high-level visual features to improve hyperlinking quality. We detail the implementation of data fusion schemes to combine multimodal features.
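To illustrate the general idea of combining multimodal features, the following is a minimal sketch of weighted late fusion, one common data fusion scheme. The modality names, scores, and weights are hypothetical and are not the thesis's actual configuration:

```python
def late_fusion(scores, weights):
    """Combine per-modality relevance scores for a candidate segment
    into a single ranking score via a weighted sum (late fusion).

    scores  -- dict mapping modality name to a normalised score in [0, 1]
    weights -- dict mapping modality name to its fusion weight
    """
    total = sum(weights.values())
    return sum(weights[m] * scores[m] for m in scores) / total

# Hypothetical scores for one candidate segment: spoken transcript
# similarity plus low-level and high-level visual similarity.
segment_scores = {"transcript": 0.8, "visual_low": 0.4, "visual_high": 0.6}
fusion_weights = {"transcript": 0.5, "visual_low": 0.2, "visual_high": 0.3}

combined = late_fusion(segment_scores, fusion_weights)
print(combined)  # → 0.66
```

In a hyperlinking setting, candidate target segments would be ranked by such a combined score; the weights themselves are typically tuned on held-out data.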
Finally, we propose a novel hyperlinking framework combining query enrichment, spoken data analysis, and multimodal fusion. Experiments, evaluated through a crowdsourcing study, demonstrate the effectiveness of this framework in terms of user satisfaction.