
DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU


Investigating multi-modal features in the design of a multi-media hyperlinking framework

Chen, Shu (2016) Investigating multi-modal features in the design of a multi-media hyperlinking framework. PhD thesis, Dublin City University.

Search is a well-established information retrieval strategy that has been widely researched and developed for academic and commercial use. However, as the volume of multimedia data grows, search alone cannot satisfy user requirements for exploring multimedia resources. Preprocessing of multimedia resources is therefore necessary to identify potentially related documents in advance, reducing retrieval time and improving browsing efficiency. Hyperlinks are widely used to connect relevant resources within multimedia collections, but they are usually defined from textual information. For example, hyperlinks in Wikipedia link a term to relevant webpages. By contrast, content-based multimedia retrieval makes it possible to analyse multimedia materials on the basis of their actual content. The availability of these technologies for multimedia search motivates further investigation of content-based hyperlinking for multimedia collections.

This thesis addresses the novel topic of automatically creating hyperlinks within TV data collections for content-based browsing and navigation. Hyperlinks are created between video segments determined to be related on the basis of their multimodal features. First, we detail methodologies for creating potentially relevant segments across the TV collection from automatically detected spoken information, and we report which of these approaches segment video streams more efficiently. Next, we incorporate both low-level and high-level visual features to improve hyperlinking quality, and we detail the implementation of data fusion schemes that combine multimodal features. Finally, we propose a novel hyperlinking framework that brings together query enrichment, spoken data analysis, and multimodal fusion. Experiments show that this framework is effective at satisfying users, a conclusion drawn from a crowdsourcing study.
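To make the idea of combining multimodal features concrete, the following is a minimal sketch of one common fusion scheme, weighted linear late fusion of per-segment similarity scores. It is an illustration only: the abstract does not state which fusion schemes the thesis implements, and the function names, the min-max normalisation step, and the `spoken_weight` parameter are all assumptions introduced here.

```python
from typing import Dict, List, Tuple


def min_max_normalise(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale scores to [0, 1]; a constant score set maps to 0.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {seg: 0.0 for seg in scores}
    return {seg: (s - lo) / (hi - lo) for seg, s in scores.items()}


def fuse_scores(
    spoken: Dict[str, float],
    visual: Dict[str, float],
    spoken_weight: float = 0.7,  # hypothetical default, not from the thesis
) -> List[Tuple[str, float]]:
    """Rank candidate target segments by a weighted sum of two modalities.

    `spoken` and `visual` map a candidate segment id to its similarity
    with the anchor segment in that modality. A segment missing from one
    modality contributes 0.0 for it.
    """
    spoken_n = min_max_normalise(spoken)
    visual_n = min_max_normalise(visual)
    candidates = set(spoken_n) | set(visual_n)
    fused = {
        seg: spoken_weight * spoken_n.get(seg, 0.0)
        + (1.0 - spoken_weight) * visual_n.get(seg, 0.0)
        for seg in candidates
    }
    # Highest fused score first; ties broken by segment id for determinism.
    return sorted(fused.items(), key=lambda kv: (-kv[1], kv[0]))
```

In a sketch like this, the top-ranked segments for each anchor would become its hyperlink targets, and the weight would normally be tuned on held-out relevance judgements rather than fixed by hand.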
Item Type: Thesis (PhD)
Date of Award: November 2016
Supervisor(s): O'Connor, Noel E. and Jones, Gareth J.F.
Uncontrolled Keywords: video hyperlinking
Subjects: Computer Science > Machine learning; Engineering > Electronic engineering; Computer Science > Multimedia systems; Computer Science > Information retrieval; Computer Science > Digital video; Computer Science > Image processing
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering; Research Institutes and Centres > CLARITY: The Centre for Sensor Web Technologies; Research Institutes and Centres > ADAPT
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License.
Funders: European Framework Programme 7, AXES (ICT-269980), Science Foundation Ireland
ID Code: 21321
Deposited On: 21 Nov 2016 11:57 by Gareth Jones. Last Modified 25 Oct 2018 09:20

Full text available as:

Thesis_SHU_CHEN.pdf (PDF)

