Analyzing image-text relations for semantic media adaptation and personalization
Hughes, Mark, Salway, Andrew, Jones, Gareth J.F. (ORCID: 0000-0003-2923-8365) and O'Connor, Noel E. (ORCID: 0000-0002-4033-9135)
(2007)
Analyzing image-text relations for semantic media adaptation and personalization.
In: SMAP 2007 - Second International Workshop on Semantic Media Adaptation and Personalization, 17-18 December 2007, London, UK.
Progress in semantic media adaptation and personalisation requires that we know more about how different media types, such as texts and images, work together in multimedia communication. To this end, we present our ongoing investigation into image-text relations. Our idea is that the ways in which the meanings of images and texts relate in multimodal documents, such as web pages, can be classified on the basis of low-level media features, and that this classification should be an early processing step in systems targeting semantic multimedia analysis. In this paper, we present the first empirical evidence that humans can predict something about the main theme of a text from an accompanying image, and that this prediction can be emulated by a machine via analysis of low-level image features. We close by discussing how these findings could affect applications for news adaptation and personalisation, and how they may generalise to other kinds of multimodal documents and to applications for semantic media retrieval, browsing, adaptation and creation.