DORAS | DCU Research Repository

Unified Multimedia Segmentation - A Comprehensive Model for URI-based Media Segment Representation

Willi, Jan (ORCID: 0009-0000-6584-3744), Bernstein, Abraham (ORCID: 0000-0002-0128-4602) and Rossetto, Luca (ORCID: 0000-0002-5389-9465) (2024) Unified Multimedia Segmentation - A Comprehensive Model for URI-based Media Segment Representation. Transactions on Graph Data and Knowledge, 2 (3). 1:1-1:34. ISSN 2942-7517

Abstract
In multimedia annotation, referencing specific segments of a document is often desired due to its richness and multimodality, but no universal representation for such references exists. This significantly hampers the usage of multimedia content in knowledge graphs, as it is modeled as one large atomic information container. Unstructured data – such as text, audio, images, and video – can commonly be decomposed into its constituent parts, as such documents rarely contain only one semantic concept. Hence, it is reasonable to assume that these advances will make it possible to decompose these previously atomic components into logical segments. To be processable by the knowledge graph stack, however, one needs to break the atomic nature of multimedia content, providing a mechanism to address media segments. This paper proposes a Unified Segmentation Model capable of depicting arbitrary segmentations on any media document type. The work begins with a formal analysis of multimedia and segmentation, exploring segmentation operations and how to describe them. Building on this analysis, it then develops a practical scheme for expressing segmentation in Uniform Resource Identifiers (URIs). Because this approach makes segments of multimedia content referenceable, it breaks their atomic nature and makes them first-class citizens within knowledge graphs. The proposed model is implemented as a proof of concept in the MediaGraph Store, a multimedia knowledge graph storage and querying engine.
Metadata
Item Type: Article (Published)
Refereed: Yes
Uncontrolled Keywords: Multimodal Knowledge Graphs, Multimedia Segmentation, Multimedia Representation
Subjects: Computer Science > Information technology; Computer Science > Multimedia systems; Computer Science > World Wide Web
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Publisher: Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik GmbH
Official URL: https://doi.org/10.4230/TGDK.2.3.1
Copyright Information: Authors
ID Code: 30707
Deposited On: 28 Jan 2025 14:58 by Luca Rossetto. Last Modified: 28 Jan 2025 14:58
Documents

Full text available as:

PDF: TGDK.2.3.1.pdf
Creative Commons: Attribution 4.0
3MB