Nguyen-Ho, Thang-Long
ORCID: 0000-0003-1953-7679, Huynh, Viet-Tham
ORCID: 0000-0002-8537-1331, Tran, Ly-Duyen
ORCID: 0000-0002-9597-1832, Tran, Minh-Triet
ORCID: 0000-0003-3046-3041, Gurrin, Cathal
ORCID: 0000-0003-2903-3968 and Healy, Graham
ORCID: 0000-0001-6429-6339
(2026)
H-EAGLE: Hierarchical Extension of EAGLE for Multi-level Semantic Video Retrieval.
In: 32nd International Conference on Multimedia Modeling, MMM 2026, 29-31 Jan. 2026, Prague, Czech Republic.
ISBN 978-981-95-6962-5
Abstract
Modern Video Retrieval systems face challenges in computational efficiency and semantic depth when handling complex queries, particularly those with time-sensitive requirements. These systems typically rely on a “flat” index structure that encodes each frame independently, resulting in high search costs and difficulty capturing higher-level events or context semantics. To address these limitations, we propose a novel three-level hierarchical index concept that organizes video data at different semantic abstraction levels. The first level involves embedding vectors for individual frames to facilitate fine-grained retrieval. The second level groups visually similar frames into “shots” and encodes them into a semantic temporal representation. The top layer uses a Visual-Language Model (VLM) to identify and group frames related to narrative actions. This architecture allows the system to first quickly identify high-level related scenes or actions, and then refine the results by searching within individual frames within those groups. Our approach helps users to query data at the most relevant conceptual level.
Metadata
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Event Type: | Conference |
| Refereed: | Yes |
| Uncontrolled Keywords: | Video Retrieval; Temporal Retrieval; Retrieval System; Hierarchical Indexing |
| Subjects: | Engineering > Electronics Engineering > Engineering education Engineering > Electronic engineering |
| DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > ADAPT |
| Published in: | 32nd International Conference on Multimedia Modeling, MMM 2026, Prague, Czech Republic, January 29–31, 2026, Proceedings, Part IV}. 16415. Springer Nature Singapore. ISBN 978-981-95-6962-5 |
| Publisher: | Springer Nature Singapore |
| Official URL: | https://link.springer.com/chapter/10.1007/978-981-... |
| Copyright Information: | Authors |
| Funders: | 18/CRT/6223, 13/RC/2106_P2 |
| ID Code: | 32449 |
| Deposited On: | 23 Mar 2026 11:25 by Thang-Long Nguyen-Ho . Last Modified 23 Mar 2026 11:25 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0 1MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record