Voxento 4.0: A more flexible visualisation and control for
lifelogs
Alateeq, AhmedORCID: 0000-0001-7916-6393, Mark, RoantreeORCID: 0000-0002-1329-2570 and Gurrin, CathalORCID: 0000-0003-2903-3968
(2023)
Voxento 4.0: A more flexible visualisation and control for
lifelogs.
In: ICMR '23: International Conference on Multimedia Retrieval, 12 - 15 Jun 2023, Thessaloniki Greece.
ISBN 979-8-4007-0188-7
In this paper, we introduce Voxento 4.0 – an interactive voice-based
retrieval system for lifelogs which has been developed to participate
in the sixth Lifelog Search Challenge LSC’23, at ACM ICMR’23. Voxento has participated three times in the LSC editions and achieved
the rank of 4th in LSC21 and 5th in LSC22 respectively. In this
version, Voxento 4.0, we have focused on improving the previous
system’s interface, voice interaction and retrieval functionality. The
current version has implemented some processing and cleaning of
the dataset and employs the CLIP model to extract image features.
In addition, the system’s interface was redesigned for better visualisation of the elements and the images for effective interaction. This improvement in the interface will help to support voice interaction
in future work. The interface developments include logging voice
interaction and images displayed, submitted, selected and starred
to enhance user experience with the system. The voice interaction
part has also been enhanced in the workflow of the voice lifecycle
interaction and with additional voice commands.
Gurrin, Cathal and Jónsson, Björn Þór, (eds.)
Proceedings of the 6th Annual Workshop on Lifelog Search Challenge (LSC'21).
.
Association for Computing Machinery (ACM). ISBN 979-8-4007-0188-7