Multimodal segmentation of lifelog data

Doherty, Aiden R. ORCID: 0000-0003-4395-7702, Smeaton, Alan F. ORCID: 0000-0003-1028-8389, Lee, Keansub and Ellis, Daniel P.W. (2007) Multimodal segmentation of lifelog data. In: RIAO 2007 - Large-Scale Semantic Access to Content (Text, Image, Video and Sound), 30 May - 1 June 2007, Pittsburgh, PA, USA.

Abstract

A personal lifelog of visual and audio information can be very helpful as a human memory augmentation tool. The SenseCam, a passive wearable camera, used in conjunction with an iRiver MP3 audio recorder, captures over 20,000 images and 100 hours of audio per week. With constant use, this quickly builds up to a substantial collection of personal data. To gain real value from this collection, it is important to automatically segment the data into meaningful units or activities. This paper investigates the optimal combination of data sources for segmenting personal data into such activities. Five data sources were logged and processed to segment a collection of personal data, namely: image processing on captured SenseCam images; audio processing on captured iRiver audio data; and processing of the temperature, white light level, and accelerometer sensors onboard the SenseCam device. The results indicate that a combination of the image, light and accelerometer sensor data segments our collection of personal data better than a combination of all five data sources. The accelerometer sensor is good at detecting when the user moves to a new location, while the image and light sensors are good at detecting changes in wearer activity within the same location, as well as detecting when the wearer interacts socially with others.
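
The abstract does not say how the data sources are combined, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes each source has already been reduced to a per-timestep "change score", rescales each source with min-max normalisation, fuses them with a weighted mean, and flags a boundary wherever the fused score exceeds the mean plus k standard deviations. The function names, the weights, the threshold rule and the sample values are all hypothetical.

```python
from statistics import mean, pstdev


def min_max_normalise(scores):
    """Rescale one source's change scores to the range [0, 1]."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # A flat signal carries no evidence of change.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def fuse_sources(source_scores, weights=None):
    """Weighted mean of the normalised scores (assumed fusion rule).

    source_scores maps a source name to a list of change scores,
    one per timestep; all lists must have the same length.
    """
    names = list(source_scores)
    if weights is None:
        weights = {name: 1.0 for name in names}
    normalised = {n: min_max_normalise(source_scores[n]) for n in names}
    length = len(normalised[names[0]])
    total_weight = sum(weights[n] for n in names)
    return [
        sum(weights[n] * normalised[n][i] for n in names) / total_weight
        for i in range(length)
    ]


def detect_boundaries(fused, k=1.0):
    """Indices whose fused score exceeds mean + k * std (assumed rule)."""
    threshold = mean(fused) + k * pstdev(fused)
    return [i for i, score in enumerate(fused) if score > threshold]


# Made-up change scores for the three sources the abstract reports
# as the best-performing combination: image, light and accelerometer.
sample_scores = {
    "image": [0.1, 0.2, 0.1, 0.9, 0.2, 0.1],
    "light": [5, 6, 5, 40, 6, 5],
    "accelerometer": [0.0, 0.1, 0.0, 0.8, 0.1, 0.0],
}
fused = fuse_sources(sample_scores)
boundaries = detect_boundaries(fused)
print(boundaries)  # [3]
```

Normalising before fusing keeps a source with large raw values, such as the light level here, from dominating the others. Dropping a source, as the abstract reports doing with audio and temperature, amounts to leaving it out of the input dictionary.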

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Subjects:	Computer Science > Lifelog; Computer Science > Information storage and retrieval systems
DCU Faculties and Centres:	Research Institutes and Centres > Centre for Digital Video Processing (CDVP); Research Institutes and Centres > Adaptive Information Cluster (AIC)
Publisher:	CID Paris
Official URL:	http://riao.free.fr/index.htm
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders:	Microsoft Research, Irish Research Council for Science Engineering and Technology, Science Foundation Ireland, SFI 03/IN.3/I361, European Commission FP6-027026
ID Code:	359
Deposited On:	19 Mar 2008 by DORAS Administrator. Last Modified 04 Oct 2018 11:31

Documents

Full text available as:

PDF (757kB) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
