Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

An action recognition framework for uncontrolled video capture based on a spatio-temporal video graph

Jargalsaikhan, Iveel (2017) An action recognition framework for uncontrolled video capture based on a spatio-temporal video graph. PhD thesis, Dublin City University.

Abstract
The task of automatic categorization and localization of human action in video sequences is valuable for a variety of applications such as detecting relevant activities in surveillance video, summarizing and indexing video sequences or organizing a digital video library according to the relevant actions. However it remains a challenging problem for computers to robustly recognize action due to cluttered backgrounds, camera motion, occlusion, view point changes and the geometric and photometric variances of objects. An important question in action recognition is how to efficiently and effectively represent a video scene while maintaining the discriminative appearance, motion and contextual cues of the scene. Recently, local feature-based action recognition methods have gained popularity due to their simplicity and the-state-of-the-performance with various benchmarking datasets. However, the existing feature representation schemes e.g, Bag-of-Features, Fisher and VLAD, ignore the the spatial and temporal cues in the local features e.g, the spatio-temporal location and relationship. Inspired by this fact, this thesis aims to overcome the underlying limitation of the feature representation by proposing a new way to construct graph structure that aims to capture the spatial and temporal relationship between the local features while maintaining discriminative power. The key contributions can be summarized as follows (i) comprehensive evaluation of the several key elements in the recognition pipeline (ii) novel video graph based human action recognition framework (iii) evaluation of the different techniques involved in the video graph construction process and (iv) extension of the proposed video graph based video analysis to the challenging problem of action localization.
Metadata
Item Type:Thesis (PhD)
Date of Award:November 2017
Refereed:No
Supervisor(s):O'Connor, Noel E. and Little, Suzanne
Uncontrolled Keywords:computer vision; action recognition
Subjects:Computer Science > Artificial intelligence
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering
Research Institutes and Centres > INSIGHT Centre for Data Analytics
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License
Funders:Science Foundation Ireland
ID Code:21816
Deposited On:13 Nov 2017 11:03 by Suzanne Little . Last Modified 08 Nov 2019 13:36
Documents

Full text available as:

[thumbnail of Iveel_Jargalsaikhan_DCU_Thesis.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
17MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record