Andrearczyk, Vincent (2017) Deep learning for texture and dynamic texture analysis. PhD thesis, Dublin City University.
Abstract
Texture is a fundamental visual cue in computer vision which provides useful information about image regions. Dynamic Texture (DT) extends the analysis of texture to sequences of moving scenes. Classic approaches to texture and DT analysis are based on shallow hand-crafted descriptors including local binary patterns and filter banks. Deep learning and in particular Convolutional Neural Networks (CNNs) have significantly contributed to the field of computer vision in the last decade. These biologically inspired networks trained with powerful algorithms have largely improved the state of the art in various tasks such as digit, object and face recognition. This thesis explores the use of CNNs in texture and DT analysis, replacing classic hand-crafted filters by deep trainable filters. An introduction to deep learning is provided in the thesis as well as a thorough review of texture and DT analysis methods. While CNNs present interesting features for the analysis of textures such as a dense extraction of filter responses trained end to end, the deepest layers used in the decision rules commonly learn to detect large shapes and image layout instead of local texture patterns. A CNN architecture is therefore adapted to textures by using an orderless pooling of intermediate layers to discard the overall shape analysis, resulting in a reduced computational cost and improved accuracy. An application to biomedical texture images is proposed in which large tissue images are tiled and combined in a recognition scheme. An approach is also proposed for DT recognition using the developed CNNs on three orthogonal planes to combine spatial and temporal analysis. Finally, a fully convolutional network is adapted to texture segmentation based on the same idea of discarding the overall shape and by combining local shallow features with larger and deeper features.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | November 2017 |
Refereed: | No |
Supervisor(s): | Whelan, Paul F. |
Uncontrolled Keywords: | Deep Learning; computer vision; biomedical image analysis |
Subjects: | Engineering > Imaging systems Computer Science > Artificial intelligence Computer Science > Image processing Engineering > Biomedical engineering |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License |
Funders: | Vision Systems Group |
ID Code: | 22040 |
Deposited On: | 10 Nov 2017 15:09 by Paul Whelan . Last Modified 04 Dec 2019 13:37 |
Documents
Full text available as:
Preview |
PDF (PhD Thesis)
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
11MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record