Colour technologies for content production and distribution of broadcast content

Blanch, Marc Gorriz

Blanch, Marc Gorriz (2023) Colour technologies for content production and distribution of broadcast content. PhD thesis, Dublin City University.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

The requirement of colour reproduction has long been a priority driving the development of new colour imaging systems that maximise human perceptual plausibility. This thesis explores machine learning algorithms for colour processing to assist both content production and distribution. First, this research studies colourisation technologies with practical use cases in restoration and processing of archived content. The research targets practical deployable solutions, developing a cost-effective pipeline which integrates the activity of the producer into the processing workflow. In particular, a fully automatic image colourisation paradigm using Conditional GANs is proposed to improve content generalisation and colourfulness of existing baselines. Moreover, a more conservative solution is considered by providing references to guide the system towards more accurate colour predictions. A fast-end-to-end architecture is proposed to improve existing exemplar-based image colourisation methods while decreasing the complexity and runtime. Finally, the proposed image-based methods are integrated into a video colourisation pipeline. A general framework is proposed to reduce the generation of temporal flickering or propagation of errors when such methods are applied frame-to-frame. The proposed model is jointly trained to stabilise the input video and to cluster their frames with the aim of learning scene-specific modes. Second, this research explored colour processing technologies for content distribution with the aim to effectively deliver the processed content to the broad audience. In particular, video compression is tackled by introducing a novel methodology for chroma intra prediction based on attention models. Although the proposed architecture helped to gain control over the reference samples and better understand the prediction process, the complexity of the underlying neural network significantly increased the encoding and decoding time. Therefore, aiming at efficient deployment within the latest video coding standards, this work also focused on the simplification of the proposed architecture to obtain a more compact and explainable model.

Metadata

Item Type:	Thesis (PhD)
Date of Award:	March 2023
Refereed:	No
Supervisor(s):	O'Connor, Noel E. and Mrak, Marta
Subjects:	Computer Science > Image processing Computer Science > Multimedia systems Computer Science > Information storage and retrieval systems Computer Science > Digital video Computer Science > Video compression
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Institutes and Centres > INSIGHT Centre for Data Analytics Research Institutes and Centres > FUJO. Institute for Future Media, Democracy and Society
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License
Funders:	European Union’s Horizon 2020 Marie Sklodowska Curie JOLT project
ID Code:	27982
Deposited On:	31 Mar 2023 09:27 by Noel Edward O'connor . Last Modified 08 Dec 2023 15:38

Documents

Full text available as:

[thumbnail of PhD_Final_Thesis_Jan_2023.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0
25MB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Colour technologies for content production and distribution of broadcast content

Downloads