Semi-supervised learning with generative adversarial networks for pathological speech classification

Trinh, Nam; O'Brien, Darragh

Trinh, Nam ORCID: 0000-0003-0307-3793 and O'Brien, Darragh (2020) Semi-supervised learning with generative adversarial networks for pathological speech classification. In: 31st Irish Signals and Systems Conference (ISSC2020), 11 - 12 June 2020, Letterkenny, Ireland (Virtual). ISBN 978-1-7281-9418-9

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

One application of deep learning in medical applications is the use of deep neural networks to classify human speech as healthy or pathological. In such applications, the audio signal is transformed into a spectrogram that captures its time-varying content and the latter “images” are fed into a classifier for classification. A challenge in applying this approach is the shortage of suitable speech data for training purposes. Labelled data acquisition requires significant human effort and/or time-consuming experiments. In this paper, we propose a semi-supervised learning approach that employs a Generative Adversarial Network (GAN) to alleviate the problem of insufficient training data. We compare the classification performance of a traditional classifier and our semi-supervised classifier. We observe that the GAN-based semi-supervised approach demonstrates a significant improvement in terms of accuracy and ROC curve when supplied an equivalent number of training samples.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Uncontrolled Keywords:	Generative Adversarial Network; pathological speech classification; semi-supervised learning
Subjects:	Computer Science > Algorithms Computer Science > Artificial intelligence Computer Science > Machine learning
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > ADAPT
Published in:	2020 31st Irish Signals and Systems Conference (ISSC). . IEEE. ISBN 978-1-7281-9418-9
Publisher:	IEEE
Official URL:	http://dx.doi.org/10.1109/ISSC49989.2020.9180211
Copyright Information:	© 2020 The Authors.
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:	Science Foundation Ireland Research Centres Programme (Grant 13/RC/2106), Science Foundation Ireland under grant No. 17/RC/PHD/3488., European Regional Development Fund
ID Code:	24499
Deposited On:	12 Jun 2020 14:50 by Nam Trinh . Last Modified 12 Oct 2020 15:25

Documents

Full text available as:

[thumbnail of ISSC2020_SSL_GAN_NamTrinh_DOB.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
990kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Semi-supervised learning with generative adversarial networks for pathological speech classification

Downloads