IIUM Repository

Using the short-time fourier transform and ResNet to diagnose depression from speech data

Elfaki, Ayman and Asnawi, Ani Liza and Jusoh, Ahmad Zamani and Ismail, Ahmad Fadzil and Ibrahim, Siti Noorjannah and Mohamed Azmin, Nor Fadhillah and Nik Hashim, Nik Nur Wahidah (2021) Using the short-time fourier transform and ResNet to diagnose depression from speech data. In: 2021 IEEE International Conference on Computing (ICOCO 2021), 17-19 November 2021, Kuala Lumpur. (Unpublished)

[img] PDF (Programme schedule) - Supplemental Material
Restricted to Registered users only

Download (657kB) | Request a copy
[img] PDF (Unpublished paper)
Restricted to Repository staff only

Download (313kB) | Request a copy

Abstract

Depression is a common illness that is affecting many people nowadays, this is especially true now with the advent of the COVID-19 pandemic. It often arises when a person is having difficulty coping with stressful life events. It can occur throughout the lifespan of a person, and it pervades all aspects of our lives. Currently, depression diagnoses rely on patient interviews and self-report questionnaires, which depend heavily on the patient honesty and the subjective experience of the clinician. In this paper, we will begin with investigating the viability of using the Short-Time Fourier Transform (STFT) as a feature descriptor to objectively diagnose depression from speech data. The dataset used in this research is the Audio-Visual Emotion Challenging 2017 (AVEC2017). The model is based on a modified ResNet18 model architecture to perform a binary classification (i.e., depressed or non-depressed). The STFT is computed from the speech signal to generate a mel-spectrogram for training and testing the model. The experiment shows that relying solely on STFT as an input feature resulted in an F1 score of 74.71% in classifying depression.

Item Type: Conference or Workshop Item (Plenary Papers)
Additional Information: Online Conference
Uncontrolled Keywords: Depression, Speech, Deep Learning, Short-Time Fourier Transform
Subjects: T Technology > T Technology (General)
T Technology > TK Electrical engineering. Electronics Nuclear engineering
T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices
T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices > TK7885 Computer engineering
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Engineering
Kulliyyah of Engineering > Department of Electrical and Computer Engineering
Kulliyyah of Engineering > Department of Mechatronics Engineering
Depositing User: DR. Ani Liza Asnawi
Date Deposited: 05 Jan 2022 15:42
Last Modified: 09 Mar 2022 11:45
URI: http://irep.iium.edu.my/id/eprint/94895

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year