Development of U-Net architecture for audio super resolution

Gunawan, Teddy Surya and Mohd Sarif, Muhammad Rusydy and Kartiwi, Mira and Ahmad, Yasser Asrul (2023) Development of U-Net architecture for audio super resolution. In: 2023 9th International Conference on Computer and Communication Engineering (ICCCE), Kuala Lumpur, Malaysia.

PDF (Full Paper) - Published Version
Restricted to Registered users only
Download (684kB) | Request a copy

Preview

PDF (Scopus) - Supplemental Material
Download (123kB) | Preview

Official URL: https://ieeexplore.ieee.org/abstract/document/1024...

Abstract

Audio processing is used in a wide range of applications, including telecommunications and music streaming. Audio quality degradation during transmission and processing is a common issue in these fields, often caused by bandwidth constraints and the use of subpar equipment. This problem is exacerbated when the task requires converting low-quality audio input to high-resolution output, which is difficult for deep neural networks to do. To improve audio superresolution, this paper proposes a novel solution to this problem by embedding a U-Net-based architecture model within deep neural networks. Over 100 iterations, the U-Net architecture was trained, with loss values and Mean Squared Error (MSE) monitored at each epoch. A diverse dataset of audio signals with varying Signal-to-Noise Ratio (SNR) values ranging from 1 dB to 30 dB was used. The model’s average SNR of 17.29 dB exceeds thresholds where listener detection of enhancements becomes difficult, demonstrating its ability to preserve subtle audio details. Furthermore, the Log-Spectral Distance (LSD) values revealed a mere 1.41 dB difference between the actual and reconstructed spectrograms, indicating that the model can recover lost information during upsampling. This research suggests a promising method for improving audio quality, particularly when bandwidth constraints or insufficient equipment prevent high-resolution audio transmission and processing.

Item Type:	Proceeding Paper (Plenary Papers)
Uncontrolled Keywords:	Low-quality audio, audio super-resolution, audio quality enhancement, U-Net architecture
Subjects:	T Technology > TK Electrical engineering. Electronics Nuclear engineering T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK5101 Telecommunication. Including telegraphy, radio, radar, television T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button):	Kulliyyah of Engineering Kulliyyah of Engineering > Department of Electrical and Computer Engineering Kulliyyah of Information and Communication Technology Kulliyyah of Information and Communication Technology Kulliyyah of Information and Communication Technology > Department of Information System Kulliyyah of Information and Communication Technology > Department of Information System
Depositing User:	Dr Ir Yasser Asrul Ahmad
Date Deposited:	27 Sep 2023 11:35
Last Modified:	26 Oct 2023 12:16
URI:	http://irep.iium.edu.my/id/eprint/107100

Actions (login required)

View Item

Download Statistics

Downloads

Downloads per month over past year