IIUM Repository

SMaTTS: standard malay text to speech system

Khalifa, Othman Omran and Ahmad, Zakiah Hanim and Gunawan, Teddy Surya (2007) SMaTTS: standard malay text to speech system. International Journal of Electrical and Computer Engineering, 4 (2). pp. 285-293. ISSN 0974-2190

[img]
Preview
PDF
Download (507kB) | Preview

Abstract

This paper presents a rule-based text- to- speech (TTS) Synthesis System for Standard Malay, namely SMaTTS. The proposed system using sinusoidal method and some pre- recorded wave files in generating speech for the system. The use of phone database significantly decreases the amount of computer memory space used, thus making the system very light and embeddable. The overall system was comprised of two phases the Natural Language Processing (NLP) that consisted of the high-level processing of text analysis, phonetic analysis, text normalization and morphophonemic module. The module was designed specially for SM to overcome few problems in defining the rules for SM orthography system before it can be passed to the DSP module. The second phase is the Digital Signal Processing (DSP) which operated on the low-level process of the speech waveform generation. A developed an intelligible and adequately natural sounding formant-based speech synthesis system with a light and user-friendly Graphical User Interface (GUI) is introduced. A Standard Malay Language (SM) phoneme set and an inclusive set of phone database have been constructed carefully for this phone-based speech synthesizer. By applying the generative phonology, a comprehensive letter-to-sound (LTS) rules and a pronunciation lexicon have been invented for SMaTTS. As for the evaluation tests, a set of Diagnostic Rhyme Test (DRT) word list was compiled and several experiments have been performed to evaluate the quality of the synthesized speech by analyzing the Mean Opinion Score (MOS) obtained. The overall performance of the system as well as the room for improvements was thoroughly discussed.

Item Type: Article (Journal)
Additional Information: 4119/23635
Uncontrolled Keywords: Natural Language Processing, Text-To-Speech (TTS), Diphone, source filter, low-/ high- level synthesis
Subjects: P Language and Literature > PA Classical philology
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Engineering > Department of Electrical and Computer Engineering
Depositing User: Prof. Dr Othman O. Khalifa
Date Deposited: 24 Apr 2012 13:50
Last Modified: 24 Apr 2012 13:50
URI: http://irep.iium.edu.my/id/eprint/23635

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year