IIUM Repository

Speech recognition system using MATLAB : design, implementation, and samples codes

Abushariah, Ahmad A. M. and Gunawan, Teddy Surya (2011) Speech recognition system using MATLAB : design, implementation, and samples codes. Lambert Academic Publishing, Saarbrucken, Germany. ISBN 978-3-8465-0376-8

This is the latest version of this item.

[img]
Preview
PDF (Speech Recognition System Using MATLAB) - Published Version
Download (196kB) | Preview

Abstract

Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. This project aims to develop automated English digits speech recognition system. The project relies heavily on the well known and widely used statistical method in characterizing the speech pattern, the Hidden Markov Model (HMM), which provides a highly reliable way for recognizing speech. This project discusses the theory of HMM and then extends the ideas to the development and implementation by applying this method in computational speech recognition. Basically, the system is able to recognize the spoken utterances by translating the speech waveform into a set of feature vectors using Mel Frequency Cepstral Coefficients (MFCC) technique, which then estimates the observation likelihood by using the Forward algorithm. The HMM parameters are estimated by applying the Baum Welch algorithm on previously trained samples. The most likely sequence is then decoded using Viterbi algorithm, thus producing the recognized word. This project focuses on all English digits from (Zero through Nine), which is based on isolated words structure. Two modules were developed, namely the isolated words speech recognition and the continuous speech recognition. Both modules were tested in both clean and noisy environments and showed relatively successful recognition rates. In clean environment and isolated words speech recognition module, the multi-speaker mode achieved 99.5% whereas the speaker-independent mode achieved 79.5%. In clean environment and continuous speech recognition module, the multi-speaker mode achieved 70% whereas the speaker-independent mode achieved 55%. However in noisy environment and isolated words speech recognition module, the multi-speaker mode achieved 88% whereas the speaker-independent mode achieved 67%. In noisy environment and continuous speech recognition module, the multi-speaker mode achieved 92.5% whereas the speaker-independent mode achieved 75%. These recognition rates are relatively successful if compared to similar systems.

Item Type: Book
Additional Information: 5588/27200
Uncontrolled Keywords: Speech recognition system, MATLAB
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK5101 Telecommunication. Including telegraphy, radio, radar, television
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Engineering > Department of Electrical and Computer Engineering
Depositing User: Dr Teddy Surya Gunawan
Date Deposited: 12 Dec 2012 16:46
Last Modified: 06 Feb 2013 10:29
URI: http://irep.iium.edu.my/id/eprint/27200

Available Versions of this Item

  • Speech recognition system using MATLAB : design, implementation, and samples codes. (deposited 12 Dec 2012 16:46) [Currently Displayed]

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year