IIUM Repository

Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems

Abushariah, Mohammad Abd-Alrahman Mahmoud and Ainon, Raja Noor and Zainuddin, Roziati and Elshafei, Moustafa and Khalifa, Othman Omran (2010) Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems. In: 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), 10-13 May 2010, Kuala Lumpur, Malaysia.

[img] PDF (Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems) - Presentation
Restricted to Repository staff only

Download (1MB) | Request a copy

Abstract

This paper describes an efficient framework for designing and developing Arabic speaker-independent continuous automatic speech recognition systems based on a phonetically rich and balanced speech corpus. The speech corpus contains 415 sentences recorded by 42 (21 male and 21 female) Arabic native speakers from 11 Arab countries representing three major regions (Levant, Gulf, and Africa). The developed system is based on the Carnegie Mellon University (CMU) Sphinx tools. The Cambridge HTK tools were also used in some testing stages. The speech engine uses 3-emitting state Hidden Markov Models (HMM) for tri-phone based acoustic models. Based on experimental analysis of 4.07 hours of training speech data, the acoustic model used continuous observation's probability model of 16 Gaussian mixture distributions and the state distributions were tied to 400 senons. The language model contains both bi-grams and tri-grams. The system obtained 91.23% and 92.54% correct word recognition with and without diacritical marks respectively.

Item Type: Conference or Workshop Item (Full Paper)
Additional Information: 4119/6978 Print ISBN: 978-1-4244-7165-2 Digital Object Identifier: 10.1109/ISSPA.2010.5605554
Uncontrolled Keywords: Arabic Continuous Speech Recognition, Arabic speech corpus, Phonetically rich and balanced, Acoustic model, Statistical language model
Subjects: T Technology > T Technology (General)
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Engineering > Department of Electrical and Computer Engineering
Depositing User: Prof. Dr Othman O. Khalifa
Date Deposited: 14 Dec 2011 08:26
Last Modified: 14 Dec 2011 08:26
URI: http://irep.iium.edu.my/id/eprint/6978

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year