IIUM Repository

Issues in evaluating the retrieval performance of multiscript translation of Al-Quran

Othman, Roslina and Abdul Wahid, Fauziah (2011) Issues in evaluating the retrieval performance of multiscript translation of Al-Quran. In: 6th World Congress of Muslim Librarians and Information Scientists 2011 (WCOMLIS 2011), 16 - 17 November 2011, IIUM. (Unpublished)


Abstract

The main aim of this paper is to present the issues in evaluating the retrieval performance of multi-script indexing of translated texts of al-Quran. Translations of al-Quran have played a major role among the public, both in the recitation of al-Quran in its original text and in understanding it through the translated words. Even in querying, non-Arabic speakers find the texts through the translated words in addition to topical search. In Cross-Language Information Retrieval research, transliteration is normally needed when terminology is absent; in this research, however, the transliterated version was meant for those able to read the older script in its own original translation. The Malay Roman script has its own version of the translation. The objectives are to examine the reported retrieval performance of these texts and to evaluate the retrieval performance of the translations available in two different scripts of one language, Malay Rumi and Malay Jawi, built upon the Pimpinan ar-Rahman version, Indri and Jawi software. The measures are recall, precision and overlap. Recall describes performance in retrieving all relevant items, while precision describes performance in rejecting non-relevant items. Overlap captures the retrieval of items common to both sub-collections. Queries are constructed from questions posed by newspaper readers in both scripts and reduced to keywords with semantic meaning, while relevance judgment is made by a panel of experts based on answers to the questions. Findings based on recall, precision and overlap revealed major issues with standardized texts, translation and transliteration, text alignment, query construction, and question-answering relevance versus topical relevance. Indri's performance is not a major issue, while the Jawi software requires minor improvement.
This paper contributes to the discussion of issues, facing the Muslim world, in handling test collections that involve parallel corpora in Cross-Language IR.
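
The three measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the verse identifiers and result lists are invented for illustration, and because the abstract does not give a formula for overlap, the sketch assumes a Jaccard-style ratio (items retrieved from both sub-collections divided by items retrieved from either).

```python
def recall(retrieved, relevant):
    """Share of the relevant items that were retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(relevant) if relevant else 0.0


def precision(retrieved, relevant):
    """Share of the retrieved items that are relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(retrieved) if retrieved else 0.0


def overlap(retrieved_a, retrieved_b):
    """Assumed definition: items common to both result sets over their union."""
    a, b = set(retrieved_a), set(retrieved_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical data: verse IDs judged relevant for one query, and the
# results returned from the Rumi and Jawi sub-collections.
relevant = {"2:183", "2:184", "2:185", "2:187"}
rumi_hits = ["2:183", "2:184", "2:196"]
jawi_hits = ["2:183", "2:185", "2:187", "2:196"]

for name, hits in (("Rumi", rumi_hits), ("Jawi", jawi_hits)):
    print(name, "recall:", recall(hits, relevant),
          "precision:", precision(hits, relevant))
print("overlap:", overlap(rumi_hits, jawi_hits))
```

Using sets keeps each measure independent of ranking order; a ranked evaluation would need a different formulation.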

Item Type: Conference or Workshop Item (Full Paper)
Additional Information: 1675/7481
Uncontrolled Keywords: Retrieval performance, Al-Quran, Malay Rumi, Malay Jawi
Subjects: Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Kulliyyahs/Centres/Divisions/Institutes: Kulliyyah of Information and Communication Technology > Department of Library & Information Science
Depositing User: Prof Datin Dr Roslina Bt Othman
Date Deposited: 21 Nov 2011 11:36
Last Modified: 21 Nov 2011 21:06
URI: http://irep.iium.edu.my/id/eprint/7481
