IIUM Repository

Content extraction of historical Malay manuscripts based on Event Ontology Framework

Mohd Nor, Zahila and M. Khalid, Yanti Idaya Aspura and Abdullah, Noorhidawati (2021) Content extraction of historical Malay manuscripts based on Event Ontology Framework. Applied Ontology, 16 (3). 249-275. ISSN 1570-5838 E-ISSN 1875-8533

Abstract

This article explores how the content knowledge of historical Malay manuscripts can be represented by extracting event features with an event ontology framework. The manuscript used for testing is Sulalatus Salatin (Sejarah Melayu) by Abdul Ahmad Samad, published in the University of Malaya Digital Library database. To align with a domain-specific ontology, the Simple Event Model (SEM) is adopted and an event-based ontology for historical Malay manuscripts is designed. Events are extracted from the manuscript manually and mapped into the Protégé editor. Competency questions were constructed and submitted to the Protégé editor using SPARQL to check whether the ontology can provide answers and to examine its correctness. The event-based ontology model assists in discovering and representing the content knowledge of historical Malay manuscripts and supports the organisation of knowledge. All the main concepts were extracted from the selected Malay manuscript, and 17 concepts were used to develop the event-based ontology model. The knowledge was verified by three domain experts in Malay manuscripts. In the findings, the interrater reliability for Event and Actor instances is 84%, which means that 16% of the instances and their types are incorrect and need amendment. Interrater reliability is 95% for Place and 99% for Role, while the experts achieved 100% agreement for Time. In addition, the experts agreed that the concepts, properties and instances of the Malay Manuscript Ontology complied with the criteria of consistency, completeness, conciseness, expandability and ease of use. The development of an event-based model for an ontology-based system with a high level of semantic granularity reflects the varied cultural riches and intellectual aspects stored in Malay manuscripts. This will enable systematic research of the knowledge embedded in the manuscripts and make it widely and easily accessible to everyone.
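
The competency-question check described in the abstract can be illustrated with a small sketch. It is not the authors' ontology or their SPARQL queries: it is a plain-Python stand-in that stores a few triples and answers one question by pattern matching. The `sem:` terms (`sem:Event`, `sem:hasActor`, `sem:hasPlace`, `sem:hasTime`) come from the Simple Event Model vocabulary; every `ex:` identifier, the `match` helper, and the `actors_at` question are hypothetical names introduced only for illustration.

```python
# Hypothetical triples in (subject, predicate, object) form. The sem: terms
# follow the Simple Event Model vocabulary; every ex: name is a placeholder,
# not an instance taken from the manuscript or the published ontology.
TRIPLES = {
    ("ex:event1", "rdf:type", "sem:Event"),
    ("ex:event1", "sem:hasActor", "ex:actorA"),
    ("ex:event1", "sem:hasPlace", "ex:placeX"),
    ("ex:event1", "sem:hasTime", "ex:time1"),
    ("ex:event2", "rdf:type", "sem:Event"),
    ("ex:event2", "sem:hasActor", "ex:actorB"),
    ("ex:event2", "sem:hasPlace", "ex:placeX"),
}


def match(pattern, triples=TRIPLES):
    """Return the triples that fit a pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if all(p is None or p == v for p, v in zip(pattern, t))
    ]


def actors_at(place):
    """Competency question: which actors took part in events at `place`?"""
    events = {s for s, _, _ in match((None, "sem:hasPlace", place))}
    return sorted(
        o for e in events for _, _, o in match((e, "sem:hasActor", None))
    )


print(actors_at("ex:placeX"))
```

In a Protégé workflow the same question would be written as a SPARQL query over the ontology; the sketch only shows the shape of the check, namely that a question is answerable when the required Event, Actor and Place links are present.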

Item Type: Article (Journal)
Additional Information: 5131/90003
Uncontrolled Keywords: Content development, event ontology, Simple Event Model (SEM), Malay manuscript, digital library, cultural heritage, DBpedia
Subjects: Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Kulliyyahs/Centres/Divisions/Institutes: IIUM Library
Depositing User: Pn. Zahila Mohd. Nor
Date Deposited: 27 May 2021 09:57
Last Modified: 13 Aug 2021 08:19
URI: http://irep.iium.edu.my/id/eprint/90003
