IIUM Repository

Using language-based search in mining large software repositories

Awang Abu Bakar, Normi Sham (2011) Using language-based search in mining large software repositories. Procedia - Social and Behavioral Sciences, 27. pp. 160-168. ISSN 1877-0428

PDF (Using Language-Based Search in Mining Large Software Repositories) - Published Version
Restricted to Repository staff only

Abstract

The language component plays an important role in data and information retrieval. Data retrieval in software engineering is often hindered by the difficulty of obtaining data from commercial software. The emergence of open source repositories has contributed tremendously to the collection of software data. This paper highlights a data retrieval method for mining software data from a vast open source software repository, SourceForge. To automate data retrieval from the repository, a parser based on a pattern-matching algorithm was written in the Python programming language. The retrieved data were later used to estimate the quality of the open source software.
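
The abstract does not reproduce the parser itself, so the following is only a minimal sketch of what pattern-matching-based extraction in Python can look like, using the standard `re` module. The page fragment, the markup, the field names, and the `parse_project_page` helper are all hypothetical, invented for illustration; they are not SourceForge's actual page structure or the author's implementation.

```python
import re

# Hypothetical page fragment: the markup and class names are assumptions
# made for this sketch, not SourceForge's real HTML.
SAMPLE_PAGE = """
<h1 class="project-name">ExampleProject</h1>
<span class="downloads">12,345 downloads</span>
<span class="bugs-open">17 open bugs</span>
<span class="bugs-closed">203 closed bugs</span>
"""

# One compiled pattern per field; each captures the value in group 1.
FIELD_PATTERNS = {
    "name": re.compile(r'<h1 class="project-name">([^<]+)</h1>'),
    "downloads": re.compile(r'<span class="downloads">([\d,]+) downloads</span>'),
    "open_bugs": re.compile(r'<span class="bugs-open">(\d+) open bugs</span>'),
    "closed_bugs": re.compile(r'<span class="bugs-closed">(\d+) closed bugs</span>'),
}


def parse_project_page(html):
    """Return a dict of extracted fields; fields that do not match map to None."""
    record = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = pattern.search(html)
        if match is None:
            record[field] = None
            continue
        value = match.group(1).strip()
        if field != "name":
            # Numeric fields: drop thousands separators before converting.
            value = int(value.replace(",", ""))
        record[field] = value
    return record


record = parse_project_page(SAMPLE_PAGE)
print(record)
```

Mapping unmatched fields to `None` rather than raising an error is a design choice for this sketch: when crawling many project pages, some fields are likely to be absent, and a later quality-estimation step can decide how to treat missing values.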

Item Type: Article (Journal)
Additional Information: 3509/11831
Uncontrolled Keywords: Data retrieval; Software repository; Language-based search; Automation; Software quality
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Kulliyyahs/Centres/Divisions/Institutes: Kulliyyah of Information and Communication Technology > Department of Computer Science
Depositing User: Dr. Normi Sham Awang Abu Bakar
Date Deposited: 20 Dec 2011 13:59
Last Modified: 20 Dec 2011 13:59
URI: http://irep.iium.edu.my/id/eprint/11831
