IIUM Repository (IREP)

Using language-based search in mining large software repositories

Awang Abu Bakar, Normi Sham (2011) Using language-based search in mining large software repositories. In: Pacific Association for Computational Linguistics (PACLING 2011), 19-21 July 2011, Kuala Lumpur.

PDF (Using Language-Based Search in Mining Large Software Repositories) - Published Version
Restricted to Repository staff only


Abstract

The language component plays an important role in data and information retrieval. Data retrieval in software engineering is often hindered by the difficulty of obtaining data from commercial software. The emergence of open source repositories has contributed tremendously to the collection of software data. This paper highlights a data retrieval method for mining software from a vast open source software repository, SourceForge. To automate data retrieval from the repository, a parser was written in the Python programming language, based on a pattern matching algorithm. The retrieved data were later used to estimate the quality of the open source software.
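
The abstract does not reproduce the parser itself, so the following is only a minimal sketch of what pattern-matching retrieval from a project page might look like, using Python's standard `re` module. The HTML fragment, the field names (`name`, `downloads`, `bugs_open`, `bugs_closed`) and the function `parse_project` are all assumptions made for illustration; they are not taken from the paper or from SourceForge's actual markup.

```python
import re

# Hypothetical page fragment; the real SourceForge markup is not given in the abstract.
SAMPLE_PAGE = """
<div class="project">
  <h1 class="name">ExampleProject</h1>
  <span class="downloads">12,345</span>
  <span class="bugs-open">17</span>
  <span class="bugs-closed">203</span>
</div>
"""

# One regular expression per field of interest (all field names are assumptions).
PATTERNS = {
    "name": re.compile(r'<h1 class="name">([^<]+)</h1>'),
    "downloads": re.compile(r'<span class="downloads">([\d,]+)</span>'),
    "bugs_open": re.compile(r'<span class="bugs-open">(\d+)</span>'),
    "bugs_closed": re.compile(r'<span class="bugs-closed">(\d+)</span>'),
}


def parse_project(html):
    """Return a dict with one entry per pattern; fields that do not match map to None."""
    record = {}
    for field, pattern in PATTERNS.items():
        match = pattern.search(html)
        if match is None:
            record[field] = None
            continue
        value = match.group(1).strip()
        if field != "name":
            # Numeric fields may contain thousands separators, e.g. "12,345".
            value = int(value.replace(",", ""))
        record[field] = value
    return record


record = parse_project(SAMPLE_PAGE)
print(record)
```

Keeping one compiled pattern per field means a change in the page layout only requires updating the affected expression, and a field that fails to match is reported as `None` rather than aborting the whole record.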

Item Type: Conference or Workshop Item (Full Paper)
Additional Information: 3509/8451 doi:10.1016/j.sbspro.2011.10.594
Uncontrolled Keywords: Data retrieval; Software repository; Language-based search; Automation; Software quality
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Kulliyyahs/Centres/Divisions/Institutes: Kulliyyah of Information and Communication Technology > Department of Computer Science
Depositing User: Dr. Normi Sham Awang Abu Bakar
Date Deposited: 20 Dec 2011 13:51
Last Modified: 20 Dec 2011 13:51
URI: http://irep.iium.edu.my/id/eprint/8451
