Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/17
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorSurekha, Ashish
dc.contributor.authorSangeeta
dc.date.accessioned2012-03-14T10:36:50Z
dc.date.available2012-03-14T10:36:50Z
dc.date.issued2012-03-14T10:36:50Z
dc.identifier.urihttps://repository.iiitd.edu.in/jspui/handle/123456789/17
dc.description.abstractBug or Fault localization is a process of identifying the speci c location(s) or region(s) of source code (at various granularity levels such as the directory path, le, method or state- ment) that is faulty and needs to be modi ed to repair the defect. Bug localization is a routine task in software maintenance (corrective maintenance). Due to the increasing size and complexity of current software applications, automated solutions for bug localization can signi cantly reduce human e ort and software maintenance cost. We presented a technique (which falls into the class of static techniques for bug localiza- tion) for bug localization using a character N-gram based Information Retrieval (IR) model. We framed the problem of bug localization as a relevant document(s) search task for a given query and investigated the application of character-level N-gram based textual features de- rived from bug reports and source-code le attributes. We implemented the proposed IR model and evaluated its performance on dataset downloaded from two popular open-source projects (JBOSS and Apache). We conducted a series of experiments to validate our hypothesis and presented evidences to demonstrate that the proposed approach is e ective. The accuracy of the proposed ap- proach is measured in terms of the standard and commonly used SCORE and MAP (Mean Average Precision) metrics for the task of bug localization. Experimental results reveal that the median value for the SCORE metric for JBOSS and Apache dataset is 99.03% and 93.70% respectively. We observed that for 16.16% of the bug reports in the JBOSS dataset and for 10.67% of the bug reports in the Apache dataset, the average precision value (computed at all recall levels) is between 0.9 and 1.0.en_US
dc.language.isoen_USen_US
dc.subjectBug Localizationen_US
dc.subjectMining Software repositoriesen_US
dc.subjectInformation Retrievalen_US
dc.subjectAutomated Software engineeringen_US
dc.titleStatic technique for bug localization using character N-Gram based information retrieval modelen_US
dc.typeThesisen_US
Appears in Collections:Year-2011

Files in This Item:
File Description SizeFormat 
MTech-Thesis-Sangeeta-Lal-13-OCT-2011.pdf1.24 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.