Please use this identifier to cite or link to this item:
http://repository.iiitd.edu.in/xmlui/handle/123456789/1413| Title: | A comparative analysis of large language models for code documentation generation |
| Authors: | Dvivedi, Shubhang Shekhar Pujari, Sai Leela Rahul Vijay, Vyshnav Lodh, Shoumik Kumar, Dhruv (Advisor) |
| Keywords: | Code Documentation Large Language Models Open Source LLMs |
| Issue Date: | 29-Nov-2023 |
| Publisher: | IIIT-Delhi |
| Abstract: | This paper presents a comprehensive comparative analysis of Large Language Models (LLMs) for code documentation generation. Code documentation is an essential part of the software writing process as it allows a new user to learn and build more code on top of the existing code base with relative ease. The paper evaluates models such as GPT-3.5, GPT-4, Bard, Llama2, and Starchat. Our evaluation employs a checklist-based system to minimize subjectivity, providing a more objective assessment. We find that, barring Starchat, all LLMs consistently outperform the original documentation. Notably, closed-source models GPT-3.5, GPT-4, and Bard exhibit superior performance across various parameters compared to open-source alternatives, namely LLama 2 and StarChat. Additionally, considering the time taken for generation, GPT-4 leads, followed by Llama2, Bard, with ChatGPT and Starchat exhibiting comparable generation times. This study contributes insights into the nuanced challenges of industry-level code documentation generation and establishes benchmarks for future research in this evolving domain. |
| URI: | http://repository.iiitd.edu.in/xmlui/handle/123456789/1413 |
| Appears in Collections: | Year-2023 |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| BTP report auto code documentation using LLMs - Shubhang Shekhar Dvivedi.pdf Restricted Access | 363.33 kB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.