Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1413
Title: A comparative analysis of large language models for code documentation generation
Authors: Dvivedi, Shubhang Shekhar
Pujari, Sai Leela Rahul
Vijay, Vyshnav
Lodh, Shoumik
Kumar, Dhruv (Advisor)
Keywords: Code Documentation
Large Language Models
Open Source LLMs
Issue Date: 29-Nov-2023
Publisher: IIIT-Delhi
Abstract: This paper presents a comprehensive comparative analysis of Large Language Models (LLMs) for code documentation generation. Code documentation is an essential part of the software writing process as it allows a new user to learn and build more code on top of the existing code base with relative ease. The paper evaluates models such as GPT-3.5, GPT-4, Bard, Llama2, and Starchat. Our evaluation employs a checklist-based system to minimize subjectivity, providing a more objective assessment. We find that, barring Starchat, all LLMs consistently outperform the original documentation. Notably, closed-source models GPT-3.5, GPT-4, and Bard exhibit superior performance across various parameters compared to open-source alternatives, namely LLama 2 and StarChat. Additionally, considering the time taken for generation, GPT-4 leads, followed by Llama2, Bard, with ChatGPT and Starchat exhibiting comparable generation times. This study contributes insights into the nuanced challenges of industry-level code documentation generation and establishes benchmarks for future research in this evolving domain.
URI: http://repository.iiitd.edu.in/xmlui/handle/123456789/1413
Appears in Collections:Year-2023

Files in This Item:
File Description SizeFormat 
BTP report auto code documentation using LLMs - Shubhang Shekhar Dvivedi.pdf
  Restricted Access
363.33 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.