Aligning large language models (LLMs) using curriculum learning in multilingual settings in education do-main

Dulloo, Sushane; Shah, Rajiv Ratn (Advisor)

dc.contributor.author	Dulloo, Sushane
dc.contributor.author	Shah, Rajiv Ratn (Advisor)
dc.date.accessioned	2026-06-17T07:30:56Z
dc.date.available	2026-06-17T07:30:56Z
dc.date.issued	2024-11-27
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/1987
dc.description.abstract	Large Language Models (LLMs) have revolutionized natural language processing (NLP) with exceptional capabilities in reasoning and computational tasks, enabled by extensive pretraining on large datasets dominated by high-resource languages such as English and French. However, this language-specific bias significantly limits their generalizability to low-resource languages like Hindi and Bengali, which lack sufficient digital corpora and contextual representation. Conse- quently, these models struggle with scientific reasoning tasks in low-resource languages. Despite advancements in multilingual models like mBERT and XLM-R, their performance in reasoning- intensive tasks remains inadequate for these underserved languages. Addressing this disparity necessitates effective cross-lingual transfer of reasoning capabilities, augmented by data enhance- ment techniques to simulate reasoning tasks in low-resource linguistic contexts. This research aims to evaluate the reasoning performance of LLMs in low-resource language settings like Hindi/Bengali etc, develop adaptive transfer strategies, and construct LLM agent frameworks with open/close sourced LLM models to better understand reasoning steps and iteratively refine them for improved accuracy.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	Scientific Reasoning	en_US
dc.subject	Multilingual Reasoning	en_US
dc.subject	Agent Framework	en_US
dc.title	Aligning large language models (LLMs) using curriculum learning in multilingual settings in education do-main	en_US
dc.type	Other	en_US