Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1469
Title: Teacher-student collaborative knowledge distillation
Authors: Dixit, Shantanu
Akhtar, Md. Shad (Advisor)
Keywords: Model Compression
Knowledge Distillation
Meta Knowledge Distillation
Policy Driven Knowledge Distillation.
Issue Date: 29-Nov-2023
Publisher: IIIT-Delhi
Abstract: Knowledge distillation is a technique that involves transferring knowledge from a larger teacher model to a smaller student model. The latest developments in meta-learning-based knowledge distillation emphasize the significance of fine-tuning the teacher models while taking into account the student’s need for better knowledge distillation. Nevertheless, current MetaKD methods frequently fail to provide incentives for the teacher model to improve itself. We introduce a meta-policy distillation technique aiming to foster both collaboration and competition during the fine-tuning of the teacher model within the meta-learning phase. Additionally, we put forth a curriculum learning framework tailored for the student model within a competitive setting. In this context, the student model endeavors to surpass the teacher model through self-training on a diverse range of tasks. We conduct extensive experiments on two NLU benchmarks GLUE and SuperGLUE [45,46] and validate our methodology’s effectiveness against various KD techniques.
URI: http://repository.iiitd.edu.in/xmlui/handle/123456789/1469
Appears in Collections:Year-2023

Files in This Item:
File Description SizeFormat 
BTP_Report_ShantanuDixit - Shantanu Dixit.pdf
  Restricted Access
948.14 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.