Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1440
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBansal, Subhanshu-
dc.contributor.authorAkhtar, Md. Shad (Advisor)-
dc.date.accessioned2024-05-11T11:59:12Z-
dc.date.available2024-05-11T11:59:12Z-
dc.date.issued2023-11-29-
dc.identifier.urihttp://repository.iiitd.edu.in/xmlui/handle/123456789/1440-
dc.description.abstractIn the contemporary multilingual landscape of online communication, the emergence of codemixed language, the seamless integration of multiple languages within a single utterance, has become increasingly prevalent. The Transformer Architecture, a revolutionary development in natural language processing, has significantly facilitated the modeling of such linguistic complexity. However, despite its efficacy, deploying Transformer models to Edge Devices presents challenges. The inherent depth of Transformer models, while enhancing their learning capacity, poses obstacles for deployment on resource-constrained Edge Devices. These devices, characterized by limited computational capabilities, struggle with the computational intensity of deep models, resulting in impractical latency. Consequently, the transformative benefits of code-mixed language processing are hindered when confined to internet-based usage. The current limitation of deploying Transformer models exclusively via the internet restricts their accessibility and utility, especially in scenarios where real-time, low-latency processing is imperative. As technological advancements continue, addressing these deployment challenges and enabling the efficient implementation of Transformer models on Edge Devices could unlock new possibilities for seamless, multilingual communication in diverse settings.en_US
dc.language.isoen_USen_US
dc.publisherIIIT-Delhien_US
dc.subjectMachine Translationen_US
dc.subjectRecurrent Neural Networken_US
dc.subjectLong Short Term Memoryen_US
dc.subjectTransformersen_US
dc.subjectRetention Networken_US
dc.subjectKnowledge Distillationen_US
dc.subjectStudent - Teacher Model Model Pruningen_US
dc.titleLight weight code - mix neural machine translationen_US
dc.typeOtheren_US
Appears in Collections:Year-2023

Files in This Item:
File Description SizeFormat 
BTP_Report - Subhanshu Bansal.pdf
  Restricted Access
360.45 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.