Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1585
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSinha, Arya-
dc.contributor.authorSubramanyam, A V (Advisor)-
dc.date.accessioned2024-05-24T05:19:39Z-
dc.date.available2024-05-24T05:19:39Z-
dc.date.issued2023-11-29-
dc.identifier.urihttp://repository.iiitd.edu.in/xmlui/handle/123456789/1585-
dc.description.abstractIn this study, we delve into the realm of attention-based networks, particularly the recent advancements of Vision Transformers (ViT) that outperform conventional Convolutional Neural Networks (CNNs) in numerous vision tasks. However, since ViT has a different architecture than CNN, its behavavior may vary. The differences in robustness between ViTs and CNNs and the underlying reasons for these differences are studied. To investigate the reliability of ViT, this study analyzes the vulnerabilities of ViTs to adversarial samples. To enhance the robustness of ViT, a range of training and modifications in architecture and patch embedding mechanism were explored. A distinctive adversarial sample generation technique tailored for ViT architecture is introduced.en_US
dc.language.isoen_USen_US
dc.publisherIIIT-Delhien_US
dc.subjectAdversarial Robustnessen_US
dc.subjectAdversarial Trainingen_US
dc.subjectComputer Visionen_US
dc.subjectVision Transformeren_US
dc.titleUnderstanding robustness of vision transformersen_US
dc.typeOtheren_US
Appears in Collections:Year-2023

Files in This Item:
File Description SizeFormat 
BTP_Report - Arya Sinha.pdf
  Restricted Access
654.71 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.