Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1787
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSiddiqui, Abu Osama-
dc.contributor.authorSubramanyam, A V (Advisor)-
dc.date.accessioned2025-12-20T06:40:59Z-
dc.date.available2025-12-20T06:40:59Z-
dc.date.issued2025-05-21-
dc.identifier.urihttp://repository.iiitd.edu.in/xmlui/handle/123456789/1787-
dc.description.abstractUnderstanding a video from concise summaries is of great importance for various applications such as browsing, retrieval and assistive technologies. In this work, we present unsupervised summarization of videos. Video summarization is extremely challenging as it is difficult to find concise and semantic frame representations. In order to address this problem, our contributions are twofold. First, we study different convolutional and transformer based architectures which can obtain efficient spatio-temporal representations. Second, we propose an optimal transport method to obtain representative clusters of a video. Experimental results on benchmark datasets such as TVSum and SumMe demonstrate that our approach achieves competitive performance.en_US
dc.language.isoen_USen_US
dc.publisherIIIT-Delhien_US
dc.subjectRNNs/LSTMs Based Approachesen_US
dc.subjectContrastive Learning with TCN Encoderen_US
dc.titleOptimal transport guided contrastive video summarizationen_US
dc.typeThesisen_US
Appears in Collections:Year-2025

Files in This Item:
File Description SizeFormat 
MT22006_Abu Osama Siddiqui.pdf3.55 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.