Optimal transport guided contrastive video summarization

Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1787

Full metadata record

DC Field	Value	Language
dc.contributor.author	Siddiqui, Abu Osama	-
dc.contributor.author	Subramanyam, A V (Advisor)	-
dc.date.accessioned	2025-12-20T06:40:59Z	-
dc.date.available	2025-12-20T06:40:59Z	-
dc.date.issued	2025-05-21	-
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/1787	-
dc.description.abstract	Understanding a video from concise summaries is of great importance for various applications such as browsing, retrieval and assistive technologies. In this work, we present unsupervised summarization of videos. Video summarization is extremely challenging as it is difficult to find concise and semantic frame representations. In order to address this problem, our contributions are twofold. First, we study different convolutional and transformer based architectures which can obtain efficient spatio-temporal representations. Second, we propose an optimal transport method to obtain representative clusters of a video. Experimental results on benchmark datasets such as TVSum and SumMe demonstrate that our approach achieves competitive performance.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	RNNs/LSTMs Based Approaches	en_US
dc.subject	Contrastive Learning with TCN Encoder	en_US
dc.title	Optimal transport guided contrastive video summarization	en_US
dc.type	Thesis	en_US
Appears in Collections:	Year-2025

Files in This Item:

File	Description	Size	Format
MT22006_Abu Osama Siddiqui.pdf		3.55 MB	Adobe PDF	View/Open

DSpace JSPUI