Please use this identifier to cite or link to this item:
http://repository.iiitd.edu.in/xmlui/handle/123456789/1787Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Siddiqui, Abu Osama | - |
| dc.contributor.author | Subramanyam, A V (Advisor) | - |
| dc.date.accessioned | 2025-12-20T06:40:59Z | - |
| dc.date.available | 2025-12-20T06:40:59Z | - |
| dc.date.issued | 2025-05-21 | - |
| dc.identifier.uri | http://repository.iiitd.edu.in/xmlui/handle/123456789/1787 | - |
| dc.description.abstract | Understanding a video from concise summaries is of great importance for various applications such as browsing, retrieval and assistive technologies. In this work, we present unsupervised summarization of videos. Video summarization is extremely challenging as it is difficult to find concise and semantic frame representations. In order to address this problem, our contributions are twofold. First, we study different convolutional and transformer based architectures which can obtain efficient spatio-temporal representations. Second, we propose an optimal transport method to obtain representative clusters of a video. Experimental results on benchmark datasets such as TVSum and SumMe demonstrate that our approach achieves competitive performance. | en_US |
| dc.language.iso | en_US | en_US |
| dc.publisher | IIIT-Delhi | en_US |
| dc.subject | RNNs/LSTMs Based Approaches | en_US |
| dc.subject | Contrastive Learning with TCN Encoder | en_US |
| dc.title | Optimal transport guided contrastive video summarization | en_US |
| dc.type | Thesis | en_US |
| Appears in Collections: | Year-2025 | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| MT22006_Abu Osama Siddiqui.pdf | 3.55 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.