IIIT-Delhi Institutional Repository

Automatic table of content generation for educational videos by leveraging multimodal information

dc.contributor.author Meet, Maheshwari
dc.contributor.author Goyal, Vikram (Advisor)
dc.contributor.author Chakraborty, Tanmoy (Advisor)
dc.date.accessioned 2023-04-05T10:10:59Z
dc.date.available 2023-04-05T10:10:59Z
dc.date.issued 2021-12
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1087
dc.description.abstract Online education platforms host diverse learning content such as videos, audio lectures, and technical articles. A major drawback of video-based learning content is that a learner cannot directly access the part of a video that explains a particular topic. Topical segmentation of videos is therefore essential to enable smart browsing and quick access to the explanation of each topic. To obviate the need for manual topical segmentation, this paper presents a system called EduCIndex that automatically generates a Table of Content for a given video through representation learning that fuses different modalities: Text, Audio, and Video. EduCIndex segments a video and assigns a relevant topic to each segment. To develop the system, we curate a novel dataset by scraping the web, comprising around 1500 hours of educational videos and a table of content for each video. We propose a novel multi-task learning-based approach that combines the tasks of learning the segment boundary and the segment topic using sequential attention over a sequence of 1-minute video clips. Relative to the baselines, our proposed model improves topic name extraction by 49.82% in terms of ROUGE-1 and video segmentation by 15.23% in terms of F1 score. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Multimodal information en_US
dc.subject Educational videos en_US
dc.subject Video based learning en_US
dc.subject EduCIndex en_US
dc.subject Table of content en_US
dc.subject Representation learning en_US
dc.subject Sequential attention en_US
dc.title Automatic table of content generation for educational videos by leveraging multimodal information en_US
dc.type Thesis en_US

