Automatic table of content generation for educational videos by leveraging multimodal information

Meet, Maheshwari; Goyal, Vikram (Advisor); Chakraborty, Tanmoy (Advisor)

Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1087

Title:	Automatic table of content generation for educational videos by leveraging multimodal information
Authors:	Meet, Maheshwari Goyal, Vikram (Advisor) Chakraborty, Tanmoy (Advisor)
Keywords:	Multimodal information Educational videos Video based learning EduCIndex Table of content Representation learning Sequential attention
Issue Date:	Dec-2021
Publisher:	IIIT-Delhi
Abstract:	Online education platforms have diverse learning content like videos, audio lectures, and technical articles. The major drawback of video-based learning content is the inability to directly access the content of interest that describes a particular topic. To enable smart browsing abilities in the video for quick access to an explanation of topics, it is essential for topical segmentation of videos. To obviate the need for manual topical segmentation of the video, this paper presents a system called EduCIndex that can automatically generate a Table of Content for a given video through representation learning by fusing different modalities like Text, Audio, and Video. EduCIndex performs segmentation for a video and assigns a relevant topic to each segment. To develop the system, we curate a novel dataset with around 1500 hrs of educational videos and a table of content for each video by scraping the web. We propose a novel multi-task learning-based approach that combines the tasks of learning the segment boundary and segment topic using sequential attention over a sequence of 1-minute video clips. Our proposed model provides 49.82% and 15.23% relative improvement in the topic name extraction and segmentation of the videos over the baselines, respectively, in terms of ROUGE-1 and F1 score.
URI:	http://repository.iiitd.edu.in/xmlui/handle/123456789/1087
Appears in Collections:	Year-2021

Files in This Item:

File	Description	Size	Format
Meet Maheshwari MT20012.pdf		2.19 MB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets