Mining YouTube metadata for detecting privacy invading harassment and misdemeanour videos

Aggarwal, Nisha; Sureka, Ashish (Advisor)

dc.contributor.author	Aggarwal, Nisha
dc.contributor.author	Sureka, Ashish (Advisor)
dc.date.accessioned	2014-03-14T11:19:09Z
dc.date.available	2014-03-14T11:19:09Z
dc.date.issued	2014-03-14T11:19:09Z
dc.identifier.uri	https://repository.iiitd.edu.in/jspui/handle/123456789/119
dc.description.abstract	YouTube is one of the most popular and largest video sharing websites (with social networking features) on the Internet. A signi cant percentage of videos uploaded on YouTube contains objectionable content and violates YouTube community guidelines. YouTube contains several copyright violated videos, commercial spam, hate and extremism promoting videos, vulgar and pornographic material and privacy invading content. This is primarily due to the low publication barrier and anonymity. We present an approach to identify privacy invading harassment and misdemeanour videos by mining the video metadata. We divide the problem into sub-problems: vulgar video detection, abuse and violence in public places and ragging video detection in school and colleges. We conduct a characterization study on a training dataset by downloading several videos using YouTube API and manually annotating the dataset. We de ne several discrimina- tory features for recognizing the target class objects. We employ a one-class classi er approach to detect the objectionable video and frame the problem as a recognition problem. Our empirical analysis on test dataset reveals that linguistic features (presence of certain terms and people in the title and description of the main and related videos), popularity based, duration and cate- gory of videos can be used to predict the video type. We validate our hypothesis by conducting a series of experiments on evaluation dataset acquired from YouTube. Empirical results reveal that accuracy of proposed approach is more than 80% demonstrating the e ectiveness of the approach.	en_US
dc.language.iso	en_US	en_US
dc.subject	YouTube	en_US
dc.subject	Privacy	en_US
dc.subject	Invading	en_US
dc.subject	Harassment	en_US
dc.subject	Misdemeanour	en_US
dc.subject	Social Media Analytics	en_US
dc.subject	One-Class Classification	en_US
dc.subject	Vulgar	en_US
dc.subject	Ragging	en_US
dc.subject	Violence	en_US
dc.title	Mining YouTube metadata for detecting privacy invading harassment and misdemeanour videos	en_US
dc.type	Thesis	en_US