IIIT-Delhi Institutional Repository

Newsbag: a benchmark dataset for fake news detection

Show simple item record

dc.contributor.author Jindal, Sarthak
dc.contributor.author Vatsa, Mayank (Advisor)
dc.contributor.author Singh, Richa (Advisor)
dc.date.accessioned 2019-10-09T08:49:28Z
dc.date.available 2019-10-09T08:49:28Z
dc.date.issued 2019-04-15
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/779
dc.description.abstract The spread of fake news poses a serious problem in today’s world where the masses consume and produce news using online platforms. One main reason why fake news detection is hard is the lack of ground truth database for training classification models. In this paper, we present a benchmark dataset for fake news detection. The size of this dataset is an order of magnitude larger as compared to existing datasets for fake news detection. Moreover, we collect our training and testing datasets from different news sources to understand how well deep detection architectures generalize to unseen data. We also present an augmented training dataset generated using a custom data augmentation algorithm. The proposed dataset comprises of two modalities, image, and text; therefore, both unimodal and multimodal (deep learning) models can be trained. We also present the baseline results of single modality and multimodal approaches. We observe that the multimodal approaches yield better results compared to unimodal approaches. We assert that the availability of such large database can instigate research in this arduous research problem. en_US
dc.language.iso en_US en_US
dc.publisher IIITD-Delhi en_US
dc.subject Multimodal Deep learning en_US
dc.subject Convolutional Neural Networks en_US
dc.subject Fake News Detection en_US
dc.title Newsbag: a benchmark dataset for fake news detection en_US
dc.type Other en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository

Advanced Search


My Account