IIIT-Delhi Institutional Repository

Audio visual consistency detection

Show simple item record

dc.contributor.author Pathak, Harsh
dc.contributor.author Govil, Shreedhar
dc.contributor.author Subramanyam, A V (Advisor)
dc.date.accessioned 2024-07-20T06:19:50Z
dc.date.available 2024-07-20T06:19:50Z
dc.date.issued 2019-11-16
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1647
dc.description.abstract With recent advancement in deep learning, it has been made possible to create fake images and videos with near perfect precision. This has led to a growing concern about the possible misuse of this technology in context to fake news. We propose a novel architecture that attempts to learn temporal and structural association between the facial images and the audio of a person.In our model, given a sequence of frames and the corresponding audio, we predict the future frame using an ensemble of LSTM and GAN. Further, we compute the distance between the prediction and ground truth frame to determine whether the given video is real or fake. This give us the advantage to use this method on any fake video regardless of its method of creation and classify it as real or fake. This method is completely unsupervised since we do not require the fake video dataset, only original video of the person is needed. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Deep Learning en_US
dc.subject Image Analysis en_US
dc.subject Machine Learning en_US
dc.subject Forensics en_US
dc.title Audio visual consistency detection en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account