IIIT-Delhi Institutional Repository

Implicit and explicit hate speech detection

Show simple item record

dc.contributor.author Aggarwal, Yash
dc.contributor.author Chakraborty, Tanmoy (Advisor)
dc.contributor.author Akhtar, Md. Shad (Advisor)
dc.date.accessioned 2023-04-14T14:42:19Z
dc.date.available 2023-04-14T14:42:19Z
dc.date.issued 2022-05
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1163
dc.description.abstract With an increasing amount of hate on online social media platforms, automatic detection of toxic language plays a protecting role for online users and content moderators. Hence, it be- comes important to ensure that these models are safe and are unbiased against minority groups based on their gender, religion, caste, etc. For mitigating the bias in hate speech detection tasks, data augmentation is not a complete solution as it is not desirable to equalize the data based on the presence of Social group Tokens in the dataset. This is because of the important role that they play in the contextualization of derogatory remarks to a specific group. In this thesis, I approach the problem of bias removal in hate speech models through robustness using counterfactual generation. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Counterfactual Generation en_US
dc.subject Robustness en_US
dc.subject Mitigating Bias en_US
dc.subject Hate Speech en_US
dc.title Implicit and explicit hate speech detection en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account