Implicit and explicit hate speech detection

Aggarwal, Yash; Chakraborty, Tanmoy (Advisor); Akhtar, Md. Shad (Advisor)

dc.contributor.author	Aggarwal, Yash
dc.contributor.author	Chakraborty, Tanmoy (Advisor)
dc.contributor.author	Akhtar, Md. Shad (Advisor)
dc.date.accessioned	2023-04-14T14:42:19Z
dc.date.available	2023-04-14T14:42:19Z
dc.date.issued	2022-05
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/1163
dc.description.abstract	With an increasing amount of hate on online social media platforms, automatic detection of toxic language plays a protecting role for online users and content moderators. Hence, it be- comes important to ensure that these models are safe and are unbiased against minority groups based on their gender, religion, caste, etc. For mitigating the bias in hate speech detection tasks, data augmentation is not a complete solution as it is not desirable to equalize the data based on the presence of Social group Tokens in the dataset. This is because of the important role that they play in the contextualization of derogatory remarks to a specific group. In this thesis, I approach the problem of bias removal in hate speech models through robustness using counterfactual generation.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	Counterfactual Generation	en_US
dc.subject	Robustness	en_US
dc.subject	Mitigating Bias	en_US
dc.subject	Hate Speech	en_US
dc.title	Implicit and explicit hate speech detection	en_US