IIIT-Delhi Institutional Repository

Fingerprinting fine-tuned language models in the wild

Show simple item record

dc.contributor.author Diwan, Nirav
dc.contributor.author Chakraborty, Tanmoy (Advisor)
dc.date.accessioned 2022-03-31T09:41:00Z
dc.date.available 2022-03-31T09:41:00Z
dc.date.issued 2021-05
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/992
dc.description.abstract There are concerns that the ability of language models (LMs) to generate high quality synthetic text can be misused to launch spam, dis-information, or propaganda. Therefore, the re-search community is actively working on detecting whether a given text is organic or synthetic. While this is a useful first step, it is important to be able to further fingerprint the author LM to attribute its origin. Prior work on fingerprinting LMs is limited to attributing synthetic text generated by a handful (usually<10) of pre-trained LMs. However, LMs such as GPT2 are commonly fine-tuned in a myriad of ways (e.g., on a domain-specific text corpus) before being used to generate synthetic text. Thus, it is challenging to finger-printing fine-tuned LMs because the universe of fine-tuned LMs is much larger in realistic scenarios. To address this challenge, we study the problem of large-scale fingerprinting offline-tuned LMs in the wild. Using a real-world dataset of synthetic text generated by 108 different fine-tuned LMs, we conduct comprehensive experiments to demonstrate the limitations of existing fingerprinting approaches. Our results show that fine-tuning itself is most effective in attributing the synthetic text generated by fine-tuned LMs. en_US
dc.language.iso en_US en_US
dc.publisher IIIT- Delhi en_US
dc.subject Fingerprinting en_US
dc.subject language model en_US
dc.subject security en_US
dc.subject pretrained model en_US
dc.title Fingerprinting fine-tuned language models in the wild en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account