IIIT-Delhi Institutional Repository

Understanding COVID-19 genomic sequences through the lens of strainformer, a transformer model

Show simple item record

dc.contributor.author Chilkoti, Mansi
dc.contributor.author Sethi, Tavpritesh (Advisor)
dc.date.accessioned 2025-06-23T08:00:34Z
dc.date.available 2025-06-23T08:00:34Z
dc.date.issued 2024-04-26
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1753
dc.description.abstract This project aims to unravel the complex genomic dynamics of COVID-19, which are critical for understanding its virulence and developing targeted therapeutic interventions. Our approachfocuses on meticulously analyzing the genomic sequences of the SARS-Cov-2 virus, which wasaccomplished using a transformer model trained on real-world SARS-CoV-2 sequences. The transformer model was trained on approximately 2 million sequences, which generated attention scores for genomic codons. These 2 million COVID-19 genomic sequences were aligned using the MAFFT tool. The DNA sequences were then divided into codons to facilitate mapping between aligned and real-world sequences. This mapping method carefully examined the distribution of attention scores across the sequences’ mutated and non-mutated regions. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject COVID-19 en_US
dc.subject SARS-Cov-2 virus en_US
dc.subject Genomic Sequences en_US
dc.title Understanding COVID-19 genomic sequences through the lens of strainformer, a transformer model en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account