IIIT-Delhi Institutional Repository

A comprehensive comparison of strainformer transformer model and baseline machine learning models for future vaccine generation for COVID-19

Show simple item record

dc.contributor.author Chilkoti, Mansi
dc.contributor.author Mohta, Shagun
dc.contributor.author Sethi, Tavpritesh (Advisor)
dc.date.accessioned 2024-05-10T14:33:31Z
dc.date.available 2024-05-10T14:33:31Z
dc.date.issued 2023-11-29
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1434
dc.description.abstract This project aims to compare traditional baseline models, specifically the Naive Bayes model and the Long Short-Term Memory (LSTM) model, and advanced transformer models, including Strainformer and Vaxformer. The primary objective is to assess the efficacy of these models in generating new sequences and subsequently evaluate the generated sequences based on their antigenicity score(NetMHCpan), stability(DDGun), and Root Mean Square Deviation (RMSD) from a reference spike protein (AlphaFold). The study design involves training each model on relevant biological sequence datasets, emphasizing the diverse nature of antigenic proteins. Following training, the models will generate novel sequences, and their antigenic properties will be quantified using state-of-the-art scoring systems. The antigenicity score stability will be assessed to determine the consistency of the generated sequences in maintaining desirable antigenic features. Additionally, the generated sequences will be compared to a reference spike protein, and the RMSD metric will be employed to quantify the structural differences between the generated and reference sequences. This analysis aims to provide insights into the structural fidelity of the generated sequences and their potential practical utility in the context of vaccine development for COVID-19 or other diseases. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject LSTM(Long Short-Term Memory) en_US
dc.subject Strain-former en_US
dc.subject Vaxformer en_US
dc.subject NetMHCpan en_US
dc.subject DDGun en_US
dc.subject Alpha-Fold en_US
dc.subject Naive Bayes en_US
dc.title A comprehensive comparison of strainformer transformer model and baseline machine learning models for future vaccine generation for COVID-19 en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account