IIIT-Delhi Institutional Repository

Machine learning regression models for predicting hemolytic concentration (HC50) of peptides using curated activity data

Show simple item record

dc.contributor.author Singh, Ayushi
dc.contributor.author Raghava, Gajendra Pal Singh (Advisor)
dc.date.accessioned 2026-04-17T07:52:42Z
dc.date.available 2026-04-17T07:52:42Z
dc.date.issued 2025-06
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1904
dc.description.abstract In this thesis, we first compiled hemolytic activity data of peptides in terms of hemolytic concentration (HC50), defined as the concentration required to lyse 50% of red blood cells (RBCs). We then developed regression models using machine learning techniques to predict HC50 values, which serve as a key indicator of hemolytic potential. This activity data has been integrated into Hemolytik2 (http://webs.iiitd.edu.in/raghava/hemolytik2/), an updated and enhanced version of the Hemolytik database. Hemolytik2 is a manually curated and systematically organized resource that compiles experimentally validated hemolytic peptides from literature and public repositories, including the Antimicrobial Peptide Database (APD), UniProt, and the Dragon Antimicrobial Peptide Database (DAMPD). Over 5,000 of the 13,215 validated peptides in the database have known HC 50 values. Additionally, 2,569 peptides with experimentally established HC50 values against mammalian RBCs were used to train the regression models. With a Pearson correlation coefficient (R) of 0.660 and a coefficient of determination (R2) of 0.408, the top-performing model demonstrated a decent capacity for prediction. All things considered, Hemolytik2.0 is a useful platform for investigating the hemolytic characteristics of peptides and aids in the creation of computational tools meant to create safer and more efficient peptide-based drugs. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Machine Learning en_US
dc.subject Red Blood Cell en_US
dc.subject Amino Acid Composition en_US
dc.title Machine learning regression models for predicting hemolytic concentration (HC50) of peptides using curated activity data en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account