Compilation and prediction of hemolytic peptides using machine learning techniques

S A, Kavin Raj; Raghava, Gajendra Pal Singh (Advisor)

Home
→
Computational Biology
→
MTech Theses
→
Year-2025
→
View Item

dc.contributor.author	S A, Kavin Raj
dc.contributor.author	Raghava, Gajendra Pal Singh (Advisor)
dc.date.accessioned	2026-04-16T07:31:36Z
dc.date.available	2026-04-16T07:31:36Z
dc.date.issued	2025-06
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/1900
dc.description.abstract	This thesis describes the compilation, characterization, and prediction of hemolytic peptides, which are responsible for lysing red blood cells. We present Hemolytik2, a comprehensive repository that significantly updates the 2014 Hemolytik database. This new version contains 13,215 entries (7,800 unique peptides), representing a threefold increase over its predecessor, compiled from scientific literature and other peptide databases. Each entry details information such as peptide sequence, terminal modifications, topology, stereochemistry, red blood cell (RBC) source, peptide origin, hemolytic potency, and structural features (SMILES, secondary/tertiary structures). In addition to data compilation, we characterized the peptides and developed a robust method for predicting hemolytic peptides. Peptide features were computed using the widely adopted Pfeature software. A wide range of machine learning techniques, including LightGBM and Random Forest, have been used to develop classification models for discriminating hemolytic and non-hemolytic peptides. SHapley Additive exPlanations (SHAP)-based feature analysis was then applied to identify and rank important features to understand potential of physicochemical descriptors and amino acids. The insights gained from this prediction and feature analysis will be invaluable for the rational design of optimal, safe hemolytic peptides.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	Hemolytic Peptides	en_US
dc.subject	Database, Peptide Toxicity	en_US
dc.subject	eature Analysis	en_US
dc.subject	Machine Learning	en_US
dc.subject	Therapeutic Peptides	en_US
dc.subject	SMILES	en_US
dc.subject	Peptide Design	en_US
dc.title	Compilation and prediction of hemolytic peptides using machine learning techniques	en_US
dc.type	Thesis	en_US