Developing machine learning and deep learning models for predicting the folding rate of proteins and peptides

Tyagi, Reegina; Murugan, N Arul (Advisor)

Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1660

Full metadata record

DC Field	Value	Language
dc.contributor.author	Tyagi, Reegina
dc.contributor.author	Murugan, N Arul (Advisor)
dc.date.accessioned	2024-09-14T09:37:08Z
dc.date.available	2024-09-14T09:37:08Z
dc.date.issued	2024-05-21
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/1660
dc.description.abstract	In order to comprehend the functionality and stability of proteins and peptides, it is essential to forecast their folding rates. This thesis explores the construction of sophisticated machine learning (ML) and deep learning (DL) models for this purpose. The study employs a comprehensive computational methodology that integrates a wide range of bioinformatics instruments to effectively navigate the intricacies of protein folding dynamics. Using Pfeature, a programme created to extract a wide variety of features from protein sequences and greatly improve the input data quality for machine learning models, is the fundamental step in the feature engineering process. Additionally, to represent protein structures as networks and enable a more in-depth examination of the connections between residues that influence folding kinetics, the study makes use of Graph Signal Processing (GSP) techniques. Amber23 facilitates molecular dynamics (MD) simulations, which are essential to the study since they model the atomic movements within proteins under varied settings and offer dynamic insights into protein behaviour. Understanding the energetic and structural alterations that take place during the folding process is made possible by this method, which also enriches the dataset with crucial parameters for precise model training. The thesis uses a range of machine learning models, including advanced regressors, to interpret the intricate datasets that are produced. These models are able to capture the nuanced parameters that control protein folding rates because they are trained on features generated from both sequence data and MD simulation results. The incorporation of many data sources and analytical methods guarantees that the models created not only accurately forecast folding rates but also help to understand theory of protein Biophysics This work considerably increases the predictive capacities in protein research by fusing data science, machine learning, and computational biology. It provides fresh insights into one of the most intricate biological processes and may find use in genetic and medication design research.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	Data Collection	en_US
dc.subject	Data Preparation	en_US
dc.subject	Computational Requirements	en_US
dc.title	Developing machine learning and deep learning models for predicting the folding rate of proteins and peptides	en_US
dc.type	Thesis	en_US
Appears in Collections:	Year-2024

Files in This Item:

File	Description	Size	Format
Reegina_Tyagi(MT21305).pdf		3.59 MB	Adobe PDF	View/Open

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets