Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1346
Title: Machine learning and deep learning models for solvation energy prediction
Authors: Tejas
Murugan, N Arul (Advisor)
Keywords: MNSolv dataset
Molecular mechanics
ML Models
Issue Date: Aug-2023
Publisher: IIIT-Delhi
Abstract: Drug discovery is divided into 4 phases. The first phase is in-silico processes beginning with target identification and validation, followed by the hit discovery process, assay development, high throughput screening, preparing lead, and lead optimization. Screening through all the hits is experimentally unfeasible in terms of both time and resources. Henceforth in-silico models have been developed to predict the properties such as protein-ligand binding affinity, ligand permeability and so on. Here in this thesis, the property in focus is Solvation Energy which provides us with ligand dissolution energy during the protein-ligand binding process in an aqueous medium. The classical mechanics and quantum mechanics-based deterministic approaches can be employed to predict the solvation energies, but these approaches are computationally very demanding. Machine learning and deep learning methods can be used, which can provide reliable results and can be computationally less demanding. Here ten Graph-based deep learning models have been trained using graph representations in combination with two featurizers which help in featurizing the input molecules. Various unsupervised mechanisms like convolution, attention mechanisms, and supervised mechanism interaction networks are implemented for solvation energy prediction. These algorithms work upon graph representations constructed from input molecules by mapping atoms to vertices and bonds to edges. Apart from these three, machine learning models are also trained on different types of descriptors. Weave and CIGIN models perform best in graph-based deep learning algorithms trained on the FreeSolv dataset. Among different machine learning models, the random forest model is found to work best on both datasets of FreeSolv and MNSol. In this thesis, we establish that reliable machine learning and deep learning models can be developed for predicting the solvation energies
URI: http://repository.iiitd.edu.in/xmlui/handle/123456789/1346
Appears in Collections:Year-2023

Files in This Item:
File Description SizeFormat 
Thesis_Tejas_MT21232.pdf1.5 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.