Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/1909
Full metadata record
DC FieldValueLanguage
dc.contributor.authorRiya-
dc.contributor.authorMurugan, N. Arul (Advisor)-
dc.date.accessioned2026-04-17T09:49:02Z-
dc.date.available2026-04-17T09:49:02Z-
dc.date.issued2025-08-
dc.identifier.urihttp://repository.iiitd.edu.in/xmlui/handle/123456789/1909-
dc.description.abstractIn drug discovery, the accurate prediction of pharmacokinetic properties plays a crucial role in identifying compounds with favorable absorption, distribution, metabolism, and excretion profiles. Among these, the partition coefficient (logP), blood–brain barrier permeability (logBB), and aqueous solubility (logS) are central to assessing drug–likeness and bioavailability. Traditional descriptor–based models often rely solely on classical cheminformatics features, which may fail to capture the underlying quantum–mechanical phenomena influencing molecular behavior. This work integrates classical, quantum mechanical, and shape/surface descriptors (Quantum Mechanical Polar surface area(QMPSA)) to develop predictive models for logP, logBB, and logS. Quantum descriptors, including frontier orbital energies and Gibbs free energy difference (∆G), were derived from Density Functional Theory (DFT) calculations, while surface and volumetric properties were computed using the Multiwfn program. These were complemented by classical Mordred descriptors, enabling the evaluation of six descriptor set combinations for LogP and LogS, while three descriptor set combinations were used for LogBB. Models were systematically compared using curated datasets comprising 28,930 molecules for log P , 6,460 for log S, and 7,213 (classification) / 976 (regression) for log BB. For log P , the highest performance was achieved with an ExtraTrees regressor on Classical descriptors, attaining R2 = 0.9481 and RMSE = 0.3847. For log S, Gradient Boosting with Classical descriptors produced R2 = 0.8644 and RMSE = 0.8488. The log BB models demonstrated complementary strengths: a classification model with Random Forest on a hybrid model with Classical descriptors and quantum descriptors achieved an accuracy of 0.9411 and an AUC of 0.9974 on hybrid classical and quantum descriptors, while a regression with ExtraTrees on a hybrid Classical+Quantum descriptor reached R2 = 0.665 and RMSE = 0.403. Together, these results highlight the dominance of Classical cheminformatics descriptors for lipophilicity and aqueous solubility, with hybridization yielding improvements in blood-brain barrier permeability tasks. The results demonstrate that incorporating quantum–mechanical information along- side structural and surface descriptors enhances predictive accuracy and offers a generalizable, theory driven framework for early–stage drug property estimation.en_US
dc.language.isoen_USen_US
dc.publisherIIIT-Delhien_US
dc.subjectQuantum Descriptorsen_US
dc.subjectMachine Learningen_US
dc.titleComparing the performance of quantum descriptors and classical descriptors based ML models for drug like property predictionen_US
dc.typeThesisen_US
Appears in Collections:Year-2025

Files in This Item:
File Description SizeFormat 
MT23235_Riya.pdf2.98 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.