IIIT-Delhi Institutional Repository

RealWorld clinical data synthesis using generative AI

Show simple item record

dc.contributor.author Bhalla, Raghav
dc.contributor.author Panwar, Ritesh
dc.contributor.author Sethi, Tavpritesh (Advisor)
dc.date.accessioned 2023-04-16T05:47:24Z
dc.date.available 2023-04-16T05:47:24Z
dc.date.issued 2022-12
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1201
dc.description.abstract Healthcare datasets are not easily available to researchers and innovators due to patients’ privacy and/or government regulations. We propose to create a novel pipeline for bespoke synthetic datasets matching original clinical datasets that very well represent the real-world data but a specific sample in the data cannot be traced back to a patient. In this project, we will work on publicly available clinical datasets because clinical data in healthcare are important for PCOR i.e Patient Centered Outcomes Research which focuses on effective prevention and treatment measures of an individual. So the hospitals, as well as clinical data researchers and innovators, need large sample sizes and low-cost clinical datasets to work on. Here comes our prototype, which provides several synthetic clinical datasets of the publicly available datasets created with different machine learning algorithms. The prototype will contain a thorough analysis of the different synthetic datasets with respect to various metrics like classification accuracy, precision, recall, and F1 score so that any researcher who needs to analyze the datasets has suffcient information to develop their theory or research. We can help the hospitals by providing them with an analysis of how much augmentation is needed for a particular task. This will be done by checking the efficacy and evaluation of various algorithms used to create synthetic clinical data. We expect to attain patterns through our work, indicating a best performing synthetic data augmentation model choice based on specific characteristics of clinical data like column/feature types, sample count, and more. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Classification Analysis en_US
dc.subject Generative Machine Learning en_US
dc.subject Clinical Data Synthesis en_US
dc.subject Healthcare en_US
dc.title RealWorld clinical data synthesis using generative AI en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account