IIIT-Delhi Institutional Repository

Automated discovery of abstracts reporting gene-disease relationship

Show simple item record

dc.contributor.author Sharma, Divya
dc.contributor.author Sengupta, Debarka (Advisor)
dc.date.accessioned 2023-05-27T09:48:15Z
dc.date.available 2023-05-27T09:48:15Z
dc.date.issued 2020-01
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1265
dc.description.abstract Text classification is a construction problem of models which can classify new documents into predefined classes. It is important before text mining that we know what is the most important data that we require for our research. Text mining has become an essential tool for biomedical research. Our project aims to identify the gene-disease relationship using natural language processing techniques and word embeddings. Assignment of high-dimensional vectors (embeddings) to words in a text corpus in a way that preserves their syntactic and semantic relationships is one of the most fundamental techniques in natural language processing (NLP). We present a completely generic model based on statistical word embeddings, which shows the gene similarity and proves the gene-disease relationship using word analogies. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Text classification en_US
dc.subject Text mining en_US
dc.subject biomedical research en_US
dc.subject text corpus en_US
dc.subject natural language processing en_US
dc.title Automated discovery of abstracts reporting gene-disease relationship en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account