Automated discovery of abstracts reporting gene-disease relationship

dc.contributor.author	Sharma, Divya
dc.contributor.author	Sengupta, Debarka (Advisor)
dc.date.accessioned	2023-05-27T09:48:15Z
dc.date.available	2023-05-27T09:48:15Z
dc.date.issued	2020-01
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/1265
dc.description.abstract	Text classification is a construction problem of models which can classify new documents into predefined classes. It is important before text mining that we know what is the most important data that we require for our research. Text mining has become an essential tool for biomedical research. Our project aims to identify the gene-disease relationship using natural language processing techniques and word embeddings. Assignment of high-dimensional vectors (embeddings) to words in a text corpus in a way that preserves their syntactic and semantic relationships is one of the most fundamental techniques in natural language processing (NLP). We present a completely generic model based on statistical word embeddings, which shows the gene similarity and proves the gene-disease relationship using word analogies.	en_US
dc.language.iso	en_US	en_US
dc.publisher	IIIT-Delhi	en_US
dc.subject	Text classification	en_US
dc.subject	Text mining	en_US
dc.subject	biomedical research	en_US
dc.subject	text corpus	en_US
dc.subject	natural language processing	en_US
dc.title	Automated discovery of abstracts reporting gene-disease relationship	en_US
dc.type	Thesis	en_US