Please use this identifier to cite or link to this item: http://repository.iiitd.edu.in/xmlui/handle/123456789/673
Full metadata record
DC FieldValueLanguage
dc.contributor.authorPal, Ambar
dc.contributor.authorSharma, Gaurav (Advisor)
dc.contributor.authorArora, Chetan (Advisor)
dc.date.accessioned2018-09-24T13:30:22Z
dc.date.available2018-09-24T13:30:22Z
dc.date.issued2016-07-18
dc.identifier.urihttp://repository.iiitd.edu.in/xmlui/handle/123456789/673
dc.description.abstractRecent methods on combining textual and visual information using supervised (textual, visual) data have shown encouraging performance. However they are mostly limited to paired (textual, visual) data. We are interested in exploring methods which can leverage large, but independently annotated, datasets of visual and textual data. Applications include image and video captioning and, the induction of novel objects, wherein we try to describe objects that were not seen in the paired annotated data by harnessing knowledge from unpaired data .en_US
dc.language.isoen_USen_US
dc.publisherIIIT-Delhien_US
dc.subjectImage captioningen_US
dc.subjectRecurrent neural netsen_US
dc.subjectSupervised learningen_US
dc.subjectMulti-modal methodsen_US
dc.subjectImage understandingen_US
dc.titleExploiting independent visual and textual data sources to improve multi-modal methods for description and querying of visual dataen_US
dc.typeOtheren_US
Appears in Collections:Year-2018

Files in This Item:
File Description SizeFormat 
Ambar Pal_2014012.pdf
  Restricted Access
2.08 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.