Please use this identifier to cite or link to this item:
http://repository.iiitd.edu.in/xmlui/handle/123456789/673| Title: | Exploiting independent visual and textual data sources to improve multi-modal methods for description and querying of visual data |
| Authors: | Pal, Ambar Sharma, Gaurav (Advisor) Arora, Chetan (Advisor) |
| Keywords: | Image captioning Recurrent neural nets Supervised learning Multi-modal methods Image understanding |
| Issue Date: | 18-Jul-2016 |
| Publisher: | IIIT-Delhi |
| Abstract: | Recent methods on combining textual and visual information using supervised (textual, visual) data have shown encouraging performance. However they are mostly limited to paired (textual, visual) data. We are interested in exploring methods which can leverage large, but independently annotated, datasets of visual and textual data. Applications include image and video captioning and, the induction of novel objects, wherein we try to describe objects that were not seen in the paired annotated data by harnessing knowledge from unpaired data . |
| URI: | http://repository.iiitd.edu.in/xmlui/handle/123456789/673 |
| Appears in Collections: | Year-2018 |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Ambar Pal_2014012.pdf Restricted Access | 2.08 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.