IIIT-Delhi Institutional Repository

Mobility Assistant for the Visually Impaired

Show simple item record

dc.contributor.author Verma, Arsh
dc.contributor.author Gupta, Anubha (Advisor)
dc.contributor.author Arora, Chetan (Advisor)
dc.contributor.author Balakrishnan, M. (Advisor)
dc.date.accessioned 2022-03-30T10:15:46Z
dc.date.available 2022-03-30T10:15:46Z
dc.date.issued 2021-05
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/976
dc.description.abstract Scene Text Recognition (STR) refers to the task of recognition of text in natural scenes. The success of OCR models is hard to achieve on natural scene images due to a variety of challenges, including - variation in orientation and pixel intensities in images, low resolution and errors in bounding box detection, as well as variation in fonts and shapes of print of characters. Our main objective is to obtain a model that achieves near State of the Art performance out custom MAVI dataset, which will allow it to be used in the real world application of assisting a visually impaired person to read signboards in order to obtain directions. We provide an end-to-end detection and recognition system for the same. Problems arise when the distribution of data seen during test time differs from the training data. The model cannot make reliable predictions in such a scenario. We perform experiments to demonstrate how the model performance drops due to a shift in domain. en_US
dc.language.iso en_US en_US
dc.publisher IIIT- Delhi en_US
dc.subject Computer Vision en_US
dc.subject Deep Learning en_US
dc.subject Text Recognition en_US
dc.subject Domain Shift en_US
dc.subject Calibration en_US
dc.title Mobility Assistant for the Visually Impaired en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account