Optical character recognition tool

Mithal, Aditi; Kumaraguru, Ponnurangam (Advisor)

dc.contributor.author	Mithal, Aditi
dc.contributor.author	Kumaraguru, Ponnurangam (Advisor)
dc.date.accessioned	2017-11-13T06:40:57Z
dc.date.available	2017-11-13T06:40:57Z
dc.date.issued	2017-04-18
dc.identifier.uri	http://repository.iiitd.edu.in/xmlui/handle/123456789/556
dc.description.abstract	A tremendous amount of impact is generated through the images on social media as they account for more than 60% of the content available online. Understanding the textual content of the image is therefore significant for making constructive inferences. Significant number of optical character recognition (OCR) tools exist - tesseract, Google vision API, Microsoft Cognitive services, ocropy for conducting research and extracting text from images. However, some of these tools are expensive and paid while others give less accurate results on memes and user generated OSM content. This report focuses on the methodology adopted for developing an OCR tool just for this purpose. This report will discuss two mainstream methods adopted for text recognition – tweaking the tesseract pipeline for improving the existing results and using a single shot multibox detector for segmenting the text regions and training it on the synthetically generated annotated data. The results have been compared using multiple string matching metrics including jaccard similarity, jaro winkler etc.	en_US
dc.language.iso	en_US	en_US
dc.title	Optical character recognition tool	en_US
dc.type	Other	en_US