IIIT-Delhi Institutional Repository

Robust OCR for information extraction from retail flyers

Show simple item record

dc.contributor.author Singh, Shashank Shekhar
dc.contributor.author Singh, Abhijeet
dc.contributor.author Shah, Rajiv Ratn (Advisor)
dc.date.accessioned 2024-05-13T11:50:20Z
dc.date.available 2024-05-13T11:50:20Z
dc.date.issued 2023-11-29
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1452
dc.description.abstract This report explores the augmentation of the RanLayNet research paper by addressing limitations in existing datasets for domain adaptation. Through a comprehensive analysis of Ran- LayNet, PubLayNet, and DocLayNet papers, deficiencies in dataset suitability for domain adaptation were identified. The study focused on leveraging a YOLOv8 model, fine-tuned using a subset of the PubLayNet dataset (1 of 13), and subsequently applied inferencing techniques on the multifaceted DocLayNet dataset, spanning Government Tenders, Laws and Regulations, Manuals, and Patents domains. This approach aimed to bridge gaps in dataset applicability and enhance document layout analysis for diverse domains. Results and implications of this methodology in the context of domain adaptation within document analysis are presented and discussed. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject YOLOv8 en_US
dc.subject PubLayNet en_US
dc.subject RanLayNet en_US
dc.subject DocLayNet en_US
dc.subject Domain Adaptation en_US
dc.subject Document Layout Detection, en_US
dc.subject Object detection & Instance Segmentation en_US
dc.subject OCR en_US
dc.subject Fine-Tuned Models en_US
dc.title Robust OCR for information extraction from retail flyers en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account