Turkish OCR on Mobile and Scanned Document Images

Karasu, Kurtulus; Bastan, Muhammet

dc.contributor.author	Karasu, Kurtulus
dc.contributor.author	Bastan, Muhammet
dc.date.accessioned	2025-10-24T18:10:20Z
dc.date.available	2025-10-24T18:10:20Z
dc.date.issued	2015
dc.department	Malatya Turgut Özal Üniversitesi
dc.description	23rd Signal Processing and Communications Applications Conference (SIU) -- MAY 16-19, 2015 -- Inonu Univ, Malatya, TURKEY
dc.description.abstract	Optical character recognition (OCR) systems are widely used to convert documents into digital form. Many commercial and open source OCR systems are available, but no benchmark exists for Turkish OCR. In this work, we first prepared two publicly available datasets for Turkish OCR, one of scanned document images and one of document images captured with mobile cameras. We then evaluated the Turkish OCR performance of three popular open source OCR systems (Tesseract, CuneiForm, GOCR) on these datasets. Tesseract outperformed the other two on both datasets.
dc.description.sponsorship	Dept Comp Engn & Elect & Elect Engn,Elect & Elect Engn,Bilkent Univ
dc.identifier.endpage	2077
dc.identifier.isbn	978-1-4673-7386-9
dc.identifier.issn	2165-0608
dc.identifier.startpage	2074
dc.identifier.uri	https://hdl.handle.net/20.500.12899/4096
dc.identifier.wos	WOS:000380500900499
dc.identifier.wosquality	N/A
dc.indekslendigikaynak	Web of Science
dc.language.iso	tr
dc.publisher	IEEE
dc.relation.ispartof	2015 23rd Signal Processing and Communications Applications Conference (SIU)
dc.relation.publicationcategory	Conference Item - International - Institutional Faculty Member
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_20251023
dc.subject	Turkish OCR; mobile device; scanner; dataset; benchmark; Tesseract
dc.title	Turkish OCR on Mobile and Scanned Document Images
dc.type	Conference Object

Collection

WoS Indexed Publications Collection
