Turkish OCR on Mobile and Scanned Document Images
 Küçük Resim Yok 
Tarih
2015
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Ieee
Erişim Hakkı
info:eu-repo/semantics/closedAccess
Özet
Optical character recognition (OCR) systems have been widely used to convert documents into digital form. There are lots of both commercial and open source OCR systems available, but a benchmark on Turkish OCR is nonexistent. In this work, we first prepared two publicly available datasets for Turkish OCR, consisting of scanned document images and mobile camera captured document images. Then, we evaluated the Turkish OCR performance of three popular open source OCR systems (Tesseract, CuneiForm, GOCR) on the datasets. Tesseract outperformed the other two on both datasets.
Açıklama
23nd Signal Processing and Communications Applications Conference (SIU) -- MAY 16-19, 2015 -- Inonu Univ, Malatya, TURKEY
Anahtar Kelimeler
Turkish OCR; mobile device; scanner; dataset; benchmark; Tesseract
Kaynak
2015 23rd Signal Processing And Communications Applications Conference (Siu)
WoS Q Değeri
N/A












